Data Pipelines at Zymergen with Airflow with Erin Shellman - TWiML Talk #41
Published: Aug 5, 2017 00:00
Practical AI
Analysis
This article summarizes a podcast interview with Erin Shellman, a data science manager at Zymergen, a company that uses robots and machine learning to engineer microbes. The interview focuses on how her team uses Apache Airflow to build reliable, repeatable data pipelines for its machine learning applications. The article also notes that the recording contains some background noise.
Key Takeaways
- Zymergen uses Apache Airflow for data pipeline management.
- The interview discusses the application of Airflow in a bioengineering context.
- The podcast provides insights into real-world data science practices.
Reference
“Our conversation focuses on Zymergen’s use of Apache Airflow, an open-source data management platform originating at Airbnb, that Erin and her team uses to create reliable, repeatable data pipelines for its machine learning applications.”