Running Apache Airflow At Lyft

1 · Lyft · Dec. 20, 2018, 7:42 p.m.
By Tao Feng, Andrew Stahlman, and Junda Yang

ETL (extract, transform, load) is a process that extracts data from various raw events, transforms it for analysis, and loads the derived data into a queryable data store. Data engineers and scientists at Lyft build various ETL pipelines, which run on different schedules, to gain insight into topics ranging from the current ridesharing market to the driver and passenger experience. A reliable, efficient, and trustworthy workflow management system is crucial to make sure these...