Hey
About Me 😄
- Software engineer on data tools / data products at
Lyft since 2018 - Apache Airflow PMC member and committer
- Maintainer on
Amundsen
Get in Touch 📫
Amundsen: github.com/amundsen-io/amundsen
🐦 Twitter: @photoft45
Slack: @amundsen / Tao Feng👔 LinkedIn: @tao-f-17195814
Talks & Writings 💬 📝
Conference & Meetup Presentations
- Airflow Summit 2020 invited key note (slide)
- Airflow @ Lyft @ SF Big Analytics Meetup April 2019
- Amundsen: A Data Discovery Platform from Lyft @ Data Council SF April 2019
- Disrupting Data Discovery @ Strata SF 2019
Engineering Blogs
- Open Sourcing Amundsen: A Data Discovery And Metadata Platform @ Lyft engineering blog 2019
- Securing Apache Airflow UI With DAG Level Access @ Lyft engineering blog 2019
- Running Apache Airflow At Lyft @ Lyft engineering blog 2018
- Common Issue Detection for CPU Profiling @ Linkedin engineering blog 2017
- ODP: An Infrastructure for On-Demand Service Profiling @ Linkedin engineering blog 2017
- Benchmarking Apache Samza: 1.2 million message per sec on a single node @ Linkedin engineering blog 2015
Conference Papers
- ODP: An Infrastructure for On-Demand Service Profiling @ IEEE ICPE 2018
- Effective Multi-stream Joining for Enhancing Data Quality in Apache Samza Framework @ IEEE Bigdata Congress 2016
- A Memory Capacity Model for High Performing Data-filtering Applications in Samza Framework @ IEEE Big Data 2015
Podcasts
- Interview with Software Engineering Daily on Data Discovery at Lyft
- Interview with Data Engineering Podcast on Amundsen

