Projects
I work on data processing and AI. I am an Apache Beam PMC member, member of Apache Software Foundation, and Engineering Director for Google Cloud Dataflow, leading a team that develops open-source data processing systems. I am passionate about building user-friendly, efficient, and reliable frameworks for Machine Learning.
Interested in having me speak at your next event? Or write a guest post for your publication? I would be happy to participate in your next event and engage with your audience! Let’s discuss!
Articles
You can read some of my blog posts:
- Leading open-source teams in large organizations (LeadDev) (Archive) - December 2023
- Contributor Spotlight: Johanna Öjeling - November 2023
- Machine learning patterns with Apache Beam and the Dataflow Runner, part I - December 2020
- Towards ML Engineering: A Brief History Of TensorFlow Extended (TFX) - September 2020
- Apache Beam 2.11.0 - March 2019
- Apache Beam 2.8.0 - October 2018
- Dataflow Stream Processing now supports Python - June 2018
- Announcing general availability of Google Cloud Dataflow for Python - March 2017
- Python SDK released in Apache Beam 0.6.0 - March 2017
Talks
You can watch some of my past talks:
- Open Source Syposium - Strategies from Apache Beam and Kubernetes (DevRelConf 2024 at Google, with Tim Hockin) - September 2024
- Beam Up Your GenAI Usage: Usability, Efficiency, Reliability with Apache Beam (Data @Scale 2024) Video, Q&A, Q&A Video, Slides - May 2024
- Advanced patterns for Windows and Triggers - (Beam College 2021) - April 2021
- Distributed Processing for Machine Learning Production Pipelines (Flink Forward 2020 Talk) - April 2020
- TensorFlow Extended: An End-to-End Machine Learning Platform for TensorFlow Slides (Archive) (Spark+AI 2019 Talk) - April 2019
- Cloud OnAir: Stream Processing with Cloud Dataflow: SDKs & Architectures (Webinar) - August 2018
- Learn About Stream & Batch Processing with Apache Beam (Meetup) - November 2017
Work
I cannot publicly discuss the majority of the projects I work on. You can see my GitHub page for those I could share publicly.
- Previously, I worked on a new operating system codenamed Midori at Microsoft. (Joe Duffy published a series of articles about Midori.)
- Before that, I wrote mobile standards, frameworks, and applications focusing on multimedia at SanDisk. I worked on projects such as the IEEE standard on media delivery (HQME) and one of the more successful premium music offerings of the time (Muve Music).
Occasionally, I work on small projects in my free time:
- TextClue! is a text analysis tool that analyzes free from user feedback. - April 2020