PyData Seattle 2023

Keynote: Distributed Computing 4 Kids -- with Spark (and guest appearances from Ray and Dask)
04-27, 13:30–14:15 (America/Los_Angeles), Kodiak Theatre

Distributed Computing is a lot of fun, so why don't we share it with our kids? Are you tired of kind of "hand waving" explanations of what you've been doing at work? In this talk we'll explore how to teach children about distributed computing (mostly data parallel) along with a little bit of Spark. We'll then talk about how we'll expand to teaching concepts like "actors" and "non-data-parallelism" to children. You don't need to have kids to enjoy this talk!

Come for the gnome filled slides, stay for the thinking about how to explain your work to people outside of your field.


Distributed Computing is a lot of fun, so why don't we share it with our kids? Are you tired of kind of "hand waving" explanations of what you've been doing at work? In this talk we'll explore how to teach children about distributed computing (mostly data parallel) along with a little bit of Spark. We'll then talk about how we'll expand to teaching concepts like "actors" and "non-data-parallelism" to children. You don't need to have kids to enjoy this talk!

Come for the gnome filled slides, stay for the thinking about how to explain your work to people outside of your field.


Prior Knowledge Expected

No previous knowledge expected

Holden is a transgender Canadian open source developer with a focus on Apache Spark, Airflow, Kubeflow, Ray, Dask and related “big data“ tools. She is the co-author of Learning Spark, High Performance Spark, Scaling Python with Ray, and Kubeflow for Machine Learning. She is a committer and PMC on Apache Spark. She was tricked into the world of big data while trying to improve search and recommendation systems and has long since forgotten her original goal. She has worked at Amazon, Apple, and Google and is now working at Netflix.