PyData Seattle 2023

Tom Drabas

Tom is a Field Engineer/Solutions Architect with Voltron Data. He has almost 20 years of experience working with data across multiple industries, ranging from airlines, through finance and banking, to high tech. Tom holds a PhD in Operations Research from UNSW. He has extensive experience presenting at international conferences (KDD, PyData Seattle, GTC). Tom is an author of three books and a video series on data analytics and data engines, and has authored multiple blog posts and webinars on GPU applications for big data. While working at Microsoft, he also received a patent for a solution that discovers patterns in extremely high-dimensional datasets. At Voltron Data, Tom builds bespoke solutions to intricate customer problems, leveraging the capabilities of the Apache Arrow data ecosystem.

Sessions

04-28
15:00
45min
From prototype to deployment: Increase productivity and simplify data operations in Python
Tom Drabas

Designing ML pipelines is a complex process that involves numerous changes along the way from prototype to deployment. It frequently involves iterating over multiple models at a smaller scale and then converting those models to run at scale. In this talk we will discuss the inefficiencies of this process and present a modern open-source solution that helps mitigate many of them. The proposed tools and approaches help data scientists, data engineers, and machine learning engineers work more efficiently across the full range of tasks and reduce time-to-solution. We will also present future development plans.

Hood