PyData Seattle 2023

Panel: The living nature of data: exploring the Lifecycle and Management of Data at Scale
04-28, 15:00–16:30 (America/Los_Angeles), Baker

As we continue to witness the exponential growth of data generation, especially with the proliferation of IoT devices, widespread deploy of LLMs, and synthetic data, it is essential to understand the dynamic nature of data and its lifecycle. This panel will delve into the living nature of data, exploring its various stages, from creation to effective processing, augmentation, and beyond: we will discuss tools, experiences and trends to look out for in 2023.


As we continue to witness the exponential growth of data generation, especially with the proliferation of IoT devices, widespread deploy of LLMs, and synthetic data, it is essential to understand the dynamic nature of data and its lifecycle. This panel will delve into the living nature of data, exploring its various stages, from creation to effective processing, augmentation, and beyond: we will discuss tools, experiences and trends to look out for in 2023.


Prior Knowledge Expected

No previous knowledge expected

Alan Descoins is an AI and technology leader with over thirteen years of professional services experience working for companies worldwide, mainly focused in the US and Silicon Valley.

Establishing technological strategies for a variety of clients, Alan is an expert at leading teams that solve business problems by applying machine learning techniques. Encouraging and working closely with his teams in Tryolabs, he is constantly in contact with high-profile leaders in different industries, helping them first discover opportunities for the use of AI that can positively impact their business and then effectively execute on those. The outcomes of his collaborations often result in significant cost savings through automation or an increase in revenue attributed to more intelligent decisions powered by data.

Alan has hands-on experience in building machine learning and deep learning models in the areas of natural language processing, computer vision, and predictive analytics. He has worked on a wide range of problems such as process automation, product recommendations, price optimization, predictive maintenance, and video analytics.

Additionally, Alan has been a speaker in multiple talks, keynotes, and workshops on AI-related topics across the world, and holds a BSc in Computer Science from the Universidad de la República (Uruguay).

Fabiana Clemente is the co-founder and CDO of YData, combining Data Understanding, Causality, and Privacy as her main fields of work and research, with the mission to make data actionable for organizations.

Passionate for data, Fabiana has vast experience leading data science teams in startups and multinational companies.

Host of “When Machine Learning meets privacy” podcast and a guest speaker at Datacast and Privacy Please, the previous WebSummit speaker, was recently awarded “Founder of the Year” by the South Europe Startup Awards.

This speaker also appears in:

Yucheng Low is the co-founder & CEO of XetHub. He is no stranger to the PyData community, having last presented at PyData 2015 Denver. He was the co-founder and Chief Architect at GraphLab, where he built the SFrame - the 1st out-of-core dataframe for Python, scaling to trillion cell dataframes on a laptop. GraphLab, which was renamed to Dato and Turi, was acquired by Apple in 2016. At Apple, after open-sourcing Turi Create, Yucheng worked on many parts of the ML platform stack ranging from storage to inference. In 2021 he left Apple and together with a couple colleagues, started XetHub: a data / model management service combining the best parts of S3 & Git. He has a PhD in Machine Learning from CMU where he worked on distributed ML.

David is CEO of Expanso and co-director of Bacalhau, the distributed computing framework that is changing the way people interact with data and machine learning models.

Previously, he led Open Source Machine Learning Strategy at Azure, product management for Kubernetes on behalf of Google, launched Google Kubernetes Engine, and co-founded the Kubeflow project and the SAME project. He has also worked at Amazon, Chef and co-founded three startups.

When not spending too much time in service of electrons, he can be found on a mountain (on skis), traveling the world (via restaurants) or participating in kid activities, of which there are a lot more than he remembers than when he was that age.