Online talk
For all Data Science and Machine Learning Practitioners
«Skrub: machine learning for dataframes»
by Guillaume Lemaitre, Chief ML Officer, PhD.
Oct.22nd,2025- 3:00pm UTC
San Francisco 8:00 AM - New York 11:00 AM
Paris 5:00 PM - Delhi 8:30 PM

Overview:
Machine-learning algorithms expect a numeric array with one row per observation. Typically, creating this table requires "wrangling" with Pandas or Polars (aggregations, selections, joins, ...), and to extract numeric features from structured data types such as datetimes. These transformations must be applied consistently when making predictions for unseen inputs, and choices must be informed by performance measured on a validation dataset, while preventing data leakage. This preprocessing is the most difficult and time-consuming part of many data-science projects.
About:
One-hour thematic conversations on data science and engineering topics. An expert speaker presents a problem / topic and his vision of best practices, methodology, tools, etc. for 20 to 30 minutes. The rest of the session is devoted to questions and discussions around the presentation. A knowledge sharing & networking space. All conversations are in English. 6 sessions per year for 3 time zones (US, Europe, Asia)
These talks are made possible thanks to following sponsors


