Following the announcement at the AI Summer School on August 28, we’re delighted to share with you the upcoming agenda of the Knowledge Program, available to all employees of BNP Paribas Group and its subsidiaries.
You are welcome to pre-register for any of the events on that dedicated page:
Online talk - Modeling - on Sept. 24th, 2025 - 3:00 PM UTC San Francisco 8:00 AM - New York 11:00 AM - Paris 5:00 PM - Delhi 8:30 PM
"Flexible Workflows with Metadata Routing API"
by Stefanie Senger, Open Source Software Engineer, PhD.
This session will explore how metadata—such as sample weights or grouping information—can now be seamlessly passed through complex machine learning pipelines in scikit-learn. You’ll gain practical insights into how this powerful feature improves flexibility and fairness in model development.
One-hour thematic online conversation in English We use Livestorm webinar platform. For all Data Science and Machine Learning Practitioners
Masterclass on October 8th, 2025 - 2:00pm-8:00pm CET Montparnasse Tower, Paris, France
"Time Series Forecasting"
By the scikit-learn core developers Olivier Grisel and Guillaume Lemaitre.
For experienced data scientists and ML Engineers. This hands-on workshop will use both real world production and environmental data to illustrate how to deal with common data-quality problems. We will also use some specially crafted synthetic data to emphasize catastrophic pitfalls related to edge cases to gain deeper insights about our modeling choices and their limitations.
Comprehensive information about the Learning Program is available on the following page https://scikit-learn.probabl.ai/masterclass-registration (please do not register from that page, all employees of BNP Paribas Group and its subsidiaries must use the dedicated Pre-registration form)
Online talk - Modeling - on October 22nd, 2025 - 3:00 PM UTC San Francisco 8:00 AM - New York 11:00 AM - Paris 5:00 PM - Delhi 8:30 PM
" Skrub: machine learning for dataframes"
by Guillaume Lemaitre, Chief ML Officer, PhD.
Machine-learning algorithms expect a numeric array with one row per observation. Typically, creating this table requires "wrangling" with Pandas or Polars (aggregations, selections, joins, ...), and to extract numeric features from structured data types such as datetimes. These transformations must be applied consistently when making predictions for unseen inputs, and choices must be informed by performance measured on a validation dataset, while preventing data leakage. This preprocessing is the most difficult and time-consuming part of many data-science projects.
One-hour thematic online conversation in English We use Livestorm webinar platform. For experienced data scientists and ML Engineers.
Other events still To Be Confirmed
Exclusive Masterclass on November 25th
"Bridging data engineering to machine learning with skrub"
by Gaël Varoquaux, Co-Founder of scikit-learn
Online talk - Recommended Practices on December 10th - 3:00 PM UTC
"Practical strategies for assessing and mitigating harm in AI systems-Exploring fairlearn."
by Adrin Jalali, VP Labs & core developer of scikit-learn.