The scikit-learn® Strategic Partnership Programs
Upcoming events
Sept. 17th, 2025 - 3:00 PM UTCTechnical Committee
«Sponsors' annual committee»
Upon invitation from Probabl
Half-day meeting composed of representatives of Probabl Open Source team / Core developers and maintainers of the project, as well as representatives of scikit-learn sponsors.
- Review the work achieved since previous committee
- Analyse the feedback of the community
- Review actual roadmap
- Request attendees' feedback and discuss expectations & opportunities
- Elaborate a strategic technical roadmap for Probabl Open Source team
- Networking / cocktail session
Sept. 24th, 2025 - 3:00 PM UTCOnline Talk
Modeling
«Flexible Workflows with Metadata Routing API»
by Stefanie Senger, Open Source Software Engineer, PhD.
For all Data Science and Machine Learning Practitioners
This session will explore how metadata—such as sample weights or grouping information—can now be seamlessly passed through complex machine learning pipelines in scikit-learn. You’ll gain practical insights into how this powerful feature improves flexibility and fairness in model development.
One-hour thematic conversation in English.
Oct. 8th, 2025 - 2:00pm-8:00pm CETMasterclass
Recommended Practices
«Time Series Forecasting»
By the scikit-learn core developers Olivier Grisel and Guillaume Lemaitre.
For experienced data scientists and ML Engineers - limited to 20 participants
This hands-on workshop will use both real world production and environmental data to illustrate how to deal with common data-quality problems. We will also use some specially crafted synthetic data to emphasize catastrophic pitfalls related to edge cases to gain deeper insights about our modeling choices and their limitations.
Oct. 22nd, 2025 - 3:00 PM UTCOnline Talk
Modeling
«Skrub: machine learning for dataframes»
by Guillaume Lemaitre, Chief ML Officer, PhD.
For experienced data scientists and ML Engineers
Machine-learning algorithms expect a numeric array with one row per observation. Typically, creating this table requires "wrangling" with Pandas or Polars (aggregations, selections, joins, ...), and to extract numeric features from structured data types such as datetimes. These transformations must be applied consistently when making predictions for unseen inputs, and choices must be informed by performance measured on a validation dataset, while preventing data leakage. This preprocessing is the most difficult and time-consuming part of many data-science projects.
One-hour thematic conversation in English.
25-26 November, 2025Probability One
Event
Probability One
The community event for scikit-learn and its open-source ecosystem.
We will gather a crowd of data scientists, data engineers, and data owners, all looking to harness the power of data and open source software, running on the most relevant hardware and AI infrastructure layers.
Conferences - Demos - One to One meetings
2025 edition is organised as a dedicated village during the AdoptAI event. For more information about AdoptAI, visit: https://adoptai.artefact.com/
Nov. 25th, 2025 - TBDExclusive Masterclass
«Bridging data engineering to machine learning with skrub»
By Gaël Varoquaux, Research Director at Inria, Co-Founder at scikit-learn & Probabl.
One hour practical lecture for experienced Data Science and Machine Learning Practitioners - limited to 70 participants
Dec. 10th, 2025 - 3:00 PM UTCOnline Talk
Recommended Practices
TO BE CONFIRMED
« Practical strategies for assessing and mitigating harm in AI systems-Exploring fairlearn. »
by Adrin Jalali, VP Labs & core developer of scikit-learn.
For experienced data scientists and ML Engineers
One-hour thematic conversation in English.
These events are made possible thanks to following sponsors



The scikit-learn® Strategic Partnership Programs
Recent events
July 10th, 2025 - 2:00pm-8:00pm CETMasterclass
Recommended Practices
«Time Series Forecasting»
By the scikit-learn core developers Olivier Grisel and Guillaume Lemaitre.
For experienced data scientists and ML Engineers - limited to 20 participants
This hands-on workshop will use both real world production and environmental data to illustrate how to deal with common data-quality problems. We will also use some specially crafted synthetic data to emphasize catastrophic pitfalls related to edge cases to gain deeper insights about our modeling choices and their limitations..
May 21st, 2025 - 3:00 PM GMTOnline Talk
Modeling
«When should you use Survival analysis and competing risk modeling using HAZARDOUS?»
by Vincent Maladière, AI/ML Research Engineer.
For all Data Science and Machine Learning Practitioners
One-hour thematic conversations on data science and engineering topics. An expert speaker presents a problem / topic and his vision of best practices, methodology, tools, etc. for 20 to 30 minutes. The rest of the session is devoted to questions and discussions around the presentation. A knowledge sharing & networking space. All conversations are in English.
May 20th, 2025 - 3:00 PM GMTSkore hub private reveal
New Product - Early Access Program
for data science at scale
We built Skore hub as the missing layer between your team's experimentation and production environments. It extends our open-source skore library (the scikit-learn side kick) to bring structure to your development workflow before heavy MLOps tooling becomes necessary.
April 29th, 2025 - 4:00 PM GMTML Challenge using skore
ML Challenge using skore
For all Data Science and Machine Learning Practitioners.
A team-building experience centered around a stimulating challenge. For our part, this will be an opportunity to see how skore works in real-life conditions: we'll add a few constraints along the way!
- Location: at Probabl, in the Montparnasse Tower in Paris
- Date: Tuesday, April 29th, all day
- Lunch: provided by probabl
- Task: A time series regression.
- Team size: 2 to 4 people
March 26th, 2025 - 4:00 PM GMTOnline Talk
Recommended Practices
«Best practices to open source a product and creating a community around it»
by Adrin Jalali, VP Labs & core developer of scikit-learn.
Designed for a general audience – no prior knowledge required.
One-hour thematic conversations on data science and engineering topics. An expert speaker presents a problem / topic and his vision of best practices, methodology, tools, etc. for 20 to 30 minutes. The rest of the session is devoted to questions and discussions around the presentation. A knowledge sharing & networking space. All conversations are in English.
Feb 13th, 2025 - 2:00pm-8:00pm CETMasterclass
Modeling
«Aligning probabilistic classification with business decisions using scikit-learn»
By the scikit-learn core developers Olivier Grisel and Guillaume Lemaitre.
For experienced data scientists and ML Engineers - limited to 20 participants
A hands-on, business-centric training experience based on real industry case studies. A great opportunity to enhance your data strategy skills while connecting with industry leaders during the networking cocktail