@glemaitre58.bsky.social
150 followers 650 following 4 posts
Reposted
ogrisel.bsky.social
Today at #EuroScipy2025, @glemaitre58.bsky.social and I presented a tutorial on pitfalls of machine learning for imbalanced classification problems.

We discussed what (not) to do when fitting a classifier and obtaining degenerate precision or recall values.

probabl-ai.github.io/calibration-...
Imbalanced classification: pitfalls and solutions — Probabilistic calibration of cost-sensitive learning
probabl-ai.github.io
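As a rough illustration of the "degenerate precision or recall" mentioned in the post above (a toy sketch, not taken from the tutorial itself; the 5% positive rate and the helper name `precision_recall` are assumptions made up for this example), a classifier that always predicts the majority class looks accurate while never finding a positive:

```python
def precision_recall(y_true, y_pred):
    """Return (precision, recall); a value is None when it is undefined."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if (tp + fp) else None  # no positive predictions
    recall = tp / (tp + fn) if (tp + fn) else None     # no positive labels
    return precision, recall


# Hypothetical imbalanced dataset: 5 positives, 95 negatives.
y_true = [1] * 5 + [0] * 95

# Degenerate classifier 1: always predict the majority (negative) class.
always_negative = [0] * 100
accuracy_neg = sum(t == p for t, p in zip(y_true, always_negative)) / len(y_true)
precision_neg, recall_neg = precision_recall(y_true, always_negative)

# Degenerate classifier 2: always predict the minority (positive) class.
always_positive = [1] * 100
precision_pos, recall_pos = precision_recall(y_true, always_positive)

print(accuracy_neg, precision_neg, recall_neg)
print(precision_pos, recall_pos)
```

The first classifier reaches high accuracy with zero recall and an undefined precision; the second reaches perfect recall with a precision equal to the positive rate. Neither is useful, which is why accuracy alone can hide the problem on imbalanced data.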
glemaitre58.bsky.social
A small update on the retrospective and future priorities of the open source team at @probabl.bsky.social for the next 6 months or so.
probabl.ai
At Probabl, together with the wider community, we continue our dedicated efforts to support and enhance @scikit-learn.org and its ecosystem. In this post, we provide a retrospective on the work accomplished over the last 6 months and the roadmap for the next 6:
papers.probabl.ai/open-source-...
Open source software priorities at Probabl - Chapter 2
papers.probabl.ai
glemaitre58.bsky.social
Sometimes you think you are right because you did everything "by the book." But sometimes the book is just a tiny part of the full story. Digging further and writing a new chapter with more insights is actually fun...
probabl.ai
New podcast episode! This one is about imbalanced-learn and how the maintainer looks back with some lessons learned.

If you are dealing with imbalanced classification use-cases, like fraud, you'll want to listen in on this one!

youtu.be/npSkuNcm-Og
Imbalanced-learn: regrets and onwards - with Guillaume Lemaitre, core-maintainer
YouTube video by probabl
youtu.be
glemaitre58.bsky.social
OK, that is interesting feedback. We could support older versions. So far, we have not seen any code that we are eager to drop quickly. I understand the concern about runtime dependencies; on our side, the idea is to depend only on scikit-learn. But agreed, it is one more dependency.
glemaitre58.bsky.social
We are working on a small package to make developers' lives easier: github.com/glemaitre/sk.... The idea is that recurring work could be centralized in a single package. Once we have a minimal version, we will do a first release to support scikit-learn 1.2 to 1.6.
GitHub - glemaitre/sklearn-compat
Contribute to glemaitre/sklearn-compat development by creating an account on GitHub.
github.com
Reposted
ogrisel.bsky.social
I recently shared some of my reflections on how to use probabilistic classifiers for optimal decision-making under uncertainty at @pydataparis.bsky.social 2024.

Here is the recording of the presentation:

www.youtube.com/watch?v=-gYn...
A high-level summary diagram taken from the slides linked below. It shows the interplay of two main components: a probabilistic model and decision maker or planner.
Probabilistic predictions of an underfitting polynomial classifier on a noisy XOR task and the corresponding under-confident calibration curve.
Probabilistic predictions of an overfitting polynomial classifier and the resulting overconfident calibration curve on the same noisy XOR problem.
Simulation study to show the relative lack of stability of hyperparameter tuning when using hard metrics such as Accuracy or soft yet not probabilistic metrics such as ROC AUC compared to a strictly proper scoring rule such as the log-loss.
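The image descriptions above contrast hard metrics such as accuracy with a strictly proper scoring rule such as the log-loss. The following is only a toy sketch of that difference, not the simulation study from the talk; the labels, the two sets of predicted probabilities, and the 0.5 threshold are made up for illustration.

```python
import math


def accuracy(y_true, proba, threshold=0.5):
    """Fraction of samples whose thresholded prediction matches the label."""
    hits = sum((p >= threshold) == bool(t) for t, p in zip(y_true, proba))
    return hits / len(y_true)


def log_loss(y_true, proba, eps=1e-15):
    """Mean negative log-likelihood of binary labels under predicted probabilities."""
    total = 0.0
    for t, p in zip(y_true, proba):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        total -= t * math.log(p) + (1 - t) * math.log(1 - p)
    return total / len(y_true)


# Hypothetical labels and two hypothetical models' predicted P(y = 1).
y_true = [1, 1, 0, 0]
confident = [0.9, 0.8, 0.2, 0.1]
hesitant = [0.6, 0.55, 0.45, 0.4]

acc_confident = accuracy(y_true, confident)
acc_hesitant = accuracy(y_true, hesitant)
ll_confident = log_loss(y_true, confident)
ll_hesitant = log_loss(y_true, hesitant)

print(acc_confident, acc_hesitant)
print(ll_confident, ll_hesitant)
```

Both models get every thresholded prediction right, so accuracy cannot tell them apart, while the log-loss is lower for the model whose probabilities sit closer to the observed labels. This sensitivity to the probabilities themselves is what a threshold-based metric discards.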
Reposted
scikit-learn.org
Please help us test the first release candidate for scikit-learn 1.6: pip install scikit-learn==1.6.0rc1

Changelog: scikit-learn.org/1.6/whats_ne...

In particular, if you maintain a project with a dependency on scikit-learn, please let us know about any regression.
Version 1.6
Legend for changelogs something big that you couldn’t do before., something that you couldn’t do before., an existing feature now may not require as much computation or memory., a miscellaneous min...
scikit-learn.org
Reposted
probabl.ai
With Artefact, we are delighted to invite data leaders to an exclusive Paris masterclass: ✨Aligning Probabilistic Classification with Business Decisions using @scikit-learn.bsky.social ✨ 🚨Limited seats available! Secure your spot now 👉🏻 lu.ma/fopoglzo #MachineLearning #Advanced #AI #Masterclass