Eugene Berta
@eberta.bsky.social
61 followers 98 following 11 posts
PhD student at INRIA Paris. Working on calibration of machine learning classifiers.
Pinned
eberta.bsky.social
Early stopping on validation loss? This leads to suboptimal calibration and refinement errors—but you can do better!
With @dholzmueller.bsky.social, Michael I. Jordan, and @bachfrancis.bsky.social, we propose a method that integrates with any model and boosts classification performance across tasks.
Reposted by Eugene Berta
potosacho.bsky.social
COLT Workshop on Predictions and Uncertainty was a banger!

I was lucky to present our paper "Minimum Volume Conformal Sets for Multivariate Regression", alongside my colleague @eberta.bsky.social and his awesome work on calibration.

Big thanks to the organizers!

#ConformalPrediction #MarcoPolo
Reposted by Eugene Berta
lchoshen.bsky.social
What if we have been doing early stopping wrong all along?
When you break the validation loss into two terms, calibration and refinement,
you can use a simple (and efficient) trick to stop training at a smarter point
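A rough sketch of the idea (assumed helper names, not the authors' code): estimate the refinement error as the validation loss that remains after temperature scaling, treat the rest as calibration error, and pick the checkpoint that minimizes refinement rather than the raw validation loss.

```python
# Minimal sketch, not the paper's implementation: split validation NLL into
# calibration + refinement via temperature scaling, then choose the stopping
# epoch by refinement error instead of raw validation loss.
import numpy as np
from scipy.optimize import minimize_scalar

def nll(logits, labels, temperature=1.0):
    z = logits / temperature
    z = z - z.max(axis=1, keepdims=True)                  # numerical stability
    log_probs = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()

def refinement_error(logits, labels):
    # Loss left after the best single-temperature recalibration.
    best_t = minimize_scalar(lambda t: nll(logits, labels, t),
                             bounds=(0.05, 20.0), method="bounded").x
    return nll(logits, labels, best_t)

def calibration_error(logits, labels):
    # The part of the validation loss that temperature scaling can remove.
    return nll(logits, labels) - refinement_error(logits, labels)

def pick_epoch(per_epoch_logits, val_labels):
    # per_epoch_logits: list of (n_val, n_classes) arrays, one per checkpoint.
    return min(range(len(per_epoch_logits)),
               key=lambda e: refinement_error(per_epoch_logits[e], val_labels))
```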
eberta.bsky.social
This suggests a clear link with the ROC curve in the binary case, but writing it down formally, the relationship between the two is a bit ugly…
eberta.bsky.social
Isotonic regression minimizes the risk of any « Bregman loss function » (including cross-entropy, see section 2.1 below) up to monotonic relabeling, which looks a lot like our « refinement as a minimiser » formulation. It also finds the ROC convex hull (a quick sketch follows the link below).
proceedings.mlr.press/v238/berta24...
Classifier Calibration with ROC-Regularized Isotonic Regression
Calibration of machine learning classifiers is necessary to obtain reliable and interpretable predictions, bridging the gap between model outputs and actual probabilities. One prominent technique, ...
proceedings.mlr.press
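A toy illustration of the Bregman-loss point (a sketch with synthetic data, not the paper's experiments): scikit-learn's IsotonicRegression is fitted by squared error, yet the same monotone relabeling also reduces the cross-entropy of miscalibrated binary scores.

```python
# Toy sketch: isotonic recalibration of miscalibrated binary scores.
import numpy as np
from sklearn.isotonic import IsotonicRegression
from sklearn.metrics import log_loss

rng = np.random.default_rng(0)
scores = rng.uniform(size=2000)                 # raw classifier scores in [0, 1]
labels = rng.binomial(1, scores ** 2)           # miscalibrated: true prob is scores**2

iso = IsotonicRegression(y_min=1e-6, y_max=1 - 1e-6, out_of_bounds="clip")
calibrated = iso.fit_transform(scores, labels)  # monotone relabeling of the scores

print("log loss before:", log_loss(labels, np.clip(scores, 1e-6, 1 - 1e-6)))
print("log loss after: ", log_loss(labels, calibrated))
```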
eberta.bsky.social
However, for calibration of the final model, adding an intercept or doing matrix scaling might work even better in certain scenarios (imbalanced, non-centered). We’ve experimented with existing implementations with limited success so far; maybe we should look at that in more detail…
eberta.bsky.social
Not yet! Vector/matrix scaling has more parameters, so it is more prone to overfitting the validation set, and simple temperature scaling seems to calibrate well empirically, which is why we stuck with it to estimate the refinement error for early stopping.
eberta.bsky.social
I’ve observed refinement being minimized before calibration for small (probably under-fitted) neural nets. In many cases, the refinement curve also starts « overfitting » at some point.
eberta.bsky.social
We’ve not tried what you’re suggesting but if the training cost is small this might indeed be a good option!
eberta.bsky.social
Indeed, regularisation seems very important. It can have a large impact on how the calibration error behaves. Combined with learning rate schedulers, this can have surprising effects, like the calibration error starting to go down again at some point.
eberta.bsky.social
Thanks! We have experimented with many models, observing various behaviours. The « calibration going up while refinement goes down » seems typical in deep learning from what I’ve seen. With smaller models other things can appear, as suggested by our logistic regression analysis (section 6).