Also at:
https://sigmoid.social/@ogrisel
https://github.com/ogrisel
- 60+ arch., up to 2B params
- 10+ datasets
- in-domain training (>DINOv3)
- corr(train loss, test perf)=95%
- 60+ arch., up to 2B params
- 10+ datasets
- in-domain training (>DINOv3)
- corr(train loss, test perf)=95%
PSF therefore declined the funding.
Science suffers, but commitment to core values remains
🧵
PSF therefore declined the funding.
Science suffers, but commitment to core values remains
Ticketing, practical infos and schedule at: pydata.org/paris2025
github.com/scikit-learn...
github.com/scikit-learn...
pydata.org/paris2025/so...
pydata.org/paris2025/ti...
pydata.org/paris2025/so...
pydata.org/paris2025/ti...
"Probabilistic regression models: let's compare different modeling strategies and discuss how to evaluate them", by @ogrisel.bsky.social from @probabl.ai .
📜 pretalx.com/pydata-paris-2025/talk/DVMZBT
📅 pydata.org/paris2025/schedule
🎟 pydata.org/paris2025/tickets
What's one thing you'd love to see improved in JupyterLab, Jupyter Notebook, or JupyterLite?
The team is prepping the upcoming 4.5/7.5 releases and wants to tackle some usability issues.
Drop your feedback below, this will help prioritize what gets fixed!👇
What's one thing you'd love to see improved in JupyterLab, Jupyter Notebook, or JupyterLite?
The team is prepping the upcoming 4.5/7.5 releases and wants to tackle some usability issues.
Drop your feedback below, this will help prioritize what gets fixed!👇
We discussed what (not) to do when fitting a classifier and obtaining degenerate precision or recall values.
probabl-ai.github.io/calibration-...
We discussed what (not) to do when fitting a classifier and obtaining degenerate precision or recall values.
probabl-ai.github.io/calibration-...
Here is the repo with the material for the tutorial: github.com/skrub-data/E...
Here is the repo with the material for the tutorial: github.com/skrub-data/E...
📊 an online leaderboard (submit!)
📑 carefully curated datasets
📈 strong tree-based, deep learning, and foundation models
🧵
📊 an online leaderboard (submit!)
📑 carefully curated datasets
📈 strong tree-based, deep learning, and foundation models
🧵
With Jingang Qu, @dholzmueller.bsky.social, and Marine Le Morvan
TL;DR: a well-designed architecture and pretraining gives best tabular learner, and more scalable
On top, it's 100% open source
1/9
With Jingang Qu, @dholzmueller.bsky.social, and Marine Le Morvan
TL;DR: a well-designed architecture and pretraining gives best tabular learner, and more scalable
On top, it's 100% open source
1/9
🚀 Major update! Skrub DataOps, various improvements for the TableReport, new tools for applying transformers to the columns, and a new robust transformer for numerical features are only some of the features included in this release.
My solution is short (48 LOC) and relatively general-purpose – I used skrub to preprocess string and date columns, and pytabkit to create an ensemble of RealMLP and TabM models. Link below👇
My solution is short (48 LOC) and relatively general-purpose – I used skrub to preprocess string and date columns, and pytabkit to create an ensemble of RealMLP and TabM models. Link below👇
Don't miss out on this chance to learn, connect, and grow. Save the date and get your ticket!
pydata.org/paris2025/sc...
Don't miss out on this chance to learn, connect, and grow. Save the date and get your ticket!
pydata.org/paris2025/sc...
The inductive bias of the neural network prevents from perfectly learning u* and overfitting.
In particular neural networks fail to learn the velocity field for two particular time values.
See the paper for a finer analysis 😀
The inductive bias of the neural network prevents from perfectly learning u* and overfitting.
In particular neural networks fail to learn the velocity field for two particular time values.
See the paper for a finer analysis 😀
Don't miss out! Early bird tickets are available only until the end of the day. Grab yours now and save!
pydata.org/paris2025/ti...
Don't miss out! Early bird tickets are available only until the end of the day. Grab yours now and save!
pydata.org/paris2025/ti...
Only a few days left to grab your early bird tickets! Don't wait—this special offer ends Sunday at the end of the day. Secure yours now!
pydata.org/paris2025/ti...
Only a few days left to grab your early bird tickets! Don't wait—this special offer ends Sunday at the end of the day. Secure yours now!
pydata.org/paris2025/ti...
The countdown is on — only 10 days left to grab your early bird tickets for our 2025 edition! Lock in your spot at the best price before rates go up.
pydata.org/paris2025/ti...
The countdown is on — only 10 days left to grab your early bird tickets for our 2025 edition! Lock in your spot at the best price before rates go up.
pydata.org/paris2025/ti...
Check this out (links in thread) ⬇️
Check this out (links in thread) ⬇️
Eric Snow: www.linkedin.com/in/ericsnowc...
Irit Katriel: www.linkedin.com/in/irit-katr...
Mark Shannon: www.linkedin.com/in/mark-shan...
Eric Snow: www.linkedin.com/in/ericsnowc...
Irit Katriel: www.linkedin.com/in/irit-katr...
Mark Shannon: www.linkedin.com/in/mark-shan...
docs.python.org/3.14/whatsne...
docs.python.org/3.14/whatsne...
"Betancourt, Michael (YEAR ACCESSED). CHAPTER NAME. Retrieved from GITHUB LINK, commit COMMIT NUMBER"
"Betancourt, Michael (YEAR ACCESSED). CHAPTER NAME. Retrieved from GITHUB LINK, commit COMMIT NUMBER"
Use the GitHub CLI command `gh issue develop -c` to create and checkout a local branch linked to the given issue. No need to name the branch, and when you later run `gh pr create`, the pull request will be linked to the issue automatically.
Use the GitHub CLI command `gh issue develop -c` to create and checkout a local branch linked to the given issue. No need to name the branch, and when you later run `gh pr create`, the pull request will be linked to the issue automatically.