Lightnews — Scholar-powered news

Reposted by David Picard

Vicky Kalogeiton @vickykalogeiton.bsky.social · 7h

Very proud of our recent work, kudos to the team! Read @davidpicard.bsky.social’s excellent post for more details or the paper arxiv.org/pdf/2502.21318

David Picard @davidpicard.bsky.social · 7h

And of course, all the authors who worked really hard to produce these valuable contributions:
@lucasdegeorge.bsky.social
@arrijitghosh.bsky.social
@nicolasdufour.bsky.social
@vickykalogeiton.bsky.social

David Picard @davidpicard.bsky.social · 7h

Final note: I'm (we're) tempted to organize a challenge on that topic as a workshop at a CV conf. ImageNet is the only source of images allowed and then you compete to get the bold numbers.

Do you think there would be people in for that? Do you think it would make for a nice competition?

David Picard @davidpicard.bsky.social · 7h

The paper has a gigantic amount of supmat (owing to its reviewing curse): arxiv.org/abs/2502.21318

It's now 29 pages. All you ever need to know to train your own T2I model and then fine-tune/LoRA it to whatever you need.
We show you don't need to start from SDXL or Flux, you can be much more frugal.

David Picard @davidpicard.bsky.social · 7h

We release everything:
The training code: github.com/lucasdegeorg...
The data (captions, cutmix, all): huggingface.co/arijitghosh/...
And even some models (eventually all, once it's user-friendly).

You're 500hrs away from training your T2I model from scratch! Can you wrap your head around that?🤯

GitHub - lucasdegeorge/T2I-ImageNet: Code for "How far can we go with ImageNet for Text-to-Image generation?" paper

David Picard @davidpicard.bsky.social · 7h

But there's more: that checkpoint has all you can expect from a good pretrained model.

We take the checkpoint, upscale it to 1k² and fine-tune it on LAION-POP (400k imgs) for high aesthetics targets.

I would have never bet that you could get those images with ImageNet pretraining + a bit of FT.

David Picard @davidpicard.bsky.social · 7h

We train models at 256² resolution and then fine-tune at 512² to get competitive results on composition benchmarks.

This shows that a rather small model (400M) trained on little but curated data has good understanding and generative capabilities.

Contrary to popular belief: scale is not required!

David Picard @davidpicard.bsky.social · 7h

To enable training T2I on ImageNet, we:
- augment the entire dataset with rich, detailed captions (TA)
- remove the object-centric bias with CutMix augmentations (IA)

Using both augmentations is sufficient to successfully train a model producing the images in the teaser (1st post), using only ImageNet😲
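
For readers unfamiliar with CutMix, here is a minimal sketch of the generic idea: paste a rectangular region from one image into another so the main object no longer sits alone at the center. This is not the authors' implementation (their code is in the linked repository). The names `sample_box` and `cutmix`, the `area_ratio` parameter, and the nested-list image format are all assumptions made for illustration, and the sketch ignores how the caption of the mixed image should be written.

```python
import random


def sample_box(height, width, area_ratio, rng):
    """Sample a box covering at most `area_ratio` of the image, clipped to its bounds."""
    box_h = int(height * area_ratio ** 0.5)
    box_w = int(width * area_ratio ** 0.5)
    # Random box center anywhere in the image.
    cy = rng.randrange(height)
    cx = rng.randrange(width)
    top = max(cy - box_h // 2, 0)
    left = max(cx - box_w // 2, 0)
    bottom = min(cy + box_h // 2, height)
    right = min(cx + box_w // 2, width)
    return top, left, bottom, right


def cutmix(base, patch_source, area_ratio, rng):
    """Return a copy of `base` with one box replaced by the same box of `patch_source`.

    Both images are lists of rows with identical shapes (hypothetical format).
    Also returns the fraction of pixels that were actually pasted.
    """
    height, width = len(base), len(base[0])
    top, left, bottom, right = sample_box(height, width, area_ratio, rng)
    mixed = [row[:] for row in base]  # copy rows so `base` is left untouched
    for y in range(top, bottom):
        mixed[y][left:right] = patch_source[y][left:right]
    pasted_fraction = (bottom - top) * (right - left) / (height * width)
    return mixed, pasted_fraction
```

Usage would look like `mixed, frac = cutmix(img_a, img_b, 0.25, random.Random(0))`; the returned fraction can be smaller than `area_ratio` when the box is clipped at the image border.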

David Picard @davidpicard.bsky.social · 7h

Main motivation:
- T2I research is impossible to reproduce because massive datasets are kept secret and open datasets (LAION) are decaying
- Methods are impossible to compare because of the lack of common data

We propose a reproducible setup: train on ImageNet! It's affordable and ticks all the boxes.

David Picard @davidpicard.bsky.social · 7h

🚨Updated: "How far can we go with ImageNet for Text-to-Image generation?"

TL;DR: train a text2image model from scratch on ImageNet only and beat SDXL.

Paper, code, data available! Reproducible science FTW!
🧵👇

📜 arxiv.org/abs/2502.21318
💻 github.com/lucasdegeorg...
💽 huggingface.co/arijitghosh/...

David Picard @davidpicard.bsky.social · 8h

You know, we're in the middle of switching HR software¹, and I wouldn't be surprised if some people didn't get paid because of migration problems. Digital tools are always a disaster.

¹We're going to use RenoiRH. Yes, you read that right. And nobody batted an eye at the name during development.

Reposted by David Picard

Nicolas Holzschuch @nholzschuch.bsky.social · 16h

To paraphrase Douglas Adams, the US had always considered it was vastly superior to the EU because it had generated billionaires like Sam Altman, Elon Musk or Marc Andreessen, and the EU had not.
And the EU considered it was vastly superior to the US for exactly the same reason.

David Picard @davidpicard.bsky.social · 8h

Which in itself doesn't seem too hard to find.

David Picard @davidpicard.bsky.social · 8h

Yes, same here! Honestly, I wasn't thrilled at first, but the difference with MGEN is night and day. I don't know how long it will last, but it's much appreciated.

David Picard @davidpicard.bsky.social · 9h

Give or take 3 euros, that will be the maximum anyway.

Reposted by David Picard

CNRS Sciences informatiques @cnrsinformatics.bsky.social · 18h

#Distinction 🏆| Charlotte Pelletier, recipient of an #IUF chair, develops artificial intelligence methods applied to satellite image time series.
➡️ www.ins2i.cnrs.fr/fr/cnrsinfo/...
🤝 @irisa-lab.bsky.social @cnrs-bretagneloire.bsky.social

David Picard @davidpicard.bsky.social · 20h

It's autumn 🍂🍁🍄🪾

Reposted by David Picard

Pois Chiche de la Darelle-jédide 🌻🇺🇦🇵🇸 @poischiche.bsky.social · 21h

A hundred years ago, between late September and early October 1925, fascist barbarity descended methodically on Florence, killing, burning, and beating in order to "purify" Italy, through its anti-Masonic "holy violence", of part of its living Enlightenment heritage.
agone.org/les-crimes-f...

Fascist crimes (Florence, 3 October 1925)

From the Great War to the Spanish Civil War, Camillo Berneri (1897–1937) fought fascism with both pen and action. This Italian anarchist intellectual and activist was one of the first to ...

David Picard @davidpicard.bsky.social · 23h

This coalition simulator is really interesting. It helps you understand that in France, we don't like compromises that lead to coalitions.
datan.fr/outils/coali...

Assemblée nationale: form your majority with our coalition simulator | Datan

Create your own coalition in the Assemblée nationale. After the 2024 legislative elections, no political group won an absolute majority. Use our simulator to select the ...

Reposted by David Picard

marc rees @reesmarc.bsky.social · 1d

« This blocks the required majority in the EU Council, derailing the plan to pass the surveillance law next week ».

www.patrick-breyer.de/en/citizen-p...

Citizen Protest Halts Chat Control; Breyer Celebrates Major Victory for Digital Privacy

In a major breakthrough for the digital rights movement, the German government has refused to back the EU's controversial Chat Control regulation today after facing massive public pressure. This block...

David Picard @davidpicard.bsky.social · 1d

High dimensional wolves or low dimensional wolves? That changes quite a lot what I think.

David Picard @davidpicard.bsky.social · 1d

That's the best way of catching pneumonia. Not sure I would call that fun. 🥶

David Picard @davidpicard.bsky.social · 1d

If you were to make the most potent and addictive drug ever, how would you know? Because if you test it on yourself...😅

Reposted by David Picard

Alexandre Archambault @archambault-avocat.fr · 1d

Imagine that the government decided to require La Poste to open all mail sent by French people, to check that it contains no child sexual abuse material.

Yet that is what 🇪🇺 is potentially about to do, while saying "but don't worry, we'll seal it right back up afterwards".

Pierre Beyssac @pierreb.bsky.social · 1d

French political decay is certainly fascinating to comment on, but let's not forget #chatcontrol, which will be put to the EU Parliament if the 14/10 meeting (next Tuesday) so decides, and France and Germany are now in favor :( lel.media/chatcontrol-...

"ChatControl", the systematic digital search of our conversations

Tomorrow, will Europe read all your messages? That is the principle of "ChatControl", a project revived today in the name of fighting child sexual abuse. Mass surveillance that could...

Reposted by David Picard

Tetiana Martyniuk @t-martyniuk.bsky.social · 1d

Another great event for the @valeoai.bsky.social team: the PhD defense of Corentin Sautier.

His thesis «Learning Actionable LiDAR Representations w/o Annotations» covers the papers BEVContrast (learning self-sup LiDAR features), SLidR, ScaLR (distillation), UNIT and Alpine (solving tasks w/o labels).
