Lightnews — Scholar-powered news

Benjamin Feuer @benjaminfeuer.bsky.social · Jun 18

Special thanks to the BlueSky DCVLR crew: @yuhuiz.bsky.social @thaottn.bsky.social @vishaalurao.bsky.social @saining.bsky.social @sarameghanbeery.bsky.social

Benjamin Feuer @benjaminfeuer.bsky.social · Jun 18

Check out:

Our Website: dcvlr-neurips.github.io

Our Starter Kit (Curate, Train, Eval): github.com/oumi-ai/oumi...

🧵 6 / n

DCVLR: Data Curation for Vision Language Reasoning - NeurIPS 2025 Competition

Join the DCVLR NeurIPS 2025 Competition. Advance visual reasoning in VLMs through data curation.

dcvlr-neurips.github.io

Benjamin Feuer @benjaminfeuer.bsky.social · Jun 18

* A submission = a curated reasoning dataset on @huggingface with 1k or 10k samples and a scalable, reproducible curation strategy you document in a write-up
* You don’t need to train a model
* You can submit with nothing more than a free Colab or Kaggle account for basic testing

🧵 5 / n

Benjamin Feuer @benjaminfeuer.bsky.social · Jun 18

💪anyone can compete for free 💪: Thanks to our sponsor @LambdaAPI we offer three free submissions for up to 500 teams. This is unprecedented in data-centric research, which tends to be very expensive because you have to train lots of models!

🧵 4 / n

Benjamin Feuer @benjaminfeuer.bsky.social · Jun 18

🤖 open-models 🤖: every model we present results for will have open weights, and one of those models will be Molmo-O from @allen_ai (a recent best paper honorable mention from @cvpr at #CVPR2025), trained on open data.

🧵 3 / n

Benjamin Feuer @benjaminfeuer.bsky.social · Jun 18

DCVLR is data-centric: we train a ~7B VLM on your dataset. The best performer (on benchmarks like MathVista, VMCBench and LiveXiv) will be eligible to win $1500 and a talk at #NeurIPS2025!

We also have a few twists compared to prior data-centric competitions:

🧵 2 / n

Benjamin Feuer @benjaminfeuer.bsky.social · Jun 18

So excited to announce the DCVLR (Data Curation for Vision-Language Reasoning) competition at #NeurIPS2025, led by @oumi-pbc.bsky.social and Lambda AI!

🌟open-data 🌟
🤖 open-models 🤖
💻 open-source 💻
💪anyone can compete for free 💪

dcvlr-neurips.github.io

🧵 1 / n

Benjamin Feuer @benjaminfeuer.bsky.social · May 1

Co-organizing with wonderful collaborators from MIT, NYU, Stanford and UW: @thaottn.bsky.social, @sewoong79.bsky.social, @sarameghanbeery.bsky.social, @yuhuiz.bsky.social!

Benjamin Feuer @benjaminfeuer.bsky.social · May 1

We are excited to be sponsored by @datologyai.com, who will be providing prizes for best paper awards 🏆

Benjamin Feuer @benjaminfeuer.bsky.social · May 1

🚀We welcome any submission that discusses domain-specific data curation pipelines and/or generalizable curation principles, getting us closer to building data-centric methods that are robust, efficient, and adaptable across domains.

Refer to our website for the call for papers!

Benjamin Feuer @benjaminfeuer.bsky.social · May 1

📢 Announcing our data-centric workshop at ICML 2025 on unifying data curation frameworks across domains!

📅 Deadline: May 24, AoE
🔗 Website: dataworldicml2025.github.io

We have an amazing lineup of speakers + panelists from various institutions and application areas!

ICML 2025 Workshop on Unifying Data Curation Frameworks Across Domains

dataworldicml2025.github.io

Benjamin Feuer @benjaminfeuer.bsky.social · Dec 22

That's not what they did. They used GPT-4o for program synthesis, which is fundamentally different from asking the LLM to provide the correct response in the prompt.

Benjamin Feuer @benjaminfeuer.bsky.social · Dec 22

Thanks for sharing! FWIW, I sensed mostly optimism and excitement at NeurIPS -- the people I spoke to were eager to talk about their research and learn about mine. Let's meet up in the new year and compare notes @kyunghyuncho.bsky.social

Benjamin Feuer @benjaminfeuer.bsky.social · Dec 14

That does seem like a sound rule! Although, interestingly, they did not apply it to me. 😅

Benjamin Feuer @benjaminfeuer.bsky.social · Dec 11

Hello Vancouver!

Benjamin Feuer @benjaminfeuer.bsky.social · Dec 8

or AI for science!

baskargroup.github.io/BioTrove/

Benjamin Feuer @benjaminfeuer.bsky.social · Dec 8

Or tabular deep learning ...

www.linkedin.com/posts/benjam...

Ben Feuer on LinkedIn: GitHub - penfever/TuneTables: TuneTables is a tabular classifier that…

Very happy to report that our paper describing TuneTables, a new tabular classification and regression model, will appear at #NeurIPS 2024! 🎊 Built on the…

www.linkedin.com

Benjamin Feuer @benjaminfeuer.bsky.social · Dec 8

Or LLMs ...

sites.google.com/berkeley.edu...

Statistics in LLMs - Schedule

Saturday, December 14th, 2024

sites.google.com

Benjamin Feuer @benjaminfeuer.bsky.social · Dec 8

NeurIPS folks, excited to connect next week at the conference!

HMU to talk about VLMs ...

neurips.cc/virtual/2024...

NeurIPS Poster: ImageNet++: A Large-Scale Benchmark of Data Curation Strategies (NeurIPS 2024)

neurips.cc

Benjamin Feuer @benjaminfeuer.bsky.social · Nov 28

Unpopular opinion: the #ICLR2025 reviews were better quality than in the last few years.

I think it's mainly because they had people review fewer papers.

Opinions?

Benjamin Feuer @benjaminfeuer.bsky.social · Nov 28

This book helped me learn how to understand ideological and inconsistent intellectual stances (a bit)

www.amazon.com/Righteous-Mi...

The Righteous Mind: Why Good People Are Divided by Politics and Religion, by Jonathan Haidt

www.amazon.com

Benjamin Feuer @benjaminfeuer.bsky.social · Nov 27

I feel like my MacBook Pro battery is starting to go; it used to last all day, now it's dead by the afternoon. The thing is only 2.5 years old. 🤨

Benjamin Feuer @benjaminfeuer.bsky.social · Nov 20

Excited to be making my first post on BlueSky! Let's talk AI research.

@eugenevinitsky.bsky.social, can I get a who's who on here? :-)
