Lucas Beyer (bl16)
@giffmana.ai
Researcher (OpenAI. Ex: DeepMind, Brain, RWTH Aachen), Gamer, Hacker, Belgian.
Anon feedback: https://admonymous.co/giffmana
📍 Zürich, Switzerland 🔗 http://lucasb.eyer.be
I noticed that I'm not using bsky much anymore. Not sure why, vibes.
Anyways, someone noticing that DeepSeek refuses to answer *anything* about Xi Jinping, even the question of whether he exists at all, prompted me to write a short snippet on safety fine-tuning: lb.eyer.be/s/safety-sft...
January 26, 2025 at 9:21 PM
First candidate for banger of the year appeared, only 2 days in:
January 2, 2025 at 10:26 PM
Reposted by Lucas Beyer (bl16)
OpenAI skips o2, previews o3 scores, and they're truly crazy. Huge progress on the few benchmarks we think are truly hard today. Including ARC AGI.
RIP to people who say any of "progress is done," "scale is done," or "LLMs can't reason".
2024 was awesome. I love my job.
December 20, 2024 at 6:08 PM
A post by @cloneofsimo on Twitter made me write up some lore about residuals, ResNets, and Transformers. And I couldn't resist sliding in the usual cautionary tale about small/mid-scale != large-scale.
Blogpost: lb.eyer.be/s/residuals....
December 18, 2024 at 11:14 PM
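For the impatient, the core trick the post is about fits in a few lines. A minimal PyTorch sketch of a residual block (illustrative only, not code from the blogpost; sizes and names are made up):

```python
# Minimal residual block in the pre-norm Transformer style; ResNets use the same
# skip-connection idea with convolutions and BatchNorm instead.
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    def __init__(self, dim, hidden):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(nn.Linear(dim, hidden), nn.GELU(), nn.Linear(hidden, dim))

    def forward(self, x):
        # The block only learns an update; the identity path carries x through unchanged,
        # giving gradients a clean shortcut even when the MLP is badly scaled early on.
        return x + self.mlp(self.norm(x))

x = torch.randn(2, 16, 64)        # (batch, tokens, width)
y = ResidualBlock(64, 256)(x)     # same shape as x
```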
Good morning Vancouver!
Things are different here: this guy is alone, chonky, and not scared at all; I was more scared of him towards the end lol.
Also look at that … industrialization
December 10, 2024 at 6:09 PM
Good morning! On my way to NeurIPS, slightly sad to leave this beautiful place and my family for the week, but also excited to meet many new and old friends at NeurIPS!
December 9, 2024 at 8:40 AM
Reposted by Lucas Beyer (bl16)
One of the best tutorials for understanding Transformers!
📽️ Watch here: www.youtube.com/watch?v=bMXq...
Big thanks to @giffmana.ai for this excellent content! 🙌
[M2L 2024] Transformers - Lucas Beyer
YouTube video by Mediterranean Machine Learning (M2L) summer school
December 8, 2024 at 9:58 AM
Reposted by Lucas Beyer (bl16)
Attending #NeurIPS2024? If you're interested in multimodal systems, building inclusive & culturally aware models, and how fractals relate to LLMs, we have 3 posters for you. I look forward to presenting them on behalf of our GDM team @ Zurich & collaborators. Details below (1/4)
December 7, 2024 at 6:50 PM
Reposted by Lucas Beyer (bl16)
The fourth nice thing we* have for you this week: PaliGemma 2.
It’s also a perfect transition: this v2 was carried a lot more by @andreaspsteiner.bsky.social, André, and Michael than by us.
Crazy new SOTA tasks! Interesting res vs LLM size study! Better OCR! Less hallucination!
🚀🚀PaliGemma 2 is our updated and improved PaliGemma release using the Gemma 2 models and providing new pre-trained checkpoints for the full cross product of {224px,448px,896px} resolutions and {3B,10B,28B} model sizes.
1/7
December 5, 2024 at 8:19 PM
Reposted by Lucas Beyer (bl16)
OpenAI is coming to Switzerland, and the founding team of the new Zurich office is nothing short of stellar🇨🇭⭐️
What a fantastic win for the European AI landscape 🇪🇺
Congrats to @giffmana.ai @kolesnikov.ch @xzhai.bsky.social for that move, and to OpenAI for making these hires!
So, now that our move to OpenAI became public, @kolesnikov.ch @xzhai.bsky.social and I are drowning in notifications. I read everything, but may not reply.
Excited about this new journey! 🚀
Quick FAQ thread...
Ok, it is yesterday's news already, but a good night's sleep is important.
After 7 amazing years at Google Brain/DM, I am joining OpenAI. Together with @xzhai.bsky.social and @giffmana.ai, we will establish the OpenAI Zurich office. Proud of our past work and looking forward to the future.
December 5, 2024 at 8:24 AM
@francois.fleuret.org hey, can you buy me a few copies of tomorrow's Le Temps if the news about us is printed in it? I'll pay you in beers at NeurIPS.
December 4, 2024 at 9:52 PM
So, now that our move to OpenAI became public, @kolesnikov.ch @xzhai.bsky.social and I are drowning in notifications. I read everything, but may not reply.
Excited about this new journey! 🚀
Quick FAQ thread...
Ok, it is yesterday's news already, but a good night's sleep is important.
After 7 amazing years at Google Brain/DM, I am joining OpenAI. Together with @xzhai.bsky.social and @giffmana.ai, we will establish the OpenAI Zurich office. Proud of our past work and looking forward to the future.
December 4, 2024 at 9:23 PM
Reposted by Lucas Beyer (bl16)
Ok, it is yesterday's news already, but a good night's sleep is important.
After 7 amazing years at Google Brain/DM, I am joining OpenAI. Together with @xzhai.bsky.social and @giffmana.ai, we will establish the OpenAI Zurich office. Proud of our past work and looking forward to the future.
December 4, 2024 at 9:14 AM
Reposted by Lucas Beyer (bl16)
JetFormer: An Autoregressive Generative Model of Raw Images and Text
@mtschannen.bsky.social @asusanopinto.bsky.social
@kolesnikov.ch
tl;dr: VQGAN quality w/o VQGAN, but optimizing pixel tokens with NLL. Also some normalizing flow magic, which I still have to read up on.
arxiv.org/abs/2411.19722
December 3, 2024 at 7:16 PM
So this was the second cool thing that we* got this week.
* this time the « we » is really just me, but hey, academic we 🙃
Our big_vision codebase is really good! And it's *the* reference for ViT, SigLIP, PaliGemma, JetFormer, ... including fine-tuning them.
However, it's criminally undocumented. I tried using it outside Google to fine-tune PaliGemma and SigLIP on GPUs, and wrote a tutorial: lb.eyer.be/a/bv_tuto.html
December 3, 2024 at 8:40 AM
Our big_vision codebase is really good! And it's *the* reference for ViT, SigLIP, PaliGemma, JetFormer, ... including fine-tuning them.
However, it's criminally undocumented. I tried using it outside Google to fine-tune PaliGemma and SigLIP on GPUs, and wrote a tutorial: lb.eyer.be/a/bv_tuto.html
December 3, 2024 at 12:18 AM
Reposted by Lucas Beyer (bl16)
The answer has just dropped: bsky.app/profile/kole...
2021: Replace every CNN with a Transformer
2022: Replace every GAN with diffusion models
2023: Replace every NeRF with 3DGS
2024: Replace every diffusion model with Flow Matching
2025: ???
December 2, 2024 at 7:00 PM
The first of the cool things we* got this week!
Typically, you'd train a VQ-VAE/GAN tokenizer first, and then use its tokens for your LLM/DiT/... But we all know eventually end-to-end wins over pipelines.
With flow models, you can actually learn pixel-LLM-pixel end-to-end!
Have you ever wondered how to train an autoregressive generative transformer on text and raw pixels, without a pretrained visual tokenizer (e.g. VQ-VAE)?
We have been pondering this during summer and developed a new model: JetFormer 🌊🤖
arxiv.org/abs/2411.19722
A thread 👇
1/
December 2, 2024 at 7:23 PM
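To make that contrast concrete, here is a toy end-to-end sketch in PyTorch: an invertible map plays the role of a learned "soft tokenizer", and an autoregressive model puts a Gaussian on each next latent, so the training loss is an exact NLL of the raw pixels (AR NLL plus the flow's log-det correction). Everything below is made up for illustration; it is not the JetFormer architecture.

```python
# Toy end-to-end sketch (not JetFormer): invertible elementwise affine "tokenizer"
# plus a tiny autoregressive Gaussian model, trained jointly on exact pixel NLL.
import torch
import torch.nn as nn

class AffineFlow(nn.Module):
    """Invertible map z = x * exp(s) + b with a tractable log-determinant."""
    def __init__(self, dim):
        super().__init__()
        self.s = nn.Parameter(torch.zeros(dim))
        self.b = nn.Parameter(torch.zeros(dim))

    def forward(self, x):                               # x: (batch, tokens, dim)
        z = x * self.s.exp() + self.b
        logdet = self.s.sum() * x.shape[1]              # log|det J| per example
        return z, logdet

class ARGaussian(nn.Module):
    """Causal AR model (a GRU stands in for a transformer) emitting mean / log-variance."""
    def __init__(self, dim, hidden=64):
        super().__init__()
        self.rnn = nn.GRU(dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, 2 * dim)

    def nll(self, z):                                   # z: (batch, tokens, dim)
        shifted = torch.cat([torch.zeros_like(z[:, :1]), z[:, :-1]], dim=1)
        h, _ = self.rnn(shifted)                        # predict token t from tokens < t
        mean, logvar = self.head(h).chunk(2, dim=-1)
        # Diagonal Gaussian NLL, additive constants dropped.
        return 0.5 * (((z - mean) ** 2) / logvar.exp() + logvar).sum(dim=(1, 2))

flow, ar = AffineFlow(16), ARGaussian(16)
opt = torch.optim.Adam([*flow.parameters(), *ar.parameters()], lr=1e-3)

x = torch.rand(8, 32, 16)                               # 8 "images" as 32 flattened patches
z, logdet = flow(x)
loss = (ar.nll(z) - logdet).mean()                      # exact NLL of raw pixels, end to end
opt.zero_grad(); loss.backward(); opt.step()
```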
Some recent discussions made me write up a short read on how I think about doing computer vision research when there's clear potential for abuse.
Alternative title: why I decided to stop working on tracking.
Curious about others' thoughts on this.
lb.eyer.be/s/cv-ethics....
November 29, 2024 at 2:51 PM
hahahahhahaah holy cow, paligemma is good =)
November 28, 2024 at 1:18 PM
segment cow
not on the beach, but as you may know I like cows in "out of distribution" places and poses.
huggingface.co/spaces/big-v...
November 28, 2024 at 11:37 AM
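If you want to try the same prompt locally instead of via the Space, something along these lines should work with the Hugging Face transformers PaliGemma integration; the checkpoint id, image path, and token budget are assumptions, and the raw output is a string of loc/seg codes that still needs decoding into boxes and masks:

```python
# Hedged sketch: run a "segment cow" prompt with the transformers PaliGemma integration.
# Checkpoint id and image path are assumptions; adapt them to what you actually use.
import torch
from PIL import Image
from transformers import AutoProcessor, PaliGemmaForConditionalGeneration

model_id = "google/paligemma-3b-mix-224"   # assumed: a mix checkpoint that handles segment prompts
processor = AutoProcessor.from_pretrained(model_id)
model = PaliGemmaForConditionalGeneration.from_pretrained(model_id)

image = Image.open("cow.jpg")              # your out-of-distribution cow here
inputs = processor(text="segment cow", images=image, return_tensors="pt")

with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=64, do_sample=False)

# Decode only the newly generated tokens; the result is a sequence of <loc...>/<seg...>
# codes that a separate post-processing step turns into boxes and masks.
new_tokens = out[0][inputs["input_ids"].shape[-1]:]
print(processor.decode(new_tokens))
```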
Sorry, but I was strangely attracted... @5trange4ttractor.bsky.social =D
November 27, 2024 at 9:01 PM
Here's a fun real-life prompt where 30min of Googling didn't really help me.
The small/fast/mini chatbots all failed miserably.
The large ones work great (except Gemini).
Also:
1. I really like C3.6 giving only the answer and asking whether to explain
2. Wild that Chat understood the calls and added comments!
November 27, 2024 at 8:38 PM