https://gidariss.github.io/
Andrei Bursuc @abursuc.bsky.social
Anh-Quan Cao @anhquancao.bsky.social
Renaud Marlet
Eloi Zablocki @eloizablocki.bsky.social
@iccv.bsky.social
iccv.thecvf.com/Conferences/...
Andrei Bursuc @abursuc.bsky.social
Anh-Quan Cao @anhquancao.bsky.social
Renaud Marlet
Eloi Zablocki @eloizablocki.bsky.social
@iccv.bsky.social
iccv.thecvf.com/Conferences/...
Papers popped up on different platforms, but mainly on ResearchGate with ~80 papers in just 3 weeks.
[1/]
Awesome works in generative modeling, multi-token prediction, and future prediction.
Congratulations to all collaborators!
@nasosger.bsky.social, sta8is.bsky.social, @nicolabourbaki.bsky.social, @ikakogeorgiou.bsky.social & N. Komodakis!
Awesome works in generative modeling, multi-token prediction, and future prediction.
Congratulations to all collaborators!
@nasosger.bsky.social, sta8is.bsky.social, @nicolabourbaki.bsky.social, @ikakogeorgiou.bsky.social & N. Komodakis!
Papers popped up on different platforms, but mainly on ResearchGate with ~80 papers in just 3 weeks.
[1/]
Papers popped up on different platforms, but mainly on ResearchGate with ~80 papers in just 3 weeks.
[1/]
MOCA ☕ - Predicting Masked Online Codebook Assignments w/ @spyrosgidaris.bsky.social O. Simeoni, A. Vobecky, @matthieucord.bsky.social, N. Komodakis, @ptrkprz.bsky.social #TMLR #ICLR2025
Grab a ☕ & brace for a story & a🧵
MOCA ☕ - Predicting Masked Online Codebook Assignments w/ @spyrosgidaris.bsky.social O. Simeoni, A. Vobecky, @matthieucord.bsky.social, N. Komodakis, @ptrkprz.bsky.social #TMLR #ICLR2025
Grab a ☕ & brace for a story & a🧵
Introducing DIP: unsupervised post-training that enhances dense features in pretrained ViTs for dense in-context scene understanding
Below: Low-shot in-context semantic segmentation examples. DIP features outperform DINOv2!
Introducing DIP: unsupervised post-training that enhances dense features in pretrained ViTs for dense in-context scene understanding
Below: Low-shot in-context semantic segmentation examples. DIP features outperform DINOv2!
Paper : arxiv.org/abs/2506.11136
Project Page: jafar-upsampler.github.io
Github: github.com/PaulCouairon...
Paper : arxiv.org/abs/2506.11136
Project Page: jafar-upsampler.github.io
Github: github.com/PaulCouairon...
📅 Sat 14/6, 10:30-12:30
📍 Poster #395, ExHall D
📅 Sat 14/6, 10:30-12:30
📍 Poster #395, ExHall D
Presenting "Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers" on multi-modal semantic future prediction.
Come discuss!
Fri 13 Jun 10:30-12:30, poster #345
bsky.app/profile/sta8...
👇 Links to the arxiv and github below
Presenting "Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers" on multi-modal semantic future prediction.
Come discuss!
Fri 13 Jun 10:30-12:30, poster #345
bsky.app/profile/sta8...
– Low-level image details (via VAE latents)
– High-level semantic features (via DINOv2)🧵
– Low-level image details (via VAE latents)
– High-level semantic features (via DINOv2)🧵
Check them out 👇
Find out more below 🧵
valeoai.github.io/posts/2025-0...
Check them out 👇
📄 arxiv.org/abs/2502.09509
🐍 github.com/zelaki/eqvae
📄 arxiv.org/abs/2502.09509
🐍 github.com/zelaki/eqvae
🚀REPA: 4x training speedup
🚀MaskGIT: 2x training speedup
🚀DiT-XL/2: 7x faster convergence
Kudos @nicolabourbaki.bsky.social et al.
🚀REPA: 4x training speedup
🚀MaskGIT: 2x training speedup
🚀DiT-XL/2: 7x faster convergence
Kudos @nicolabourbaki.bsky.social et al.
It has a package and pretrained models!
🖥️ nicolas-dufour.github.io/cad.html
🤖 github.com/nicolas-dufo...
It has a package and pretrained models!
🖥️ nicolas-dufour.github.io/cad.html
🤖 github.com/nicolas-dufo...
Links to the arXiv and Github 👇
Links to the arXiv and Github 👇
This is also an excellent occasion to fit all team members in a photo 📸
Mark it in your agendas and also in your registration #cvpr2025
opendrivelab.com/cvpr2025/wor...
Mark it in your agendas and also in your registration #cvpr2025
opendrivelab.com/cvpr2025/wor...