Excited to announce our new work: "Large-scale Pre-training for Grounded Video Caption Generation" with Cordelia Schmid & @josef-sivic.bsky.social.
Paper: arxiv.org/abs/2503.10781
Project: ekazakos.github.io/grounded_vid...
Code (coming soon): github.com/ekazakos/grove 1/7
We will release code, models and datasets within the next 2 weeks.
We are also working on a search demo for the proposed datasets with user prompts!
I hope to see you all in Honolulu!
TL;DR: Spiritual successor to CroCo with a simpler multi-view objective and larger scale. Beats DINOv3 and CroCo v2 in RoMa, feedforward reconstruction, and rel. pose.
arxiv.org/abs/2511.17309
github.com/davnords/mum
I LOVE THE CZECH REPUBLIC! 🇨🇿
⚡100x Training Throughput
🎯Fast Convergence
🔢Pure Int8 Pretraining of RNN LLMs
Large Behavior Models (LBM) by TRI
Presented by Adrien
toyotaresearchinstitute.github.io/lbm1/
#AcademicSky ⚗️ 🧪
- Latent long/short-term memory
- Continual learning on experience (not datasets)
- Exploration and information gathering
- Counterfactual world models from sensors
- Sensory abstraction facilitating reasoning
- Long-horizon planning
My personal reaction is no. We've made tremendous progress scaling and improving distributional learning & other existing solutions, but not on cracking hard open problems.
DreamCoder-like robot skill learning. Refactoring helps!
PDF: arxiv.org/abs/2406.18746
developers.googleblog.com/en/unlocking...
To improve system stability and provide a clearer submission process, we have just introduced 2 new deadlines that are now separate from the Abstract and the Paper Submission deadlines.
cvpr.thecvf.com/Conferences/...
“‘Rest easy, king,’ read the final message sent to his phone. ‘You did good.’”