Sayan Deb Sarkar
@sayandsarkar.bsky.social
320 followers 400 following 15 posts
PhD in 3D Vision @Stanford | MSc CS @ETH | Ex @Qualcomm, @MercedesBenz W: sayands.github.io
sayandsarkar.bsky.social
🗓️ Thursday 12 June 3:00 p.m. - 3:45 p.m. CDT
📍 OpenSun3D Workshop Poster Session Arch 211

(2/3)
sayandsarkar.bsky.social
✨ Excited to head off to Nashville for #CVPR2025

🎤 Catch me at the poster sessions or just come say hi to grab ☕

🗓️ Friday 13 June 4:00 p.m. - 6:00 p.m. CDT
📍 Poster Session #2 — Exhibit Hall D Highlight Poster #346

(1/3)
sayandsarkar.bsky.social
🎉 Excited to share our latest work, CrossOver: 3D Scene Cross-Modal Alignment, accepted to #CVPR2025 🌐✨

We learn a unified, modality-agnostic embedding space, enabling seamless scene-level alignment across multiple modalities — no semantic annotations needed!🚀
Reposted by Sayan Deb Sarkar
jianhao97.bsky.social
🥳Excited to share our latest work, WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments, accepted to #CVPR2025 🌐

We present a robust monocular RGB SLAM system that uses uncertainty-aware tracking and mapping to handle dynamic scenes.
sayandsarkar.bsky.social
🏆 CrossOver is accepted as a 𝗛𝗶𝗴𝗵𝗹𝗶𝗴𝗵𝘁 at #CVPR2025! ✨
💻 Fully open-sourced code with all pre-trained checkpoints: github.com/GradientSpac...

📡 Stay tuned for a deep-dive thread and what else we are cooking 🍳
sayandsarkar.bsky.social
But the multimodal problem is the same as in image generation tasks: what is the perfect 3D scan given a text input?
sayandsarkar.bsky.social
In this case, what would be a definitive ground truth?
sayandsarkar.bsky.social
Thanks for sharing our work! Yes, I think that’d be a pretty neat downstream application, but it is maybe more multimodal generation than reconstruction.
Reposted by Sayan Deb Sarkar
andreaspsteiner.bsky.social
🚀🚀PaliGemma 2 is our updated and improved PaliGemma release using the Gemma 2 models and providing new pre-trained checkpoints for the full cross product of {224px,448px,896px} resolutions and {3B,10B,28B} model sizes.

1/7
sayandsarkar.bsky.social
Would love to be added! I’m a PhD student working on 3D scene understanding and spatial AI.
sayandsarkar.bsky.social
Could you add me? I’m a PhD student working on 3D scene understanding.