Tengda Han
@tengda.bsky.social
62 followers 55 following 10 posts
Researcher at Google DeepMind. Computer vision and machine learning.
Posts Media Videos Starter Packs
tengda.bsky.social
Organized by: Junyu Xie, Ridouane Ghermi, @tengda.bsky.social, Max Bain, Arsha Nagrani, @vickykalogeiton.bsky.social, @gulvarol.bsky.social, Weidi Xie, Ivan Laptev and Andrew Zisserman.

See you in Hawaii! 🌺
tengda.bsky.social
We’re excited to have a fantastic lineup of speakers:
@amypavel.bsky.social, Anna Rohrbach, Mike Zheng Shou, Makarand Tapaswi. We’ll also host a panel discussion with the organizers!
tengda.bsky.social
Movies are more than just video clips, they are stories! 🎬

We’re hosting the 1st SLoMO Workshop at #ICCV2025 to discuss Story-Level Movie Understanding & Audio Descriptions!

Website: slomo-workshop.github.io
Competition: huggingface.co/spaces/SLoMO...
tengda.bsky.social
Thank @dimadamen.bsky.social for presenting our Orthogonal Optimizer! It’s a simple modification on standard optimizers for streaming video learning. We have code available at sites.google.com/view/orthogo...
dimadamen.bsky.social
Now @cvprconference.bsky.social poster session 3 #286
Our GoogleDeepMind paper:
Learning from Streaming Video with Orthogonal Gradients
As @tengda.bsky.social couldn’t make it for visa reasons, you’ll have the second best option of me presenting our work 😅
See you there #CVPR2025
tengda.bsky.social
Humans learn from one continuous visual stream, but large video models have to be trained on billions of web videos.
We found that learning from such sequential streams is challenging for video models—and we introduce a family of "orthogonal optimizers" to bridge the gap!
tengda.bsky.social
It's interesting to see that visual counting remains to be quite challenging for generalist AI models. But this specialist model counts very well. Nice work from @nikigoliai.bsky.social last year!
nikigoliai.bsky.social
I was recently really excited to find out many people have been successfully using our CountGD model (NeurIPS'24) for products, open-source tools and science applications.

Annolid:
www.youtube.com/watch?v=CQvP...

LandingAI
landing.ai/blog/simplif...

Saiwa
saiwa.ai/landing/trai...

Dhruva Space
Count Anything! Object Counting & Segmentation with Annolid & CountGD!
YouTube video by Annolid
www.youtube.com
tengda.bsky.social
We are looking for a student researcher to work on video understanding plus 3D, in Google DeepMind London. DM/Email me or pass it to someone if you feel it may be a good fit!
tengda.bsky.social
How do you know he is not 🤔😆
Reposted by Tengda Han
dimadamen.bsky.social
From an award candidate... to best paper #ACCV2024
Glad to share that "It's Just Another Day" received the top award at the conference.
@bristoluni.bsky.social @ox.ac.uk

This paper is worth reading :-) based on the reviewers, AC and awards committee. We thank them for their time and effort.