Lightnews — Scholar-powered news

Haoliang Wang

@wanghaoliang.bsky.social

17 followers 27 following 9 posts

Postdoc at MIT studying intuitive physics with Prof. Josh Tenenbaum | https://haoliangwang.github.io

haoliangwang.github.io

Posts Media Videos Starter Packs

Haoliang Wang @wanghaoliang.bsky.social · Jul 29

A big thank you to my co-authors: @nishadg.bsky.social, @joshtenenbaum.bsky.social and @thiskevinsmith.bsky.social, see us at #CogSci2025 @cogscisociety.bsky.social

Haoliang Wang @wanghaoliang.bsky.social · Jul 29

In ongoing work, we are exploring ways to incorporate more physics knowledge into the inverse-graphics framework to model people’s inference about physical properties, object’s motion under occlusion, etc.

Haoliang Wang @wanghaoliang.bsky.social · Jul 29

We found that this image-computable model performs as well as the previous state-based model, and also better predicts participants’ behavioral patterns than alternative models.

Haoliang Wang @wanghaoliang.bsky.social · Jul 29

Here’s an example of how the model (right) tracks the object in the video (left) as it moves and continues simulating after the video ends.

Haoliang Wang @wanghaoliang.bsky.social · Jul 29

In this work, we treat vision as inverse graphics and develop a model that infers a distribution over object states from raw visual input. This uncertainty is then passed to a probabilistic physics simulator to generate predictions about what will happen next.

Haoliang Wang @wanghaoliang.bsky.social · Jul 29

Some theories suggest we run mental simulations to predict physical events. But it's unclear how people figure out what's in a scene just by looking — and how that affects their physical predictions. Most current models assume the 3D states (poses, velocities, etc.) of objects are already known.

Haoliang Wang @wanghaoliang.bsky.social · Jul 29

We found that people did a good job at this task (80% accuracy)! And more importantly, they were highly consistent with each other (95% inter-participant correlation), meaning they not only made good predictions but also similar errors. What explains this interesting behavior?

Haoliang Wang @wanghaoliang.bsky.social · Jul 29

We asked participants to predict whether an object would touch another after viewing a short video clip. The videos feature realistic 3D scenarios generated using a game engine (see my CogSci paper from last year for more details! tinyurl.com/43fh62yt).

1 1

Haoliang Wang @wanghaoliang.bsky.social · Jul 29

Can you tell if a tower will fall or if two objects will collide — just by looking? 🧠👀 Come check out my #CogSci2025‪ poster (P1-W-207) on July 31, 13:00–14:15 PT to learn how people do general-purpose physical reasoning from visual input!

1 4 13