Alex Chen
@chenalexander.bsky.social
Creative Director at Google Creative Lab. Exploring multimodal AI. Opinions are my own. https://chenalexander.com/
How it works: I prompt Gemini with the first frame of a video to give me (x, y) coordinates with descriptions. Then I use optical flow to track those positions over time. Of course optical flow has limitations, but it was a fast, lightweight way to prototype the idea.
January 30, 2026 at 9:47 PM
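Roughly what that tracking step can look like with opencv.js (a sketch of the idea, not the actual app code): Gemini labels the points in frame 0, and pyramidal Lucas-Kanade optical flow carries them forward frame by frame.

```typescript
// Sketch only: Gemini supplies labeled (x, y) points for frame 0, and opencv.js
// Lucas-Kanade optical flow tracks them into each new frame.
declare const cv: any; // opencv.js module

type LabeledPoint = { x: number; y: number; label: string };

function trackPoints(
  prevGray: any,          // cv.Mat, grayscale previous frame
  nextGray: any,          // cv.Mat, grayscale current frame
  points: LabeledPoint[]
): LabeledPoint[] {
  // Pack the (x, y) coordinates into a CV_32FC2 Mat, one point per row.
  const flat = points.flatMap(p => [p.x, p.y]);
  const prevPts = cv.matFromArray(points.length, 1, cv.CV_32FC2, flat);
  const nextPts = new cv.Mat();
  const status = new cv.Mat();
  const err = new cv.Mat();
  const winSize = new cv.Size(15, 15);
  const criteria = new cv.TermCriteria(
    cv.TERM_CRITERIA_EPS | cv.TERM_CRITERIA_COUNT, 10, 0.03
  );

  // Pyramidal Lucas-Kanade flow from the previous frame to the current one.
  cv.calcOpticalFlowPyrLK(prevGray, nextGray, prevPts, nextPts, status, err, winSize, 2, criteria);

  // Keep each label attached to its tracked position; drop points that were lost.
  const tracked: LabeledPoint[] = [];
  for (let i = 0; i < points.length; i++) {
    if (status.data[i] === 1) {
      tracked.push({
        x: nextPts.data32F[i * 2],
        y: nextPts.data32F[i * 2 + 1],
        label: points[i].label,
      });
    }
  }

  [prevPts, nextPts, status, err].forEach(m => m.delete());
  return tracked;
}
```

Calling this once per frame (and reusing the current frame as the next call's previous frame) is what keeps it lightweight enough for quick prototyping.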
I made it with Gemini spatial intelligence + opencv.js in Google AI Studio. Link here: ai.studio/apps/drive/1... It's been fun testing it on old videos from my photo library, as Gemini is able to uncover new details.
January 30, 2026 at 9:47 PM
Visualizing Gemini intelligence with things in my life 🔊🧵
January 30, 2026 at 9:47 PM
Gemini spatial understanding + opencv.js 🔊
January 29, 2026 at 4:34 PM
Testing streaming vector shapes with Gemini 3 Flash. Really fast. Realtime screencap ⚡🔊
January 22, 2026 at 10:00 PM
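A minimal sketch of the idea, assuming the @google/genai SDK and a prompt that returns SVG path markup as it streams; the model id and the parsing here are my guesses, not the actual app code.

```typescript
// Sketch: stream a Gemini response and draw each SVG <path> as soon as its
// markup has fully arrived, so shapes appear while the model is still writing.
import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({ apiKey: "YOUR_API_KEY" });
const svg = document.querySelector("svg")!;

async function streamShapes(prompt: string) {
  const stream = await ai.models.generateContentStream({
    model: "gemini-3-flash", // assumed model id, based on the post
    contents: prompt,
  });

  let buffer = "";
  for await (const chunk of stream) {
    buffer += chunk.text ?? "";
    // Render any <path .../> elements that have completely arrived so far.
    for (const match of buffer.match(/<path[^>]*\/>/g) ?? []) {
      svg.insertAdjacentHTML("beforeend", match);
    }
    buffer = buffer.replace(/<path[^>]*\/>/g, "");
  }
}
```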
I always ask Gemini to discuss ideas with me before coding, so I can craft the details I really care about - the sine wave, the wobbliness, the amplitude ... 🥁
January 20, 2026 at 10:24 PM
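For illustration only (not the prototype code): a tiny canvas sketch of those knobs - a base sine wave with an amplitude, plus a wobble term layered on top.

```typescript
// Illustrative only: a base sine wave plus a slower "wobble" perturbation,
// with the amplitude and wobble exposed as tunable parameters.
const canvas = document.querySelector("canvas")!;
const ctx = canvas.getContext("2d")!;

const amplitude = 40;      // height of the main wave, in pixels
const wavelength = 120;    // horizontal period of the main wave
const wobbleAmount = 8;    // how far the wobble perturbs the wave
const wobbleSpeed = 0.004; // how fast the wobble drifts over time

function draw(t: number) {
  ctx.clearRect(0, 0, canvas.width, canvas.height);
  ctx.beginPath();
  for (let x = 0; x <= canvas.width; x++) {
    const base = Math.sin((x / wavelength) * Math.PI * 2 + t * 0.002) * amplitude;
    const wobble = Math.sin(x * 0.05 + t * wobbleSpeed) * wobbleAmount;
    const y = canvas.height / 2 + base + wobble;
    x === 0 ? ctx.moveTo(x, y) : ctx.lineTo(x, y);
  }
  ctx.stroke();
  requestAnimationFrame(draw);
}
requestAnimationFrame(draw);
```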
Video has sound 🔊
January 20, 2026 at 10:24 PM
Sketch ➡️ animation ✏️ Multimodal prompting has been really powerful for this drum machine prototype ... 🧵
January 20, 2026 at 10:24 PM
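A rough sketch of what a multimodal prompt like that can look like with the @google/genai SDK - the drawing goes in as inline image data next to the text. The model id and prompt wording are assumptions, not the actual app code.

```typescript
// Sketch: send the hand-drawn sketch (as a base64 PNG) together with a text
// prompt, so Gemini can describe the drawing as an animation to implement.
import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({ apiKey: "YOUR_API_KEY" });

async function sketchToAnimation(pngBase64: string) {
  const response = await ai.models.generateContent({
    model: "gemini-3-flash", // assumed model id
    contents: [
      {
        role: "user",
        parts: [
          { inlineData: { mimeType: "image/png", data: pngBase64 } },
          { text: "Describe this drum machine sketch as an animation: what moves, how fast, and in what order." },
        ],
      },
    ],
  });
  return response.text;
}
```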
Porting features with language ... 🧵
January 20, 2026 at 6:22 PM
Vibe coding an interface with Gemini, inspired by this fun 1960s drum machine.
January 20, 2026 at 3:07 PM
Polyrhythms are especially easy and fun to make. Song link: alexanderchen.github.io/typeloop/?so...
January 16, 2026 at 9:23 PM
Every song is represented as a string of text, so you can share it by just copy-pasting or making a link. Here are all the sounds you can try. 🥁
January 16, 2026 at 9:23 PM
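To illustrate the idea (the real format is in the open-source repo - the character mapping and URL parameter below are made up): each character is one step in the loop, and the whole song fits in a link.

```typescript
// Illustration only: a song is just a string, so sharing it is copy-paste or a URL.
// This character-to-sound mapping is invented for the example.
const SOUNDS: Record<string, string> = {
  k: "kick",
  s: "snare",
  h: "hihat",
  "-": "rest",
};

function parseSong(song: string): string[] {
  // Each character becomes one step in the loop.
  return [...song].map(ch => SOUNDS[ch] ?? "rest");
}

function shareLink(song: string): string {
  // The whole song travels in the URL (query parameter name is hypothetical).
  return `https://alexanderchen.github.io/typeloop/?song=${encodeURIComponent(song)}`;
}

console.log(parseSong("k-h-s-h-k-h-s-h"));
console.log(shareLink("k-h-s-h-k-h-s-h"));
```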
Type Loop 🎵⌨️ Create and share music by typing! Play here: alexanderchen.github.io/typeloop/ 🔊 Open-source. Built w/ Gemini. 🧵 #genuary #genuary10
January 16, 2026 at 9:23 PM
Dot Motion 🔴🎵 A simple way to explore motion, time, and sound. Code is open-source. Video has 🔊 Link in 🧵 #genuary #genuary2
January 7, 2026 at 10:05 PM
You can click "+" to add more agents. (Fun to watch the parallel conversations all unfold)
January 7, 2026 at 4:53 PM
Each agent remembers past conversations and uses that memory to decide on a goal.
January 7, 2026 at 4:50 PM
Information is passed through back-and-forth conversations like this one.
January 7, 2026 at 4:50 PM
TinyTown 🙂🍄 Lightweight multi-agent social simulation built w/ Gemini 3 Flash. Agents look for 🍄. When one finds it, it tells the others the coordinates. Not sure where I'm taking this prototype next (open to ideas!). Code is open-source ➡️ ai.studio/apps/drive/1...
January 7, 2026 at 4:50 PM
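A boiled-down sketch of the loop this thread describes (in the real app each conversation turn is presumably a Gemini 3 Flash call; this only shows the memory and coordinate-sharing shape, and all names are made up):

```typescript
// Sketch: each agent keeps a memory of past conversations, picks a goal from
// that memory, and whoever finds the mushroom tells the others its coordinates.
type Message = { from: string; text: string };

class Agent {
  memory: Message[] = [];
  knownMushroom: { x: number; y: number } | null = null;

  constructor(public name: string) {}

  // Goal selection: head for the mushroom if any remembered conversation
  // mentioned its coordinates, otherwise keep wandering and searching.
  decideGoal(): string {
    return this.knownMushroom
      ? `walk to (${this.knownMushroom.x}, ${this.knownMushroom.y})`
      : "wander and search";
  }

  // Back-and-forth conversation: store the message, extract coordinates if present.
  hear(msg: Message) {
    this.memory.push(msg);
    const match = msg.text.match(/\((\d+),\s*(\d+)\)/);
    if (match) {
      this.knownMushroom = { x: Number(match[1]), y: Number(match[2]) };
    }
  }

  // When this agent finds the mushroom, it passes the coordinates to the others.
  announceFind(x: number, y: number, others: Agent[]) {
    this.knownMushroom = { x, y };
    for (const other of others) {
      other.hear({ from: this.name, text: `I found the mushroom at (${x}, ${y})!` });
    }
  }
}
```

Adding more agents (the "+" button in the post above) just means more Agent instances running this same remember-decide-converse loop in parallel.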