Prem Seetharaman
@pseeth.bsky.social
Researcher in computer audition, machine learning, and HCI. Sr. Research Scientist, @AdobeResearch. Previously @DescriptApp, @Northwestern. https://pseeth.github.io/
pseeth.bsky.social
Fun model to work on! More fun stuff to come!
justinsalamon.bsky.social
Generative Extend just released in Premiere Pro! Use GenAI to extend your video *and audio* clips for a perfectly timed edit.

The audio model was built by our team, the Sound Design AI (SODA) group at Adobe Research w/ @pseeth.bsky.social and @urinieto.bsky.social 🙌

www.youtube.com/watch?v=_Bv5...
Generative Extend | Premiere Pro 2025 Updates | Adobe Video
YouTube video by Adobe Video & Motion
www.youtube.com
pseeth.bsky.social
good story from someone completely unrelated to me i swear
deepa.bsky.social
We’ve been hearing so much about reasoning lately but what happened to OpenAI’s big project: GPT-5 aka Orion?

It was to be a “significant leap forward” per Altman. Microsoft expected to see GPT-5 in mid-2024, per sources. But one problem after another popped up which…

www.wsj.com/tech/ai/open...
The Next Great Leap in AI Is Behind Schedule and Crazy Expensive
The startup has run into problem after problem on its new artificial-intelligence project, code-named Orion.
www.wsj.com
pseeth.bsky.social
👀
jonathanleroux.bsky.social
@waspaa.com is on Bluesky!
Big changes are coming for WASPAA 2025!
waspaa.com
WASPAA is moving, for the first time in its almost 40-year history. Stay tuned for the announcement of the new venue!
pseeth.bsky.social
neat - i think all these spaces are basically a linear layer / permutation away from each other. with one codebook (or a vae setup) you could maybe just solve it with the embedding matrices directly, no audio needed
pseeth.bsky.social
Great work from @hugofloresgarcia.bsky.social’s internship at Adobe - turn your voice into basically anything!
hugofloresgarcia.bsky.social
new paper! 🗣️Sketch2Sound💥

Sketch2Sound can create sounds from sonic imitations (i.e., a vocal imitation or a reference sound) via interpretable, time-varying control signals.

paper: arxiv.org/abs/2412.08550
web: hugofloresgarcia.art/sketch2sound
Reposted by Prem Seetharaman
kris.art
Introducing MultiFoley, a video-aware audio generation method with multimodal controls! 🎉

⌨️Make a typewriter sound like a piano 🎹
🐱Make a cat meow like a lion roars! 🦁
⏱️Perfectly time existing SFX 💥 to a video

Link to research in comments:
by Adobe Research
pseeth.bsky.social
Check out our new work on video-guided audio gen with a focus on fine-grained creative control! Done by @czyang.bsky.social during an internship with our group at Adobe Research. Super fun model!
czyang.bsky.social
🎥 Introducing MultiFoley, a video-aware audio generation method with multimodal controls! 🔊
We can
⌨️Make a typewriter sound like a piano 🎹
🐱Make a cat meow like a lion roars! 🦁
⏱️Perfectly time existing SFX 💥 to a video.

arXiv: arxiv.org/abs/2411.17698
website: ificl.github.io/MultiFoley/
Reposted by Prem Seetharaman
sniklaus.com
A nifty application of depth estimation, creating a mockup of a digital design on real-world objects: sniklaus.com/mockup
pseeth.bsky.social
Here's one that seems to catch a bit more "thread-like" content, sorts by recency instead of likes, and drops arxiv bots: bsky.app/profile/psee.... Seems to work ok for now, and catches some non-ML threads too
pseeth.bsky.social
Made a feed that tries to index paper threads only: bsky.app/profile/psee.... To get into the feed, make a post with "arxiv.org" in the post somewhere + don't be a bot. My tiny contribution to the recent migration! Built w/ @skyfeed.app. Planning on some paper threads of my own soon...
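The inclusion rule in this post (the text contains "arxiv.org" and the author is not a bot) can be sketched as a small filter. This is only an illustration of the stated rule, not how the feed was built: the feed was made with SkyFeed, and the `Post` class, the `is_bot` flag, and the example handles below are hypothetical names made up for this sketch. How bots are actually detected is not stated in the post.

```python
from dataclasses import dataclass


@dataclass
class Post:
    """Hypothetical stand-in for a post; not a real Bluesky or SkyFeed type."""
    author: str
    text: str
    is_bot: bool = False  # assumption: bot status is already known somehow


def in_paper_feed(post: Post) -> bool:
    """Return True if the post mentions arxiv.org and is not from a bot."""
    return "arxiv.org" in post.text.lower() and not post.is_bot


# Example posts with made-up handles, for illustration only.
posts = [
    Post("alice.example", "new paper! arxiv.org/abs/2412.08550"),
    Post("bob.example", "nice weather today"),
    Post("papers-bot.example", "arxiv.org/abs/2411.17698", is_bot=True),
]

# Keep only the posts that satisfy the rule.
feed = [p for p in posts if in_paper_feed(p)]
print([p.author for p in feed])
```

Matching on the lowercased text is a choice made for this sketch so that "arXiv.org" would also match; the post does not say whether the real feed is case-sensitive.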
Reposted by Prem Seetharaman
kashyap7x.bsky.social
For those of you who haven't yet, give scholar-inbox.com a try! It's a free personal paper recommender which helps you stay up-to-date by sending daily/weekly paper digests directly to your inbox. Your votes train your own classifier, and you can have a peek at its feature words. Here are mine!
Reposted by Prem Seetharaman
jonathanleroux.bsky.social
I initiated a starter pack for Audio ML. Let me know if you'd like to be added/removed.
go.bsky.app/LGmct4z