hugofloresgarcía
@hugofloresgarcia.bsky.social
270 followers 140 following 5 posts
human computer musical instruments
https://hugofloresgarcia.art/
phd candidate @northwestern, research intern @adobe, prev @spotify, @descript
chicago // honduras
Reposted by hugofloresgarcía
iammelsmith.bsky.social
Black music powered every significant musical movement.

Every.
hugofloresgarcia.bsky.social
thank u ben i tried i practiced 🙂‍↕️🙏❤️
Reposted by hugofloresgarcía
pseeth.bsky.social
Great work from @hugofloresgarcia.bsky.social’s internship at Adobe - turn your voice into basically anything!
hugofloresgarcia.bsky.social
new paper! 🗣️Sketch2Sound💥

Sketch2Sound can create sounds from sonic imitations (i.e., a vocal imitation or a reference sound) via interpretable, time-varying control signals.

paper: arxiv.org/abs/2412.08550
web: hugofloresgarcia.art/sketch2sound
Reposted by hugofloresgarcía
arxiv-sound.bsky.social
Sketch2Sound generates audio from time-varying control signals (loudness, brightness, pitch) and text prompts, using sonic imitations; it uses random median filters on control signals during training for flexible temporal specificity.
Sketch2Sound: Controllable Audio Generation via Time-Varying Signals and Sonic Imitations
Hugo Flores García, Oriol Nieto, Justin Salamon, Bryan Pardo, Prem Seetharaman
arxiv.org
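
The summary above mentions random median filters applied to the control signals during training. A minimal sketch of that idea follows, assuming a 1-D control curve (for example, per-frame loudness) stored as a plain list. The function names, the edge padding, and the `max_window` default are hypothetical illustrations, not details taken from the paper.

```python
import random
from statistics import median


def median_filter(signal, window):
    """Median-filter a 1-D sequence using an odd window size.

    Edges are handled by repeating the first and last values
    (an assumption made for this sketch).
    """
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    if not signal:
        return []
    half = window // 2
    padded = [signal[0]] * half + list(signal) + [signal[-1]] * half
    return [median(padded[i:i + window]) for i in range(len(signal))]


def random_median_filter(signal, max_window=9, rng=random):
    """Apply a median filter whose odd window size is drawn at random.

    A window of 1 leaves the signal untouched; larger windows remove
    fine temporal detail. max_window=9 is a hypothetical default.
    """
    window = rng.choice(range(1, max_window + 1, 2))
    return median_filter(signal, window), window


if __name__ == "__main__":
    loudness = [0.1, 0.1, 0.9, 0.1, 0.1, 0.5, 0.6, 0.5]
    smoothed, window = random_median_filter(loudness)
    print(window, smoothed)
```

Drawing a different window size for each training example would expose the model to control curves ranging from detailed to coarse, which is one plausible reading of "flexible temporal specificity".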
hugofloresgarcia.bsky.social
here's a guitar -> sfx example!

Sketch2Sound can be implemented on top of any text-to-audio DiT and requires 40k steps of fine-tuning and a single linear layer per control!

in collaboration w/
@urinieto.bsky.social, @justinsalamon.bsky.social, #bryanpardo and the legendary @pseeth.bsky.social!