takehika
takehika.bsky.social
takehika
@takehika.bsky.social
Specializing in audio tech development, leveraging AI/ML and data analysis. Freelancer in Japan.
Digital sound from Sora 2.

Theme: "motion against nature’s vastness"
October 31, 2025 at 4:48 AM
Digital audio from Sora 2.

It feels like there's still plenty of room for improvement after some testing.
October 31, 2025 at 4:40 AM
microsoft/VibeVoice-1.5B on Hugging Face.
huggingface.co/microsoft/Vi...

It can generate multi-speaker conversational audio from text, supporting up to 4 speakers and 90 minutes.

You can even try it yourself with the demo page on GitHub.
August 27, 2025 at 5:07 AM