takehika
@takehika.bsky.social
Specializing in audio tech development, leveraging AI/ML and data analysis. Freelancer in Japan.
Digital sound from Sora 2.
Theme: "motion against nature’s vastness"
October 31, 2025 at 4:48 AM
Digital audio from Sora 2.
It feels like there's still plenty of room for improvement after some testing.
October 31, 2025 at 4:40 AM
microsoft/VibeVoice-1.5B on Hugging Face.
huggingface.co/microsoft/Vi...
It can generate multi-speaker conversational audio from text, supporting up to 4 speakers and up to 90 minutes of audio.
You can even try it yourself with the demo page on GitHub.
August 27, 2025 at 5:07 AM