I work on music and ML.
havenpersona.github.io
Based on adversarial post-training, it does not rely on distillation or CFG
Runtime is reduced to milliseconds with GPUs or seconds with CPUs
Weights huggingface.co/stabilityai/...
Blog stability.ai/news/stabili...
Paper arxiv.org/abs/2505.08175
Based on adversarial post-training, it does not rely on distillation or CFG
Runtime is reduced to milliseconds with GPUs or seconds with CPUs
Weights huggingface.co/stabilityai/...
Blog stability.ai/news/stabili...
Paper arxiv.org/abs/2505.08175
🔍We explored a new task of generating teasers for long documentaries.
🤩We presented a new dataset, new models, and new evaluation metrics for teaser generation.
🔍We explored a new task of generating teasers for long documentaries.
🤩We presented a new dataset, new models, and new evaluation metrics for teaser generation.
Have you ever wondered what an open model means? Help us shape the definition of open models in generative AI for music by taking our survey — just 10 minutes!
👉 forms.gle/Z48t6HPBXwWC3r…
thank you 💫
Have you ever wondered what an open model means? Help us shape the definition of open models in generative AI for music by taking our survey — just 10 minutes!
👉 forms.gle/Z48t6HPBXwWC3r…
thank you 💫
go.bsky.app/PBvFCxa
go.bsky.app/PBvFCxa