alphacep.bsky.social
@alphacep.bsky.social
Reposted
WavChat: Surveyed existing spoken dialogue systems chronologically, categorized them, and reviewed core technologies, datasets, metrics, and benchmarks; identified limitations and future research directions.
WavChat: A Survey of Spoken Dialogue Models
Shengpeng Ji, Yifu Chen, Minghui Fang, Jialong Zuo, Jingyu Lu, Hanting Wang, Ziyue Jiang, Long Zhou, Shujie Liu, Xize Cheng, Xiaoda Yang, Zehan Wang, Qian Yang, Jian Li, Yidi Jiang, Jingzhen He, Yunfei Chu, Jin Xu, Zhou Zhao
arxiv.org
November 22, 2024 at 7:06 AM
SANE 2024 videos, with interesting talks on speech synthesis, diarization, etc.

www.youtube.com/playlist?lis...
SANE 2024 @ Google Cambridge - YouTube
SANE 2024, a one-day event gathering researchers and students in speech and audio from the Northeast of the American continent, was held on Thursday October ...
www.youtube.com
November 23, 2024 at 1:02 AM