Our lab is committed to cutting-edge research in speech generation, spoken dialogue systems, and spatial audio generation. We strive to develop intelligent, natural, and immersive audio technologies that advance human–machine interaction and multimedia experiences.
MM-Speech
Popular repositories Loading
-
DiTReducio
DiTReducio PublicThe project page of DiTReducio (Accepted by ACL 2026 Findings)
-
DualAxisRM
DualAxisRM PublicThe project page of DualAxisRM (Accepted by ACL 2026 Main Conference)
Python 11
-
SwanBench-Speech
SwanBench-Speech PublicThe project page of SwanBench-Speech (Accepted by ACL 2026 Findings)
-
SDiaReward
SDiaReward PublicOfficial repository for "SDiaReward" (ACL 2026 Main) . An end-to-end multi-turn reward model for spoken dialogue systems.
Repositories
- TMD-Bench Public
[ICML 2026] TMD-Bench: A Multi-Level Evaluation Paradigm for Music-Dance Co-Generation
MM-Speech/TMD-Bench’s past year of commit activity - emo-tts Public
MM-Speech/emo-tts’s past year of commit activity - WavAlign Public
Official repository for "WavAlign" (ACL 2026 Findings) . An Post-Train Framework for spoken dialogue models.
MM-Speech/WavAlign’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…