Skip to content
@MM-Speech

MM-Speech

Welcome to MM-Speech 👋

Our lab is committed to cutting-edge research in speech generation, spoken dialogue systems, and spatial audio generation. We strive to develop intelligent, natural, and immersive audio technologies that advance human–machine interaction and multimedia experiences.

Popular repositories Loading

  1. VoxMind VoxMind Public

    The project page of VoxMind (Accepted by ACL 2026 Main Conference)

    Python 32 3

  2. DiTReducio DiTReducio Public

    The project page of DiTReducio (Accepted by ACL 2026 Findings)

    Python 12 1

  3. DualAxisRM DualAxisRM Public

    The project page of DualAxisRM (Accepted by ACL 2026 Main Conference)

    Python 11

  4. SwanBench-Speech SwanBench-Speech Public

    The project page of SwanBench-Speech (Accepted by ACL 2026 Findings)

    10

  5. WavAlign WavAlign Public

    Official repository for "WavAlign" (ACL 2026 Findings) . An Post-Train Framework for spoken dialogue models.

    Python 7 2

  6. SDiaReward SDiaReward Public

    Official repository for "SDiaReward" (ACL 2026 Main) . An end-to-end multi-turn reward model for spoken dialogue systems.

    Python 6 1

Repositories

Showing 10 of 12 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…