Interspeech 2025 Multilingual Conversational Speech Language Model (MLC-SLM) Challenge


Mar 20, 2025 - 09:40

The Multilingual Conversational Speech Language Model (MLC-SLM) Challenge is now open as a satellite event of Interspeech 2025!

Hosted by Meta, Google, Samsung Electronics, NAVER Corp, China Mobile, Northwestern Polytechnical University and Nexdata, this challenge aims to advance multilingual conversational AI by developing cutting-edge speech language models and providing access to a real-world multilingual conversational speech dataset.

The challenge consists of two tasks, both of which require participants to explore the development of speech language models (SLMs):

Task I: Multilingual Conversational Speech Recognition

Objective: Develop a multilingual LLM-based ASR model. Participants will be provided with oracle segmentation and speaker labels for each conversation.

Task II: Multilingual Conversational Speech Diarization and Recognition

Objective: Develop a system for both speaker diarization (identifying who is speaking when) and recognition (transcribing speech to text). No prior or oracle information, such as pre-segmented utterances or speaker labels, will be provided during evaluation. Both pipeline-based and end-to-end systems are encouraged, giving participants flexibility in system design and implementation.

The training set (Train) comprises 11 languages: English (en), French (fr), German (de), Italian (it), Portuguese (pt), Spanish (es), Japanese (jp), Korean (ko), Russian (ru), Thai (th), Vietnamese (vi).

Important Dates (AoE, Anywhere on Earth)

March 10, 2025: Registration opens

March 15, 2025: Training data release

March 20, 2025: Development set and baseline system release

May 15, 2025: Evaluation set release and Leaderboard open

May 30, 2025: Leaderboard freeze and paper submission portal opens (CMT system)

June 15, 2025: Paper submission deadline

July 1, 2025: Notification of acceptance

August 18, 2025: Workshop date

We have set a total prize pool of $20,000 for the winners. Based on performance, the top three teams in each task will be awarded:

1st Prize: $5,000

2nd Prize: $3,000

3rd Prize: $2,000