[Feature] Add WenetSpeech #829

YichenG170 · 2025-09-18T17:09:09Z

This PR adds comprehensive support for parts of WenetSpeech datasets to lmms-eval.

🔧 Core Implementation
New Dataset: WenetSpeech
2 New Splits with around 25h testing audio data

📊 Example Usage
TASK="wenet_speech"
TASK_SUFFIX="${TASK//,/_}"

🧩 File Changes
New:
wenet_speech/utils.py - Evaluation methods for each subset
4 YAML files - Detailed evaluation information for each task & task group

…rompts

kcz358 · 2025-09-26T02:10:59Z

lmms_eval/tasks/wenet_speech/utils.py

+from lmms_eval.llm_judge import ServerConfig, get_server
+
+API_TYPE = os.getenv("API_TYPE", "openai")
+# Use JUDGE_MODEL_VERSION instead of MODEL_VERSION
+JUDGE_MODEL_VERSION = os.getenv("JUDGE_MODEL_VERSION", "gpt-4o-mini")
+
+server_config = ServerConfig(
+    model_name=JUDGE_MODEL_VERSION,
+)
+server = get_server(server_name=API_TYPE, config=server_config)


This part is redundant

kcz358 · 2025-09-26T02:13:47Z

Hi, Thank you for the PR. I review the wenet part and I think most of the part LGTM. I think the commits is a bit chaos and the file changes include the file changes from your last PR. Do you mind only include the commit that contains the changes for wenet speech? Thanks!

You can do that by checkout from a new main and then cherry pick the commit. Thanks!

YichenG170 · 2025-09-26T11:30:38Z

Hi, Thank you for the PR. I review the wenet part and I think most of the part LGTM. I think the commits is a bit chaos and the file changes include the file changes from your last PR. Do you mind only include the commit that contains the changes for wenet speech? Thanks!

You can do that by checkout from a new main and then cherry pick the commit. Thanks!

Ahh sorry for this, I will make a new one!

YichenG170 and others added 6 commits August 29, 2025 23:48

[Feature] Add VoiceBench

0b67a10

[Debug] Fix Lint Errors

41b9211

[Debug] Fix Lint Errors for previous files

29316e3

Refactor(step2_audio_paralinguistic): Improve semantic matching and p…

319f2c9

…rompts

Add WenetSpeech

59055e1

Merge branch 'main' of https://github.com/YichenG170/lmms-eval-G

13c75c8

kcz358 reviewed Sep 26, 2025

View reviewed changes

YichenG170 closed this Sep 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature] Add WenetSpeech #829

[Feature] Add WenetSpeech #829

Uh oh!

YichenG170 commented Sep 18, 2025

Uh oh!

kcz358 Sep 26, 2025

Uh oh!

kcz358 commented Sep 26, 2025

Uh oh!

YichenG170 commented Sep 26, 2025

Uh oh!

Uh oh!

[Feature] Add WenetSpeech #829

[Feature] Add WenetSpeech #829

Uh oh!

Conversation

YichenG170 commented Sep 18, 2025

Uh oh!

kcz358 Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

kcz358 commented Sep 26, 2025

Uh oh!

YichenG170 commented Sep 26, 2025

Uh oh!

Uh oh!