LibriSpeech#

概述#

LibriSpeech 是一个大规模语料库,包含约 1,000 小时的朗读英语语音,源自有声读物。它是评估自动语音识别(ASR)系统最广泛使用的基准测试之一。

任务描述#

  • 任务类型:自动语音识别(ASR)

  • 输入:来自有声读物的朗读英语语音录音

  • 输出:转录文本

  • 语言:英语

主要特点#

  • 1,000 小时高质量朗读语音

  • 源自 LibriVox 有声读物(公共领域)

  • 包含“clean”和“other”测试集,用于不同难度评估

  • ASR 研究中广泛使用的基线数据集

  • 标准化的评估协议

评估说明#

  • 默认配置使用 test_clean 划分

  • 主要指标:词错误率(WER)

  • 评估过程中应用文本归一化

  • 提示词:"Please recognize the speech and only output the recognized content"

  • 元数据包含音频 ID 和时长信息

属性#

属性

基准测试名称

librispeech

数据集ID

lmms-lab/Librispeech-concat

论文

N/A

标签

Audio, SpeechRecognition

指标

wer

默认样本数

0-shot

评估划分

test_clean

数据统计#

指标

总样本数

87

提示词长度(平均)

67 字符

提示词长度(最小/最大)

67 / 67 字符

音频统计:

指标

音频文件总数

87

每样本音频数量

最小: 1, 最大: 1, 平均: 1

格式

wav

样例示例#

子集: default

{
  "input": [
    {
      "id": "fd0309e6",
      "content": [
        {
          "text": "Please recognize the speech and only output the recognized content:"
        },
        {
          "audio": "[BASE64_AUDIO: wav, ~22.7MB]",
          "format": "wav"
        }
      ]
    }
  ],
  "target": "Eleven o'clock had struck it was a fine clear night they were the only persons on the road and they sauntered leisurely along to avoid paying the price of fatigue for the recreation provided for the toledans in their valley or on the banks of ... [TRUNCATED] ...  less surprised than they and the better to assure himself of so wonderful a fact he begged leocadia to give him some token which should make perfectly clear to him that which indeed he did not doubt since it was authenticated by his parents.",
  "id": 0,
  "group_id": 0,
  "metadata": {
    "audio_id": "5639-40744",
    "audio_duration": 496.6899719238281
  }
}

注:部分内容为显示目的已截断。

提示模板#

提示模板:

Please recognize the speech and only output the recognized content:

使用方法#

使用 CLI#

evalscope eval \
    --model YOUR_MODEL \
    --api-url OPENAI_API_COMPAT_URL \
    --api-key EMPTY_TOKEN \
    --datasets librispeech \
    --limit 10  # 正式评估时请删除此行

使用 Python#

from evalscope import run_task
from evalscope.config import TaskConfig

task_cfg = TaskConfig(
    model='YOUR_MODEL',
    api_url='OPENAI_API_COMPAT_URL',
    api_key='EMPTY_TOKEN',
    datasets=['librispeech'],
    limit=10,  # 正式评估时请删除此行
)

run_task(task_cfg=task_cfg)