Applying LASER, an LLM-Based Evaluation Rubric, to Indian-Language Speech Recognition

cs.AI updates on arXiv.org, October 10

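This article introduces LASER, an LLM-based evaluation rubric, and its application to speech recognition for Indian languages. By drawing on the in-context learning abilities of state-of-the-art LLMs, it makes the analysis of speech recognition errors more accurate.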

arXiv:2510.07437v1 (announce type: cross)

Abstract: Standard ASR evaluation metrics like Word Error Rate (WER) tend to unfairly penalize morphological and syntactic nuances that do not significantly alter sentence semantics. We introduce an LLM-based scoring rubric LASER that leverages state-of-the-art LLMs' in-context learning abilities to learn from prompts with detailed examples. Hindi LASER scores using Gemini 2.5 Pro achieved a very high correlation score of 94% with human annotations. Hindi examples in the prompt were also effective in analyzing errors in other Indian languages such as Marathi, Kannada and Malayalam. We also demonstrate how a smaller LLM like Llama 3 can be finetuned on word-pair examples derived from reference and ASR predictions to predict what kind of penalty should be applied with close to 89% accuracy.
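To make the abstract's criticism of WER concrete, here is a minimal Python sketch of the standard word-level edit-distance definition of WER. It is not the paper's code and does not implement LASER. The function name `word_error_rate` and the English example sentences are assumptions chosen for illustration only; the paper itself works with Hindi and other Indian languages.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Illustrative (made-up) sentences: one near-synonym swap, one meaning change.
reference = "the children are playing outside"
minor_variation = "the children are playing outdoors"
meaning_change = "the children are crying outside"

wer_minor = word_error_rate(reference, minor_variation)
wer_major = word_error_rate(reference, meaning_change)
print(wer_minor, wer_major)
```

Both hypotheses differ from the reference by a single substituted word, so WER assigns them the same score even though only the second one changes what the sentence means. That is the gap a semantics-aware rubric such as the one described in the abstract aims to close.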

