热点
"语音识别" 相关文章
微软竟然开源了整套、能打电话的 AI
小众软件 2025-11-05T07:18:10.000000Z
智能会议纪要助手:基于 TRAE IDE 和 MCP 的完整实践
豆包MarsCode 2025-11-04T12:27:35.000000Z
How Switchboard, MD automates real-time call transcription in clinical contact centers with Amazon Nova Sonic
AWS Machine Learning Blog 2025-11-03T17:31:55.000000Z
Top 5 ASR Companies in 2025: Audio Transcription and Labeling Services
Cogito Tech 2025-11-03T07:04:09.000000Z
“修复地球 OL 和女生聊天时没有对话框 Bug”:开发者打造 LiveGalGame“整活项目”
IT之家 2025-11-03T01:36:55.000000Z
Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation
cs.AI updates on arXiv.org 2025-10-30T04:15:51.000000Z
A Neural Model for Contextual Biasing Score Learning and Filtering
cs.AI updates on arXiv.org 2025-10-29T04:23:54.000000Z
Generative Annotation for ASR Named Entity Correction
cs.AI updates on arXiv.org 2025-10-27T06:35:03.000000Z
[HomeKit] Siri 无法打开指定的灯具和空调,会反复问你是哪个房间的。
V2EX 2025-10-26T18:09:40.000000Z
智能头盔技术:对抗音频深度伪造的未来方向
FreeBuf互联网安全新媒体平台 2025-10-24T06:48:47.000000Z
EchoFake: A Replay-Aware Dataset for Practical Speech Deepfake Detection
cs.AI updates on arXiv.org 2025-10-23T04:20:04.000000Z
[分享创造] 求推荐 YouTube 视频把印度口音🇮🇳替换成标准美式口音🇺🇸的方案
V2EX 2025-10-21T05:12:50.000000Z
RWKV 2025 生态内容征集大赛 | 9 月投稿作品及评审结果
RWKV元始智能 2025-10-18T11:36:31.000000Z
RLAIF-SPA: Optimizing LLM-based Emotional Speech Synthesis via RLAIF
cs.AI updates on arXiv.org 2025-10-17T04:18:47.000000Z
Do Slides Help? Multi-modal Context for Automatic Transcription of Conference Talks
cs.AI updates on arXiv.org 2025-10-17T04:06:58.000000Z
多位机主称华为小艺AI把亳州读成hao州 客服:读音出错是语音库情况 需要核实
快科技资讯 2025-10-16T08:38:23.000000Z
Automatic Speech Recognition in the Modern Era: Architectures, Training, and Evaluation
cs.AI updates on arXiv.org 2025-10-16T04:23:08.000000Z
AI圈再颠覆!中国AI翻译耳机通话翻译,实测震撼
新智元 2025-10-15T10:17:39.000000Z
上海→迪拜现场连线,多亏有这位“同传翻译官”🤟
科大讯飞 2025-10-15T09:59:15.000000Z
Assessing Latency in ASR Systems: A Methodological Perspective for Real-Time Use
cs.AI updates on arXiv.org 2025-10-15T05:12:36.000000Z