热点
关于我们
xx
xx
"
Speech Recognition
" 相关文章
Top 5 ASR Companies in 2025: Audio Transcription and Labeling Services
Cogito Tech
2025-11-03T07:04:09.000000Z
AI圈再颠覆!中国AI翻译耳机通话翻译,实测震撼
新智元
2025-10-15T10:17:39.000000Z
上海→迪拜现场连线,多亏有这位“同传翻译官”🤟
科大讯飞
2025-10-15T09:59:15.000000Z
Google Introduces Speech-to-Retrieval (S2R) Approach that Maps a Spoken Query Directly to an Embedding and Retrieves Information without First Converting Speech to Text
MarkTechPost@AI
2025-10-13T02:15:35.000000Z
FunASR 前端语音识别代码解析
掘金 人工智能
2025-10-09T23:50:45.000000Z
FunASR 前端语音识别代码解析
掘金 人工智能
2025-10-09T23:50:45.000000Z
How to Build an Advanced Voice AI Pipeline with WhisperX for Transcription, Alignment, Analysis, and Export?
MarkTechPost@AI
2025-10-03T04:09:00.000000Z
Liquid AI Released LFM2-Audio-1.5B: An End-to-End Audio Foundation Model with Sub-100 ms Response Latency
MarkTechPost@AI
2025-10-01T17:11:55.000000Z
PotPlayer 史诗级更新!实时字幕生成+实时翻译,看片真的无敌了!
阿虚同学
2025-09-21T08:23:55.000000Z
PotPlayer 史诗级更新!实时字幕生成+实时翻译,看片真的无敌了!
阿虚同学
2025-09-20T03:26:49.000000Z
Qwen3-ASR-Toolkit: An Advanced Open Source Python Command-Line Toolkit for Using the Qwen-ASR API Beyond the 3 Minutes/10 MB Limit
MarkTechPost@AI
2025-09-19T07:58:42.000000Z
没想到,音频大模型开源最彻底的,居然是小红书
机器之心
2025-09-17T17:45:50.000000Z
How to Build an Advanced End-to-End Voice AI Agent Using Hugging Face Pipelines?
MarkTechPost@AI
2025-09-17T17:13:33.000000Z
Alibaba Qwen Team Releases Qwen3-ASR: A New Speech Recognition Model Built Upon Qwen3-Omni Achieving Robust Speech Recogition Performance
MarkTechPost@AI
2025-09-09T09:28:27.000000Z
社区供稿 | 开源SOTA:阶跃发布端到端语音大模型Step-Audio 2 mini!
智源社区
2025-09-04T09:51:16.000000Z
What is OLMoASR and How Does It Compare to OpenAI’s Whisper in Speech Recognition?
MarkTechPost@AI
2025-09-04T09:50:21.000000Z
FFmpeg 8.0 的这个更新,可能要改变世界
小众软件
2025-08-30T11:41:40.000000Z
Stop wasting time typing with this AI-powered dictation tool
Mashable
2025-08-29T09:51:00.000000Z
Vocal Image is using AI to help people communicate better
TechCrunch News
2025-08-29T08:50:23.000000Z