Articles related to "real-time"
Thai Semantic End-of-Turn Detection for Real-Time Voice Agents
cs.AI updates on arXiv.org
2025-10-07T04:16:11.000000Z
Kyutai Releases 2B Parameter Streaming Text-to-Speech TTS with 220ms Latency and 2.5M Hours of Training
MarkTechPost@AI
2025-07-05T08:30:50.000000Z
Kyutai Open Sources Moshi: A Real-Time Native Multimodal Foundation AI Model that can Listen and Speak
MarkTechPost@AI
2024-07-03T19:31:59.000000Z