TechCrunch News 11月07日 02:56
Subtle Computing:用AI技术解决嘈杂环境下的语音识别难题
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

加州初创公司Subtle Computing开发了创新的语音隔离模型,旨在解决嘈杂环境下的语音捕获问题,为语音AI产品和服务带来福音。随着语音AI应用的飞速增长,如AI会议笔记、语音转录和语音输入等领域都面临着在各种环境下(如咖啡馆、办公室)准确捕捉用户声音的挑战。Subtle Computing通过训练针对特定设备声学特性和用户声音的定制化模型,而非通用模型,实现了在极端噪音下仍能清晰理解语音,显著提升了语音识别的准确性和用户体验。该技术已获得600万美元的种子轮融资,并与高通合作,未来还将推出自有软硬件消费产品。

🎯 **核心技术突破:** Subtle Computing的核心竞争力在于其先进的语音隔离模型,能够有效过滤掉背景噪音,即便在嘈杂的环境下也能精准捕捉和理解用户的语音指令。这得益于其训练定制化模型的策略,该模型能够适应特定设备的声学特性和用户的声音,而非采用通用的解决方案,从而实现了性能上的数量级提升。

🚀 **市场机遇广阔:** 随着语音AI产品和服务的快速增长,如AI会议记录、语音输入和智能助手等,对高质量语音捕获的需求日益迫切。Subtle Computing的技术直接解决了行业痛点,有望在智能设备、消费电子和企业级应用等多个领域得到广泛应用,为用户提供更可靠、更便捷的语音交互体验。

🤝 **战略合作与融资:** 公司已获得600万美元的种子轮融资,并与高通达成合作,这意味着其技术将集成到高通芯片中,并有望推广至更多OEM设备。此外,公司还与其他消费电子和汽车品牌建立了合作关系,为其技术落地奠定了坚实基础。

💡 **未来发展规划:** Subtle Computing不仅致力于成为模型供应商,还计划在未来推出集硬件与软件于一体的自有消费级产品,进一步拓展其在语音AI领域的市场影响力,并为用户提供更全面的解决方案。

California-based startup Subtle Computing is tackling the problem of capturing people’s voices in noisy environments with its own voice isolation models — a technology that could benefit voice-based AI products and services.

Consumer apps using voice AI are today seeing tremendous growth. AI Meeting notetakers like Granola, Fireflies, Fathom, and Read AI have received both user and investor attention. Existing companies like OpenAI, ClickUp, and Notion have integrated voice transcription solutions. App makers like Wispr Flow and Willow are working on voice dictation. Then there are hardware companies like Plaud and Sandbar that are using devices as a medium to transcribe your voice, then use AI for insight generation and interaction.

One of the challenges for these companies is capturing users’ voices in any kind of environment, such as loud cafes or offices.

To address this, Subtle Computing developed an end-to-end voice isolation model that can understand what you are saying even in noisy environments. Chen said that there are a lot of companies working on voice understanding. He noted that at times, device manufacturers send the voice to the cloud to get a clean output, but that’s not efficient.

The startup trains specific models to suit the acoustics of a particular device and adapt to the user’s voice instead of training one model that works across devices.

“What we found is that when we preserve the acoustic characteristics of a device, we get an order of magnitude better performance than generic solutions. This also means we can give personalized solutions to the user,” Chen said.

The company was founded by Tyler Chen, David Harrison, Savannah Cofer, and Jackie Yang, who met at Stanford. Chen, Cofer, and Yang were pursuing their PhDs while Harrison was doing an MBA. They came together in Steve Blank’s Lean Launchpad course, where they worked on alternative interfaces for computing and started building Subtle Computing.

Techcrunch event

San Francisco | October 13-15, 2026

“As we are interacting more with AI, we are moving towards a future where we talk with our devices,” Chen said. “But the obvious question is how much our devices understand us, the users, in all the environments where we work day to day. Be it a super loud coffee shop or a shared office where there are other people around you, and you might be talking about something private — voice doesn’t work that way today,” he added.

The startup said it can run the model just for voice isolation on some devices, which is just a few megabytes in size and has 100ms of latency. The company can also run a different model to transcribe the voice and give text output for other devices. Chen said thanks to its isolation model, the company’s transcription model can understand users better, and in turn, creates a more accurate transcript.

Subtle Computing said that Qualcomm has selected the startup as a member of its voice and music extension program. This means that the startup’s tech would be compatible with Qualcomm’s chips and be available on devices produced by OEMs.

The company has raised $6 million in seed funding led by Entrada Ventures, with participation from Amplify Partners, Abstract Ventures, and angel investors, including founders like Twitter’s Biz Stone, Pinterest’s Evan Sharp, and Perplexity’s Johnny Ho.

Karen Roter Davis, Managing Partner at Entrada Ventures and a former director of an early project at X (Alphabet), noted that voice AI is a noisy space, and though interactions through this medium are picking up, the overall voice experience is not great. She thinks that the startup’s focus on voice isolation brings a different perspective to the market.

“While you can debate whether AI will increase or decrease that time spent on a day-to-day basis, we can all agree that advances in compute power and machine learning / AI provide opportunities for voice interface breakthroughs – if done right,” Davis said. “Subtle Computing is meeting people where they are with voice interfaces that hold up in extreme noise and extreme quiet, providing a voice experience that is reliable, easy, and fun. It’s a game changer,” she added.

The company said it has also partnered with a consumer hardware brand and an automotive brand — without naming them — to deploy its solutions. But Subtle Computing doesn’t want to be just a model supplier to other companies.

The startup also said it plans to announce a consumer product that spans both hardware and software next year, without offering details.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Subtle Computing 语音隔离 AI语音 嘈杂环境 语音识别 Voice Isolation AI Voice Noisy Environments Voice Recognition
相关文章