Microsoft AI News 09月25日 23:49
微软AI发布新语音与基础模型,赋能未来AI助手
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

微软AI(MAI)致力于打造普惠AI,并发布了其首个高度自然且富有表现力的语音生成模型MAI-Voice-1,已集成至Copilot Daily和Podcasts,并推出Copilot Labs体验。该模型能在一块GPU上每秒生成一分钟音频,效率极高。同时,MAI-1-preview基础模型已在LMArena平台开启公测,这是MAI首个端到端训练的基础模型,预示着Copilot未来能力的提升。MAI-1-preview采用混合专家模型架构,并在海量NVIDIA H100 GPU上进行了训练,旨在为用户提供更智能、更贴心的服务。微软AI正加速模型迭代,并计划在未来数月内分享更多进展,同时也在招募优秀人才共同构建AI的未来。

✨ **MAI-Voice-1语音生成模型:** 微软AI推出了其首个高度富有表现力的自然语音生成模型MAI-Voice-1。该模型速度极快,能在单块GPU上每秒生成一分钟的音频,效率显著。目前已应用于Copilot Daily和Podcasts功能,并在Copilot Labs提供体验,用户可尝试其在讲故事和创作个性化引导冥想等方面的能力。

🚀 **MAI-1-preview基础模型:** MAI-1-preview是微软AI首个端到端训练的基础模型,目前在LMArena平台进行公测。该模型采用混合专家架构,并在大量NVIDIA H100 GPU上进行了训练,旨在为Copilot提供更强大的指令遵循和响应能力,未来几周内将逐步应用于Copilot的文本功能中。

💡 **AI赋能与未来愿景:** 微软AI秉持“AI赋能每个人”的愿景,致力于构建负责任、可靠且富有洞察力的AI平台。通过不断优化模型和基础设施,MAI旨在为用户提供更智能的体验,并计划通过整合一系列专业化模型来解锁更大的价值,同时积极招募顶尖人才共同推动AI的创新发展。

At Microsoft AI (MAI) we believe AI should be used to empower every person on the planet. We are creating AI for everyone, a supportive, helpful presence always in the service of humanity. It will be the gateway to a universe of knowledge and a set of capabilities that enable people and organizations to achieve more. Responsible, reliable, filled with personality and expertise, we are focused on creating applied AI as a platform for category defining and deeply trusted products that understand each of our unique needs.

Since last year, we’ve been focused on building the foundation for this vision, with a world class team and infrastructure. To fully meet our goals, MAI requires purpose-built models. Today, we’re excited to preview the first steps to making this a reality.

    First, we’re releasing MAI-Voice-1, our first highly expressive and natural speech generation model, which is available in Copilot Daily and Podcasts, and as a brand new Copilot Labs experience to try out here. Voice is the interface of the future for AI companions and MAI-Voice-1 delivers high-fidelity, expressive audio across both single and multi-speaker scenarios.Second, we have begun public testing of MAI-1-preview on LMArena, a popular platform for community model evaluation. This represents MAI’s first foundation model trained end-to-end and offers a glimpse of future offerings inside Copilot. We are actively spinning the flywheel to deliver improved models. We’ll have much more to share in the coming months. Stay tuned!

We have big ambitions for where we go next. Not only will we pursue further advances here, but we believe that orchestrating a range of specialized models serving different user intents and use cases will unlock immense value. There will be a lot more to come from this team on both fronts in the near future. We’re excited by the work ahead as we aim to deliver leading models and put them into the hands of people globally.

Try MAI-Voice-1 in Copilot and Copilot Labs

MAI-Voice-1 is a lightning-fast speech generation model, with an ability to generate a full minute of audio in under a second on a single GPU, making it one of the most efficient speech systems available today.

MAI-Voice-1 is already powering our Copilot Daily and Podcasts features. We are also launching it in Copilot Labs where you can try our expressive speech and storytelling demos. Imagine creating a “choose your own adventure” story with just a simple prompt, or crafting a bespoke guided meditation to help you sleep. Give it a try!

Try MAI-1-preview in LMArena

MAI-1-preview is an in-house mixture-of-experts model, pre-trained and post-trained on ~15,000 NVIDIA H100 GPUs. This model is designed to provide powerful capabilities to consumers seeking to benefit from models that specialize in following instructions and providing helpful responses to everyday queries.

We will be rolling MAI-1-preview out for certain text use cases within Copilot over the coming weeks to learn and improve from user feedback. We will continue to use the very best models from our team, our partners, and the latest innovations from the open-source community to power our products. This approach gives us the flexibility to deliver the best outcomes across millions of unique interactions every day.

In addition to LMArena, we are also making this model available to trusted testers – apply for API access here. We’re excited to collect early feedback to learn more about where the model performs well and how we can make it better. Stay tuned for more.

Build the future with us

We’re a lean, fast-moving lab made up of some of the world’s most talented minds. We have an exciting roadmap of compute at MAI, with our next-generation GB200 cluster now operational. And we have an ambitious mission we truly believe in. We’re also fortunate to partner with incredible product teams giving our models the chance to reach billions of users and create immense positive impact. If you’re a brilliant, highly-ambitious and low ego individual, you’ll fit right in – come and join us as we work on our next generation of models!

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

微软AI MAI MAI-Voice-1 MAI-1-preview Copilot AI语音 基础模型 Microsoft AI AI Voice Foundation Model
相关文章