The Verge - Artificial Intelligences 08月21日
Google’s Gemini Live AI assistant will show you what it’s talking about
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Google Gemini Live AI助手迎来一系列重要更新,旨在提升实时对话体验。新功能包括在屏幕共享时高亮显示特定物品,方便用户与AI协作解决问题,例如通过摄像头识别并指出正确的工具。此外,Gemini Live将深度集成Messages、Phone、Clock等应用,允许用户在对话中直接操作,如发送延迟通知。更新还包括一个优化的音频模型,能更自然地模仿人类语音的语调、节奏和音高,并允许用户调整语速,甚至根据内容扮演角色,提供更富情感和沉浸感的交互。

✨ Gemini Live将支持屏幕共享中的视觉高亮功能,允许AI直接在共享的屏幕上指出用户感兴趣的物品,例如在展示工具时准确高亮出所需的工具,提升了AI在实际场景中的指导能力。

💬 Gemini Live将实现与Messages、Phone、Clock等更多应用的深度集成,使用户能够在与AI对话的同时,直接执行应用内的操作,例如在讨论行程时,直接向联系人发送关于迟到的消息。

🔊 Gemini Live推出了更新的音频模型,能够更精细地模仿人类语音的关键要素,如语调、节奏和音高,使AI的语音表达更加自然和富有情感,并可根据对话内容调整语气。

⏱️ 用户将能够自定义Gemini Live的语速,使其能像ChatGPT一样调整说话快慢,同时,AI还能根据用户需求,如讲述故事或扮演角色时,采用特定的口音,提供更丰富、更具吸引力的叙事体验。

Google is bringing a bundle of new features to Gemini Live, its AI assistant that you can have real-time conversations with. Next week, Gemini Live will be able to highlight things directly on your screen while sharing your camera, making it easier for the AI assistant to point out a specific item.

If you’re trying to find the right tool for a project, for example, you can point your smartphone’s camera at a collection of tools, and Gemini Live will highlight the correct one on your screen. This feature will be available on the newly announced Pixel 10 devices when they launch on August 28th. Google will begin rolling out visual guidance to other Android devices at the same time before expanding to iOS “in the coming weeks.”

Google is also launching new integrations that will soon allow Gemini Live to interact with more apps, including Messages, Phone, and Clock. Say you’re in the middle of a conversation with Gemini about directions to your destination, but you realize you’re running late. Google says you’ll be able to interrupt the chatbot with something like: “This route looks good. Now, send a message to Alex that I’m running about 10 minutes late.” From there, Google can draft a text to your friend for you.

Lastly, Google is launching an updated audio model for Gemini Live that the company says will “dramatically improve” how the chatbot “uses the key elements of human speech, like intonation, rhythm and pitch.” Soon, Gemini will change its tone based on what you’re speaking about, such as using a calmer voice if you’re asking about a stressful topic. 

You’ll also be able to change how fast — or slow — Gemini talks, which sounds a bit similar to how users can now tweak the style of ChatGPT’s voice mode. And, if you ask Gemini for a dramatic retelling of a story from the perspective of a particular character or historical figure, the chatbot may adopt an accent for a “rich, engaging narrative.”

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Gemini Live AI助手 屏幕共享 应用集成 语音交互
相关文章