https://simonwillison.net/atom/everything 10月22日 02:58
OpenAI发布具备ChatGPT功能的Mac浏览器Atlas
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

OpenAI发布了一款名为ChatGPT Atlas的Mac平台专属网页浏览器,该浏览器集成了多项由ChatGPT驱动的功能。其中一项核心功能是用户可以在浏览网页时,侧边栏会同步显示与当前页面内容相关的ChatGPT聊天窗口。另一项名为“浏览器记忆”的特色功能允许ChatGPT记录用户的浏览细节,以便在后续的聊天中提供更智能的建议,例如找回之前浏览过的网页。用户可以管理这些记忆,随时查看、存档或删除。此外,Atlas还提供一个实验性的“代理模式”,在该模式下,ChatGPT可以自主导航和操作网页,完成诸如制定膳食计划、列出购物清单并添加到购物车等任务,但所有操作都在用户的监控和控制之下,且有严格的系统和数据访问限制。

💬 ChatGPT Atlas是一款由OpenAI开发的Mac平台专属网页浏览器,它最核心的特点是将ChatGPT的功能深度集成到浏览体验中,旨在提升用户浏览和信息处理的效率。

🧠 浏览器记忆功能是Atlas的一大亮点,它允许ChatGPT在用户授权的情况下,记住用户浏览过的网页中的关键信息,从而在后续的聊天交互中能够提供更具上下文关联的回复和建议,方便用户快速找回和利用历史信息。

✨ Atlas还引入了一项实验性的“代理模式”,在这个模式下,ChatGPT可以像一个智能助手一样,在用户的指令下自主地浏览网页、查找信息甚至执行简单的在线任务,例如搜索食谱和创建购物清单,但所有操作都在用户的监督和控制之下,并受到严格的权限限制,以保障安全和隐私。

🌐 为了更好地支持ChatGPT在“代理模式”下的工作,网站开发者可以通过添加ARIA标签来优化其网站的结构和交互元素的可读性,这与提升辅助技术(如屏幕阅读器)用户体验的思路是一致的,表明AI代理在某种程度上也依赖于网页的可访问性信息。

Introducing ChatGPT Atlas (via) Last year OpenAI hired Chrome engineer Darin Fisher, which sparked speculation they might have their own browser in the pipeline. Today it arrived.

ChatGPT Atlas is a Mac-only web browser with a variety of ChatGPT-enabled features. You can bring up a chat panel next to a web page, which will automatically be populated with the context of that page.

The "browser memories" feature is particularly notable, described here:

If you turn on browser memories, ChatGPT will remember key details from your web browsing to improve chat responses and offer smarter suggestions—like retrieving a webpage you read a while ago. Browser memories are private to your account and under your control. You can view them all in settings, archive ones that are no longer relevant, and clear your browsing history to delete them.

Atlas also has an experimental "agent mode" where ChatGPT can take over navigating and interacting with the page for you, accompanied by a weird sparkle overlay effect:

Here's how the help page describes that mode:

In agent mode, ChatGPT can complete end to end tasks for you like researching a meal plan, making a list of ingredients, and adding the groceries to a shopping cart ready for delivery. You're always in control: ChatGPT is trained to ask before taking many important actions, and you can pause, interrupt, or take over the browser at any time.

Agent mode runs also operates under boundaries:

    System access: Cannot run code in the browser, download files, or install extensions.Data access: Cannot access other apps on your computer or your file system, read or write ChatGPT memories, access saved passwords, or use autofill data.Browsing activity: Pages ChatGPT visits in agent mode are not added to your browsing history.

You can also choose to run agent in logged out mode, and ChatGPT won't use any pre-existing cookies and won't be logged into any of your online accounts without your specific approval.

These efforts don't eliminate every risk; users should still use caution and monitor ChatGPT activities when using agent mode.

I continue to find this entire category of browser agents deeply confusing.

The security and privacy risks involved here still feel insurmountably high to me - I certainly won't be trusting any of these products until a bunch of security researchers have given them a very thorough beating.

I'd like to see a deep explanation of the steps Atlas takes to avoid prompt injection attacks. Right now it looks like the main defense is expecting the user to carefully watch what agent mode is doing at all times!

I also find these products pretty unexciting to use. I tried out agent mode and it was like watching a first-time computer user painstakingly learn to use a mouse for the first time. I have yet to find my own use-cases for when this kind of interaction feels useful to me, though I'm not ruling that out.

There was one other detail in the announcement post that caught my eye:

Website owners can also add ARIA tags to improve how ChatGPT agent works for their websites in Atlas.

Which links to this:

ChatGPT Atlas uses ARIA tags---the same labels and roles that support screen readers---to interpret page structure and interactive elements. To improve compatibility, follow WAI-ARIA best practices by adding descriptive roles, labels, and states to interactive elements like buttons, menus, and forms. This helps ChatGPT recognize what each element does and interact with your site more accurately.

A neat reminder that AI "agents" share many of the characteristics of assistive technologies, and benefit from the same affordances.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

ChatGPT Atlas OpenAI Mac浏览器 AI助手 浏览器记忆 代理模式 ARIA标签
相关文章