Steampunk AI 11月09日 04:57
科技前沿观察:浏览器之战再起,AI持续学习获进展
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本周科技领域动态频频,Meta与Hugging Face联合推出OpenEnv,旨在为AI代理提供共享环境和工具,助力强化学习在AI训练中的应用。与此同时,新一轮浏览器竞争浮出水面,各大AI公司纷纷推出AI浏览器,试图与传统浏览器争夺用户入口。OpenAI的CEO Sam Altman则分享了其营收预期和未来数据中心建设的宏大计划。此外,Thinking Machines团队在“在岗”持续学习方面取得了新进展,通过“策略蒸馏”方法提升了小模型的学习效率,避免了灾难性遗忘。最后,亚马逊因AI浏览器代理自动购物问题向Perplexity发出停止函,揭示了AI公司与传统电商之间潜在的利益冲突。

🤖 Meta与Hugging Face联合发布OpenEnv,这是一个为AI代理设计的共享环境和工具平台,旨在促进基于强化学习的AI系统训练和开发,未来其规范能否应用于生产环境值得关注。

🌐 浏览器市场再掀战火,The Verge分析了近期AI浏览器涌现的背后原因,这些AI浏览器试图成为用户进入数字世界的窗口,与传统搜索引擎和新兴的Chat AI展开竞争,但AI玩家能否通过占领浏览器市场份额获益仍不明朗,Chat AI未来可能集成更多浏览器功能。

💰 OpenAI CEO Sam Altman预测公司年化收入将在年底超过200亿美元,并在2030年达到数千亿美元,同时计划未来八年投入约1.4万亿美元用于数据中心建设,此举引发了关于AI泡沫是否已达顶峰的讨论,但其营收增长潜力和货币化途径仍然广阔。

🧠 Thinking Machines团队提出了“策略蒸馏”方法,旨在解决大型语言模型在训练后“在岗”持续学习的难题,通过让强大的监督模型迭代地向小型操作模型提供反馈,包括答案和生成步骤,从而实现更高效的模型微调和能力保留。

🛒 亚马逊向Perplexity发送停止函,原因是其AI浏览器代理能够自动完成购物,这反映了AI服务自动化购买流程与传统电商维护用户流量和数据隐私之间的矛盾,未来随着AI购物的普及,电商平台或将不得不与AI公司建立合作。

Saturday Links: Browsers, $1.4T, and Advances in Continuous Learning

Reinforcement learning environments, orbital data centres, browser wars and blocking shopping bots.

This week sees OpenAI ask for loan guarantees and then walk it back, Gemini may be coming to Siri, and Data centers in space (why is this not called Project Straylight?).

On to the most interesting stories:

    Meta and Hugging Face Launch OpenEnv, a Shared Hub for Agentic Environments. Hugging Face continues to do a great job in launching initiatives that share AI-adjacent resources. This announcement is about context and tooling for agents. Reinforcement Learning approaches are becoming more popular for AI agent training, and they require close control of the tools and actions available to an AI system. An interesting question will be whether openEnv specs will be usable for configuring production runtime environments. That seems like a requirement for agents trained in OpenEnv environments to be useful. The community itself is here.WEB WAR III: The Browser is Back. David Pierce at The Verge unpacks what is behind the recent surge of AI browser launches (still: please don't use them!). The article covers the power of browsers as the window onto the digital world that AI chatbots are now competing with. I agree with David that this is a fascinating reprise of battles that were mostly settled a decade ago by Chrome becoming dominant. What I'm less sure about is whether capturing browser market share matters for AI players. It seems more likely that as Chat AIs become the first choice for some behaviours, they will include more and more browser-like functionality. Browser usage share v's chatbots would then simply erode. OpenAI's strategy seems to be to threaten its competitors along almost every axis to keep them busy. Perplexity with its Comet offering perhaps sees this as a path to garner users who are not getting chatbot AI native. Perplexity with its Comet offering perhaps sees this as a path to garner users who are not getting chatbot AI native.$20B in revenue run-rate and $1.4T in data centre commitments. In an X-post this week, OpenAI CEO Sam Altman shared revenue numbers and projects. His quote was “We expect to end this year above $20 billion in annualized revenue run rate and grow to hundreds of billion [sic] by 2030. We are looking at commitments of about $1.4 trillion over the next 8 years.” The press reaction has mostly scoffed at the large differential between $20B and $1.4T, and questioned whether we are at peak bubble. Altman is good at thinking big, though, and it doesn't take a real stretch to imagine that OpenAI could be in the hundreds of billions in revenue by 2030. $20B is already an impressive achievement, but they are just scratching the surface of how they can monetize (ads, affiliate fees for eCommerce, business productivity, and much more). At a 200-300B revenue run rate, the data centre commitments (which are not necessarily annual) are feasible. It's also sensible to try to lock in some commitments now. OpenAI will indeed be too big to fail.On-Policy Distillation (Kevin Lu and team @ Thinking Machines). [Via Esteve Almirall]. One of the biggest breakthroughs still required for AI systems is to be able to learn post-training while "on the job". This "On Policy" learning isn't possible in a straightforward way for today's LLMs because the training process is long and involved, and even the Reinforcement Learning (RL) post-training requirements require huge numbers of samples. Worse, trying to adjust model weights post-training often causes extreme loss of prior capabilities. In this new paper, the Thinking Machine team presents methods to use a powerful supervisor model to iteratively provide feedback to a smaller operational model as it answers real queries and also the steps taken to reach those answers (this is the "distillation" mentioned in the title). The feedback is then used to create an incremental fine-tune to adapt the smaller model. Because the feedback being given is not just about the quality of the final answer, but also each token along the way, the signal for change is much stronger, and this seems to lead to much more effective learning. Also related to the FAIR paper from a few weeks ago.Amazon sends Perplexity cease-and-desist over AI browser agents making purchases. This seemingly small bit of news is actually caused by a serious long-term challenge for AI companies and existing web retailers. ChatGPT, Perplexity, and other AI services can now surface items for purchase (see Wallmart's ChatGPT deal) and would like to automate the purchase process. Ideally, so they can capture some of the value. Amazon, for its part, would like to keep shoppers coming to Amazon.com, hence the blocking action. Further, Amazon does not want Perplexity to gain private user information. How long will this last? Until AI-initiated purchases become such a large component of purchases that destination shopping sites will have no choice but to "partner" with AI firms.

Wishing you a great weekend!

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

AI代理 浏览器大战 持续学习 OpenAI 电商
相关文章