All Content from Business Insider 09月30日 01:09
Anthropic发布Claude Sonnet 4.5,提升AI编码与自主运行能力
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Anthropic发布了其最新AI模型Claude Sonnet 4.5,并称其为全球顶尖的AI编码系统。新模型在SWE-Bench Verified基准测试中表现卓越,显著提升了代码可靠性、重构判断和生产就绪性。Sonnet 4.5的一大亮点是其自主运行能力,最长可达30小时,远超前代模型。该模型在网络安全、金融服务等领域也展现出强大实力,能更快地检测和修复漏洞,并在研究、建模和预测任务中超越了Opus 4.1。此外,Anthropic还通过Claude Agent SDK等工具,为开发者提供了构建定制化AI代理的新能力,扩展了其开发者生态系统。

🚀 **AI编码能力大幅提升**:Claude Sonnet 4.5在SWE-Bench Verified基准测试中取得了领先成绩,被Anthropic称为全球最佳AI编码系统。该模型在代码可靠性、重构判断和生产就绪性方面均有显著增强,旨在为软件开发带来更高的效率和质量。

⏳ **超长自主运行能力**:新模型的一项重大突破是其能够实现长达30小时的自主运行,这比之前的Opus 4模型有了四倍以上的提升。这一能力为AI在复杂任务中的持续执行提供了可能,尤其是在需要长时间专注和处理的场景下。

🛡️ **多领域应用拓展**:Sonnet 4.5在网络安全和金融服务等关键领域展现出强大的应用潜力。在网络安全方面,它能更快速地检测和修复安全漏洞;在金融服务领域,它在研究、建模和预测等任务上的表现优于Opus 4.1,为业务决策提供更强支持。

🛠️ **开发者生态系统强化**:Anthropic推出了Claude Agent SDK等新工具,旨在为开发者提供更精细化的工具,以构建定制化、具备上下文感知能力的AI代理。这包括对虚拟机、内存和上下文管理的支持,以及VS Code扩展和增强的终端工作流,降低了AI应用开发的门槛。

Anthropic CEO Dario Amodei

Anthropic released Claude Sonnet 4.5, its latest model, on Monday. The company positioned it as the world's best AI coding system and a leap forward in applied artificial intelligence.

The upgrade arrives just four months after its predecessor, Sonnet 4, underscoring the startup's rapid product cadence in the generative AI arms race.

Anthropic said Sonnet 4.5 delivers state-of-the-art results on SWE-Bench Verified, a standard for evaluating software engineering performance.

The startup also pitched the new model's ability to generate practical business outcomes through autonomous computer use, cybersecurity capabilities, and the creation of production-ready applications and context-aware AI agents.

Anthropic's revenue has surged this year, primarily driven by the coding functionalities of its models and a specific product called Claude Code. The startup pulled away from rivals in AI coding, and Sonnet 4.5 is designed to maintain this lead.

Automated and assisted software coding is one of the most compelling use cases for generative AI so far. That's partly because there are big potential productivity gains and cost savings.

Anthropic noted on Monday that Claude Code is generating more than $500 million in run-rate revenue, with usage growing more than 10X in three months.

The startup said Sonnet 4.5 enhances code reliability, refactoring judgment, and production-readiness. This new model competes against other offerings such as Google's Gemini, OpenAI's GPT-5, and xAI's Grok 4.

The business implications are broad. In cybersecurity, Anthropic said Sonnet 4.5 helps detect and remediate vulnerabilities faster. In financial services, it surpasses Anthropic's Opus 4.1, the company's most advanced reasoning model, in tasks such as research, modeling, and forecasting.

Perhaps most notably, the new model can operate autonomously for up to 30 hours, more than quadrupling the endurance of Opus 4, Anthropic said.

The startup is also expanding its developer ecosystem with new tools that bring Claude Code's building blocks to a wider audience. Developers gain access to virtual machines, memory, and context management.

New developer-focused features include a native VS Code extension, enhanced terminal workflows, and checkpoints that allow engineers to roll back code instantly if their AI-powered projects veer off track.

On the Claude Developer Platform, Anthropic launched a Claude Agent software development kit providing developers with fine-grained tools for building customized, context-aware AI agents.

Sign up for BI's Tech Memo newsletter here. Reach out to me via email at abarr@businessinsider.com.

Read the original article on Business Insider

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Anthropic Claude Sonnet 4.5 AI编码 自主运行 人工智能 Claude Agent SDK AI Agents Cybersecurity Financial Services AI Coding Autonomous Operation Artificial Intelligence
相关文章