Cognition AI 09月30日 04:52
Anthropic发布新一代编码模型Claude Sonnet 4.5
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Anthropic推出最新编码模型Claude Sonnet 4.5,提升规划性能18%,端到端评估得分提升12%,展现了与以往不同的行为模式,并推动了Devin的架构变革。


Claude Sonnet 4.5 is Anthropic's newest and most powerful model, and we believe it represents a new generation of coding models.

For Devin, Claude Sonnet 4.5 increased planning performance by 18% and end-to-end eval scores by 12% - the biggest jump we've seen since the release of Claude Sonnet 3.6. It excels at testing its own code, enabling Devin to run longer, handle harder tasks, and deliver production-ready code more consistently.

Scott Wu

Over the past few days, we've been testing this model internally, and what we've found goes beyond just benchmark improvements. The model exhibits fundamentally different behaviors - ways it approaches problems, manages its own work, and structures its development process - that required us to rethink how we architect Devin.

Because Devin is an agent that plans, executes, and iterates rather than just autocompleting code, we get an unusual window into what's genuinely changed. And with Sonnet 4.5, the improvements compound across our feedback loops in exciting new ways.

Claude Sonnet 4.5 is available in Devin starting today. We're excited to see what you build with it.

Want to learn more about what makes this model different? We've been testing Sonnet 4.5 extensively over the past few days and discovered some fascinating behaviors, from how it manages its own context window to how it creates feedback loops to verify its work.

Read our deep dive on the unexpected challenges and opportunities we found while rebuilding Devin for this new generation of coding models

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Anthropic 编码模型 Claude Sonnet 4.5 Devin 性能提升
相关文章