Newsroom Anthropic 09月13日
AWS与Anthropic合作优化Claude模型
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

AWS与Anthropic合作,将Claude模型优化以运行在AWS Trainium2芯片上。Claude 3.5 Haiku现在支持在Amazon Bedrock中进行延迟优化推理,显著提高了速度而不牺牲准确性。此外,Amazon Bedrock增加了模型蒸馏支持,将大型Claude模型的智能带到更快、更具成本效益的模型上。AWS还在构建Project Rainier,这是一个包含数百万Trainium2芯片的EC2 UltraCluster,将提供比当前AI模型训练所需的计算能力高五倍的性能。Claude 3.5 Haiku的更快版本,由Trainium2提供支持,现已在美国东部(俄亥俄州)地区通过跨区域推理提供,价格为每百万输入令牌1美元,每百万输出令牌5美元。模型蒸馏使Claude 3 Haiku在特定任务上达到Claude 3.5 Sonnet类似的准确性,同时保持相同的价格和速度。这项技术通过从“老师”(Claude 3.5 Sonnet)到“学生”(Claude 3 Haiku)转移知识,使客户能够在较低的成本下运行复杂的任务,如检索增强生成(RAG)和数据分析。Claude 3.5 Haiku的价格也降低了,现在在所有平台上每百万输入令牌为0.80美元,每百万输出令牌为4美元。

💡 AWS与Anthropic合作,将Claude模型优化以运行在AWS Trainium2芯片上,显著提高了模型的推理速度和准确性。

🚀 Claude 3.5 Haiku现在支持在Amazon Bedrock中进行延迟优化推理,使其在代码完成、实时内容审核和聊天机器人等用例中表现更佳。

🔧 Amazon Bedrock增加了模型蒸馏支持,将大型Claude模型的智能带到更快、更具成本效益的模型上,使客户能够在较低的成本下运行复杂的任务。

🌐 Project Rainier是一个包含数百万Trainium2芯片的EC2 UltraCluster,将提供比当前AI模型训练所需的计算能力高五倍的性能。

💰 Claude 3.5 Haiku的价格降低了,现在在所有平台上每百万输入令牌为0.80美元,每百万输出令牌为4美元,使其更具可访问性。

As part of our expanded collaboration with AWS, we’ve begun optimizing Claude models to run on AWS Trainium2, their most advanced AI chip.

To preview what’s possible with Trainium2, Claude 3.5 Haiku now supports latency-optimized inference in Amazon Bedrock, making the model significantly faster without compromising accuracy.

We’re also adding support for model distillation in Amazon Bedrock, bringing the intelligence of larger Claude models to our faster and more cost-effective models.

Next-gen models on Trainium2

We are collaborating with AWS to build Project Rainier—an EC2 UltraCluster of Trn2 UltraServers containing hundreds of thousands of Trainium2 chips. This cluster will deliver more than five times the computing power (in exaflops) used to train our current generation of leading AI models.

Trainium2 enables us to offer faster models in Amazon Bedrock, starting with Claude 3.5 Haiku which now supports latency-optimized inference in public preview. By enabling latency optimization, Claude 3.5 Haiku can deliver up to 60% faster inference speed—making it the ideal choice for use cases ranging from code completions to real-time content moderation and chatbots.

This faster version of Claude 3.5 Haiku, powered by Trainium2, is available in the US East (Ohio) Region via cross-region inference and is offered at $1 per million input tokens and $5 per million output tokens.

Amazon Bedrock Model Distillation

We’re also enabling customers to get frontier performance from Claude 3 Haiku—our most cost-effective model from the last generation. With distillation, Claude 3 Haiku can now achieve significant performance gains, reaching Claude 3.5 Sonnet-like accuracy for specific tasks—at the same price and speed of our most cost-effective model.

This technique transfers knowledge from the "teacher" (Claude 3.5 Sonnet) to the "student" (Claude 3 Haiku), enabling customers to run sophisticated tasks like retrieval augmented generation (RAG) and data analysis at a fraction of the cost.

Unlike traditional fine-tuning, which requires developers to manually craft training examples and continuously adjust parameters, Amazon Bedrock Model Distillation automates the entire process by:

    Generating synthetic training data from Claude 3.5 SonnetTraining and evaluating Claude 3 HaikuHosting the final distilled model for inference

Amazon Bedrock Model Distillation automatically applies different data synthesis methods—from generating similar prompts to creating new high-quality responses based on your example prompt-response pairs.

Distillation for Claude 3 Haiku in Amazon Bedrock is now available in preview. Learn more in the AWS launch blog and documentation.

Lower prices for Claude 3.5 Haiku

In addition to offering a faster version on Trainium2, customers can continue to access Claude 3.5 Haiku on the Anthropic API, Amazon Bedrock, and Google Cloud’s Vertex AI.

To make this model even more accessible for a wide range of use cases, we’re lowering the price of Claude 3.5 Haiku to $0.80 per million input tokens and $4 per million output tokens across all platforms.

Get started

Starting today, model distillation and the faster Claude 3.5 Haiku are available in preview in Amazon Bedrock. For developers seeking the optimal balance of price, performance, and speed, you now have expanded model options with Claude:

    Claude 3.5 Haiku with latency optimization, powered by Trainium2, for general use casesClaude 3 Haiku, distilled with frontier performance, for high-volume, repetitive use cases

To get started, visit the Amazon Bedrock console. We can’t wait to see what you build.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

AWS Anthropic Claude 3.5 Haiku Trainium2 Amazon Bedrock 模型蒸馏 Project Rainier
相关文章