cs.AI updates on arXiv.org 08月18日
gpt-oss-120b & gpt-oss-20b Model Card
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文介绍了两种开源推理模型gpt-oss-120b和gpt-oss-20b,它们在准确性和推理成本方面取得突破。模型采用高效混合专家transformer架构,通过大规模蒸馏和强化学习训练,具有强大的自主能力,并支持清晰指令遵循和角色划分。模型在数学、编码和安全基准测试中表现出色,相关资源已开源。

arXiv:2508.10925v1 Announce Type: cross Abstract: We present gpt-oss-120b and gpt-oss-20b, two open-weight reasoning models that push the frontier of accuracy and inference cost. The models use an efficient mixture-of-expert transformer architecture and are trained using large-scale distillation and reinforcement learning. We optimize the models to have strong agentic capabilities (deep research browsing, python tool use, and support for developer-provided functions), all while using a rendered chat format that enables clear instruction following and role delineation. Both models achieve strong results on benchmarks ranging from mathematics, coding, and safety. We release the model weights, inference implementations, tool environments, and tokenizers under an Apache 2.0 license to enable broad use and further research.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

gpt-oss-120b gpt-oss-20b 推理模型 开源 AI
相关文章