热点
关于我们
xx
xx
"
rStar2-Agent
" 相关文章
14B打败671B,微软rStar2-Agent在数学推理上超过DeepSeek-R1
36kr
2025-09-02T07:42:28.000000Z
Microsoft AI Introduces rStar2-Agent: A 14B Math Reasoning Model Trained with Agentic Reinforcement Learning to Achieve Frontier-Level Performance
MarkTechPost@AI
2025-08-30T06:56:44.000000Z