Trending
Articles related to "rStar2-Agent"
14B Beats 671B: Microsoft's rStar2-Agent Surpasses DeepSeek-R1 in Mathematical Reasoning
36kr 2025-09-02T07:42:28.000000Z
Microsoft AI Introduces rStar2-Agent: A 14B Math Reasoning Model Trained with Agentic Reinforcement Learning to Achieve Frontier-Level Performance
MarkTechPost@AI 2025-08-30T06:56:44.000000Z