热点
关于我们
xx
xx
"
多轮强化学习
" 相关文章
开源RL框架Verlog来了,专为LLM智能体打造,400回合不成问题
机器之心
2025-10-09T01:33:45.000000Z
开源RL框架Verlog来了,专为LLM智能体打造,400回合不成问题
机器之心
2025-10-08T09:48:41.000000Z
A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning
cs.AI updates on arXiv.org
2025-10-02T04:18:48.000000Z
Kevin: Multi-Turn RL for Generating CUDA Kernels
cs.AI updates on arXiv.org
2025-07-17T04:14:39.000000Z
Enhancing LLM Reasoning with Multi-Attempt Reinforcement Learning
MarkTechPost@AI
2025-03-11T20:47:11.000000Z