热点
关于我们
xx
xx
"
计算扩展
" 相关文章
How Well Does RL Scale?
少点错误
2025-10-22T13:48:55.000000Z
How Well Does RL Scale?
少点错误
2025-10-22T13:48:55.000000Z
Sigmoidal Scaling Curves Make Reinforcement Learning RL Post-Training Predictable for LLMs
MarkTechPost@AI
2025-10-18T02:42:51.000000Z
ParaThinker: Scaling LLM Test-Time Compute with Native Parallel Thinking to Overcome Tunnel Vision in Sequential Reasoning
MarkTechPost@AI
2025-09-09T09:28:27.000000Z
ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute
cs.AI updates on arXiv.org
2025-09-08T04:51:36.000000Z
Can LLMs Really Judge with Reasoning? Microsoft and Tsinghua Researchers Introduce Reward Reasoning Models to Dynamically Scale Test-Time Compute for Better Alignment
MarkTechPost@AI
2025-05-26T18:25:50.000000Z
The State of LLM Reasoning Models
Ahead of AI
2025-03-08T12:12:31.000000Z
o1: A Technical Primer
少点错误
2024-12-09T19:13:15.000000Z