Trending
Articles related to "long-horizon reasoning"
Beyond Multi-Token Prediction: Pretraining LLMs with Future Summaries
cs.AI updates on arXiv.org 2025-10-17T04:18:58.000000Z
R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
cs.AI updates on arXiv.org 2025-10-10T04:08:24.000000Z
h1: Bootstrapping LLMs to Reason over Longer Horizons via Reinforcement Learning
cs.AI updates on arXiv.org 2025-10-09T04:14:08.000000Z