热点
"LRMs" 相关文章
复旦大学与美团联合发布 R-HORIZON,长链推理评测框架
oschina.net 2025-10-29T03:16:50.000000Z
Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost
cs.AI updates on arXiv.org 2025-10-24T04:51:15.000000Z
RegexPSPACE: A Benchmark for Evaluating LLM Reasoning on PSPACE-complete Regex Problems
cs.AI updates on arXiv.org 2025-10-13T04:10:17.000000Z
RegexPSPACE: A Benchmark for Evaluating LLM Reasoning on PSPACE-complete Regex Problems
cs.AI updates on arXiv.org 2025-10-13T04:10:17.000000Z
Your Models Have Thought Enough: Training Large Reasoning Models to Stop Overthinking
cs.AI updates on arXiv.org 2025-09-30T04:01:36.000000Z
FuSaR: A Fuzzification-Based Method for LRM Safety-Reasoning Balance
cs.AI updates on arXiv.org 2025-08-19T04:01:38.000000Z
一篇72页的DeepSeek-R1/QWQ-32B推理能力在AI Agents场景的应用分析
PaperAgent 2025-03-22T11:48:18.000000Z
DeepSeek R1/o1大型推理模型蓝图:架构设计及快速原型实现框架x1
PaperAgent 2025-02-01T16:19:19.000000Z