Trending
Articles related to "Improving Reasoning Ability"
Multi-Agent Evolve: LLM Self-Improve through Co-evolution
cs.AI updates on arXiv.org 2025-10-28T04:07:01.000000Z
First Try Matters: Revisiting the Role of Reflection in Reasoning Models
cs.AI updates on arXiv.org 2025-10-10T04:08:48.000000Z
MALT: Improving Reasoning with Multi-Agent LLM Training
cs.AI updates on arXiv.org 2025-10-07T04:18:56.000000Z
Distilling Reasoning into Student LLMs: Local Naturalness for Selecting Teacher Data
cs.AI updates on arXiv.org 2025-10-07T04:16:04.000000Z
Expanding Reasoning Potential in Foundation Model by Learning Diverse Chains of Thought Patterns
cs.AI updates on arXiv.org 2025-09-26T04:19:17.000000Z
Learning to Deliberate: Meta-policy Collaboration for Agentic LLMs with Multi-agent Reinforcement Learning
cs.AI updates on arXiv.org 2025-09-05T04:45:20.000000Z
AMFT: Aligning LLM Reasoners by Meta-Learning the Optimal Imitation-Exploration Balance
cs.AI updates on arXiv.org 2025-08-12T04:39:13.000000Z
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
cs.AI updates on arXiv.org 2025-08-08T04:17:27.000000Z
Cognitive Loop via In-Situ Optimization: Self-Adaptive Reasoning for Science
cs.AI updates on arXiv.org 2025-08-06T04:01:52.000000Z
AUTO-CEI: A Curriculum and Expert Iteration Approach to Elevate LLMs’ Response Precision and Control Refusal Rates Across Diverse Reasoning Domains
MarkTechPost@AI 2024-11-01T10:02:12.000000Z