热点
关于我们
xx
xx
"
战略推理
" 相关文章
LM Fight Arena: Benchmarking Large Multimodal Models via Game Competition
cs.AI updates on arXiv.org
2025-10-13T04:09:09.000000Z
ChessArena: A Chess Testbed for Evaluating Strategic Reasoning Capabilities of Large Language Models
cs.AI updates on arXiv.org
2025-09-30T04:06:26.000000Z
刚刚,大模型棋王诞生,40轮血战,OpenAI o3豪夺第一,人类大师地位不保?
36kr
2025-08-22T11:51:38.000000Z
CHBench: A Cognitive Hierarchy Benchmark for Evaluating Strategic Reasoning Capability of LLMs
cs.AI updates on arXiv.org
2025-08-19T04:01:24.000000Z
A Multi-Agent Pokemon Tournament for Evaluating Strategic Reasoning of Large Language Models
cs.AI updates on arXiv.org
2025-08-05T11:10:09.000000Z
From Extraction to Synthesis: Entangled Heuristics for Agent-Augmented Strategic Reasoning
cs.AI updates on arXiv.org
2025-07-21T04:06:33.000000Z
Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning
cs.AI updates on arXiv.org
2025-07-17T04:14:14.000000Z