MASPRM：多智能体系统推理性能优化模型

cs.AI updates on arXiv.org 10月30日 12:15

MASPRM：多智能体系统推理性能优化模型

本文提出了一种名为MASPRM的多智能体系统推理性能优化模型，通过引导推理时间搜索和选择性计算，提高了多智能体系统的测试性能。该模型在GSM8K和MATH数据集上取得了显著的性能提升，且无需重新训练即可在不同数据集上实现零样本迁移。

arXiv:2510.24803v1 Announce Type: cross Abstract: Practical deployment of Multi-Agent Systems (MAS) demands strong test-time performance, motivating methods that guide inference-time search and selectively spend compute to improve quality. We present the Multi-Agent System Process Reward Model (MASPRM). It assigns per-action, per-agent values to partial inter-agent transcripts and acts as an inference-time controller. MASPRM is trained from multi-agent Monte Carlo Tree Search (MCTS) rollouts without requiring step-level human annotations, by propagating returns to local targets. At inference, MASPRM guides step-level beam search and MCTS, focusing computation on promising branches and pruning early. On GSM8K and MATH, MASPRM-guided decoding with an outcome reward model (ORM) applied to the final answer, improves exact match (EM) over a single straight-through MAS pass by $+30.7$ and $+22.9$ points, respectively. A MASPRM trained on GSM8K transfers zero-shot to MATH without retraining, adding $8.4$ EM points at the same budget. MASPRM is a plug-in value model that estimates per-agent progress and complements verifier-style decoders, enabling more reliable, compute-aware multi-agent reasoning. Code: https://github.com/milad1378yz/MASPRM

Fish AI Reader

AI辅助创作，多种专业模板，深度分析，高质量内容生成。从观点提取到深度思考，FishAI为您提供全方位的创作支持。新版本引入自定义参数，让您的创作更加个性化和精准。

FishAI

鱼阅，AI 时代的下一个智能信息助手，助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

多智能体系统推理性能优化 MASPRM

相关文章

This AI Paper from Google DeepMind Explores the Effect of Communication Connectivity in Multi-Agent Systems

Llama-Agents: A New Open-Source AI Framework that Simplifies the Creation, Iteration, and Deployment of Multi-Agent AI Systems

LlamaIndex技术报告：知识助手的尽头是Multi-Agents！

北航沙磊教授：当Agentic RAG照进现实——Agent Insights

Benchmark Self-Evolving ｜自我进化的大模型动态评测基准

GitHub星标超16万，爆火AutoGPT进阶版来了：定制节点、多智能体协同

只需两步，让大模型智能体社区相信你是秦始皇

Researchers at FPT Software AI Center Introduce AgileCoder: A Multi-Agent System for Generating Complex Software, Surpassing MetaGPT and ChatDev

AtomAgents: A Multi-Agent AI System to Autonomously Design Metallic Alloys

MegaAgent: A Practical AI Framework Designed for Autonomous Cooperation in Large-Scale LLM Agent Systems