arXiv:2510.05363v1 Announce Type: new Abstract: Adapting Foundation Models to new domains with limited training data is challenging and computationally expensive. While prior work has demonstrated the effectiveness of using domain-specific exemplars as in-context demonstrations, we investigate whether representing exemplars purely as text is the most efficient, effective, and stable approach. We explore an alternative: representing exemplars as soft prompts with an exemplar-order-invariant model architecture. To this end, we introduce Multi-Head Attention Retrieval-Augmented Generation (MHA-RAG), a framework in which the number of attention heads serves as a simple hyperparameter to control soft-prompt generation across different tasks. Across multiple question-answering benchmarks and model scales, MHA-RAG achieves a 20-point performance gain over standard RAG while cutting inference cost by a factor of 10 in GFLOPs, delivering both higher accuracy and greater efficiency, invariant to exemplar order.
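To make the idea concrete, below is a minimal sketch (not the authors' released code) of how retrieved exemplars could be mapped to soft prompts via multi-head attention so that the output is invariant to exemplar order. All names (SoftPromptGenerator, num_prompt_tokens, the dimensions) are illustrative assumptions, not details from the paper.

```python
# Hypothetical sketch: exemplar embeddings -> order-invariant soft prompts
# via attention pooling with learned query tokens.
import torch
import torch.nn as nn

class SoftPromptGenerator(nn.Module):
    def __init__(self, exemplar_dim: int, model_dim: int,
                 num_heads: int = 8, num_prompt_tokens: int = 16):
        super().__init__()
        # Learned query tokens: one soft-prompt vector per query token.
        self.queries = nn.Parameter(torch.randn(num_prompt_tokens, model_dim))
        self.proj = nn.Linear(exemplar_dim, model_dim)
        # The number of heads is the hyperparameter highlighted in the abstract.
        self.attn = nn.MultiheadAttention(model_dim, num_heads, batch_first=True)

    def forward(self, exemplar_embeddings: torch.Tensor) -> torch.Tensor:
        # exemplar_embeddings: (batch, num_exemplars, exemplar_dim)
        kv = self.proj(exemplar_embeddings)
        q = self.queries.unsqueeze(0).expand(kv.size(0), -1, -1)
        # Attention pooling over the exemplar set: permuting the exemplars
        # only permutes keys/values, so the resulting soft prompts are unchanged.
        soft_prompts, _ = self.attn(q, kv, kv)
        return soft_prompts  # (batch, num_prompt_tokens, model_dim)

# Usage: prepend the soft prompts to a frozen model's input embeddings.
gen = SoftPromptGenerator(exemplar_dim=768, model_dim=1024)
exemplars = torch.randn(2, 5, 768)   # 5 retrieved exemplars per query
prompts = gen(exemplars)             # (2, 16, 1024), order-invariant
```

The design choice illustrated here is that fixed learned queries attend over the exemplar set, so the soft prompts depend only on the set of exemplars, not the order in which they are retrieved.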
