cs.AI updates on arXiv.org, Oct 27, 14:25
New Method Improves the Generalization of Fine-Tuned Models

This paper proposes a new class of reparameterization methods aimed at improving the generalization ability of fine-tuned models. Its effectiveness is validated through high-dimensional binary classification experiments and LLM fine-tuning experiments.

arXiv:2510.21345v1 Announce Type: cross Abstract: Fine-tuning has proven to be highly effective in adapting pre-trained models to perform better on new target tasks with minimal data samples. Among the most widely used approaches are reparameterization methods, which update a target module by augmenting its frozen weight matrix with an additional trainable weight matrix. The most prominent example is Low-Rank Adaptation (LoRA), which has gained significant attention in recent years. In this paper, we introduce a new class of reparameterization methods for transfer learning, designed to enhance the generalization ability of fine-tuned models. We establish the effectiveness of our approach in a high-dimensional binary classification setting using tools from Random Matrix Theory, and further validate our theoretical findings through more realistic experiments, such as fine-tuning LLMs.
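The abstract does not spell out the paper's new reparameterization class, but it describes the general family: a frozen pre-trained weight matrix W augmented with a small trainable update. As a reference point, here is a minimal sketch of the LoRA baseline it names, where the update is a low-rank product BA added to the frozen weights. The class name, initialization, and the rank/alpha hyperparameters below are illustrative choices, not taken from the paper.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Linear layer with a frozen base weight W and a trainable
    low-rank update, i.e. y = x (W + scale * B A)^T + bias.
    Illustrative sketch of the LoRA-style reparameterization family."""

    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # freeze the pre-trained weights

        in_f, out_f = base.in_features, base.out_features
        # Low-rank factors: A starts small and random, B starts at zero,
        # so the initial forward pass equals the frozen base layer.
        self.A = nn.Parameter(torch.randn(rank, in_f) * 0.01)
        self.B = nn.Parameter(torch.zeros(out_f, rank))
        self.scale = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # (batch, in_f) @ (in_f, rank) @ (rank, out_f) -> (batch, out_f)
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

# Usage: only A and B receive gradients during fine-tuning.
layer = LoRALinear(nn.Linear(768, 768), rank=8)
out = layer(torch.randn(4, 768))
```

Because only the two low-rank factors are trainable, the number of updated parameters is rank * (in_f + out_f) rather than in_f * out_f, which is what makes this family attractive for adapting large models with few data samples.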

Related tags

Fine-tuning · Generalization · Reparameterization · LLMs · Fine-tuned models