Articles related to "data selection"
CircuitSeer: Mining High-Quality Data by Probing Mathematical Reasoning Circuits in LLMs
cs.AI updates on arXiv.org 2025-10-22T04:13:59.000000Z
Holdout-Loss-Based Data Selection for LLM Finetuning via In-Context Learning
cs.AI updates on arXiv.org 2025-10-17T04:18:22.000000Z
Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?
cs.AI updates on arXiv.org 2025-10-08T04:07:27.000000Z
mR3: Multilingual Rubric-Agnostic Reward Reasoning Models
cs.AI updates on arXiv.org 2025-10-02T04:18:49.000000Z
Train on Validation (ToV): Fast data selection with applications to fine-tuning
cs.AI updates on arXiv.org 2025-10-02T04:17:41.000000Z
Annotation-Efficient Active Test-Time Adaptation with Conformal Prediction
cs.AI updates on arXiv.org 2025-10-01T06:00:48.000000Z
Data Uniformity Improves Training Efficiency and More, with a Convergence Framework Beyond the NTK Regime
cs.AI updates on arXiv.org 2025-09-30T04:09:36.000000Z
Data-Efficient Training by Evolved Sampling
cs.AI updates on arXiv.org 2025-09-30T04:04:40.000000Z
Autoguided Online Data Curation for Diffusion Model Training
cs.AI updates on arXiv.org 2025-09-22T04:26:29.000000Z
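How to Feed Large Models Precisely Without Massive Data? SJTU's Data Whisperer: Training-Free Data Selection, with 10% of the Data Approaching Full-Dataset Performance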
机器之心 2025-07-29T13:27:15.000000Z
UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning
cs.AI updates on arXiv.org 2025-07-18T04:14:06.000000Z
Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving
cs.AI updates on arXiv.org 2025-07-10T04:06:09.000000Z
TACOS: Open Tagging and Comparative Scoring for Instruction Fine-Tuning Data Selection
cs.AI updates on arXiv.org 2025-07-08T06:58:12.000000Z
MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models
cs.AI updates on arXiv.org 2025-07-08T05:53:50.000000Z
ACL 2025 | Is Consistent Style Better Than More Data? SCAR Selects <1% of Samples and Instruction-Tuning Performance Soars
PaperWeekly 2025-06-17T09:22:41.000000Z
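ByteDance's Latest Large-Model Recipe: Train Only on Data with Reasoning Potential! A 1.3B Model Selects It Automatically, No Labels Needed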
智源社区 2025-05-16T09:14:18.000000Z
Model Performance Begins with Data: Researchers from Ai2 Release DataDecide—A Benchmark Suite to Understand Pretraining Data Impact Across 30K LLM Checkpoints
MarkTechPost@AI 2025-04-17T06:30:36.000000Z
A Comprehensive Roundup of 10 R1-Related Studies, with Ten Thousand Words of Reflection!
Datawhale 2025-03-20T16:32:17.000000Z
Enhancing Instruction Tuning in LLMs: A Diversity-Aware Data Selection Strategy Using Sparse Autoencoders
MarkTechPost@AI 2025-02-25T17:48:40.000000Z