后训练方法_Fishai

热点

"后训练方法" 相关文章

Human-in-the-loop Online Rejection Sampling for Robotic Manipulation

cs.AI updates on arXiv.org 2025-10-31T04:07:52.000000Z

推理效率狂飙60倍：DiDi-Instruct让扩散大模型16步超越千步GPT

机器之心 2025-10-27T07:15:35.000000Z

Reconstruction Alignment Improves Unified Multimodal Models

cs.AI updates on arXiv.org 2025-09-19T05:06:02.000000Z

A post-training approach to AI regulation with Model Specs

Interconnects 2024-10-22T06:07:43.000000Z

Post-Training有多重要？AI2研究员长文详解前沿模型的后训练秘籍

智源社区 2024-08-20T06:07:37.000000Z

Copyright © 2019 FISHAI.All Rights Reserved