微调框架_Fishai

热点

"微调框架" 相关文章

RoboGPT-R1: Enhancing Robot Planning with Reinforcement Learning

cs.AI updates on arXiv.org 2025-10-17T04:10:32.000000Z

InstructPLM-mu: 1-Hour Fine-Tuning of ESM2 Beats ESM3 in Protein Mutation Predictions

cs.AI updates on arXiv.org 2025-10-07T04:14:36.000000Z

Oracle-RLAIF: An Improved Fine-Tuning Framework for Multi-modal Video Models through Reinforcement Learning from Ranking Feedback

cs.AI updates on arXiv.org 2025-10-06T04:27:13.000000Z

AdaRing: Towards Ultra-Light Vision-Language Adaptation via Cross-Layer Tensor Ring Decomposition

cs.AI updates on arXiv.org 2025-08-19T04:02:03.000000Z

Reducing Hallucinations in Summarization via Reinforcement Learning with Entity Hallucination Index

cs.AI updates on arXiv.org 2025-07-31T04:48:16.000000Z

Copyright © 2019 FISHAI.All Rights Reserved