热点
关于我们
xx
xx
"
微调框架
" 相关文章
RoboGPT-R1: Enhancing Robot Planning with Reinforcement Learning
cs.AI updates on arXiv.org
2025-10-17T04:10:32.000000Z
InstructPLM-mu: 1-Hour Fine-Tuning of ESM2 Beats ESM3 in Protein Mutation Predictions
cs.AI updates on arXiv.org
2025-10-07T04:14:36.000000Z
Oracle-RLAIF: An Improved Fine-Tuning Framework for Multi-modal Video Models through Reinforcement Learning from Ranking Feedback
cs.AI updates on arXiv.org
2025-10-06T04:27:13.000000Z
AdaRing: Towards Ultra-Light Vision-Language Adaptation via Cross-Layer Tensor Ring Decomposition
cs.AI updates on arXiv.org
2025-08-19T04:02:03.000000Z
Reducing Hallucinations in Summarization via Reinforcement Learning with Entity Hallucination Index
cs.AI updates on arXiv.org
2025-07-31T04:48:16.000000Z