Articles related to "value function"
Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing
cs.AI updates on arXiv.org
2025-10-15T04:38:26.000000Z
Evolutionary Guided Decoding: Iterative Value Refinement for LLMs
cs.AI updates on arXiv.org
2025-10-07T04:19:06.000000Z
Improved Monte Carlo Planning via Causal Disentanglement for Structurally-Decomposed Markov Decision Processes
cs.AI updates on arXiv.org
2025-10-06T04:28:24.000000Z
A Mechanism for Mutual Fairness in Cooperative Games with Replicable Resources -- Extended Version
cs.AI updates on arXiv.org
2025-08-20T04:17:06.000000Z
Turning up the Heat on Deceptively-Misaligned AI
LessWrong
2025-01-07T00:16:20.000000Z
Exploring Offline Reinforcement Learning (RL): Offering Practical Advice for Domain-Specific Practitioners and Future Algorithm Development
MarkTechPost@AI
2024-06-18T09:31:26.000000Z