Value Function_Fishai

Articles related to "Value Function"

Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing

cs.AI updates on arXiv.org 2025-10-15T04:38:26.000000Z

Evolutionary Guided Decoding: Iterative Value Refinement for LLMs

cs.AI updates on arXiv.org 2025-10-07T04:19:06.000000Z

Improved Monte Carlo Planning via Causal Disentanglement for Structurally-Decomposed Markov Decision Processes

cs.AI updates on arXiv.org 2025-10-06T04:28:24.000000Z

A Mechanism for Mutual Fairness in Cooperative Games with Replicable Resources -- Extended Version

cs.AI updates on arXiv.org 2025-08-20T04:17:06.000000Z

Turning up the Heat on Deceptively-Misaligned AI

LessWrong 2025-01-07T00:16:20.000000Z

Exploring Offline Reinforcement Learning (RL): Offering Practical Advice for Domain-Specific Practitioners and Future Algorithm Development

MarkTechPost@AI 2024-06-18T09:31:26.000000Z

Copyright © 2019 FISHAI. All Rights Reserved