Trending
Articles related to "continuous control"
Actor-Free Continuous Control via Structurally Maximizable Q-Functions
cs.AI updates on arXiv.org 2025-10-22T04:25:42.000000Z
Self-Evidencing Through Hierarchical Gradient Decomposition: A Dissipative System That Maintains Non-Equilibrium Steady-State by Minimizing Variational Free Energy
cs.AI updates on arXiv.org 2025-10-22T04:17:18.000000Z
A New Perspective on Transformers in Online Reinforcement Learning for Continuous Control
cs.AI updates on arXiv.org 2025-10-16T04:27:18.000000Z
Noise-Guided Transport for Imitation Learning
cs.AI updates on arXiv.org 2025-10-01T06:01:42.000000Z
Efficient On-Policy Reinforcement Learning via Exploration of Sparse Parameter Space
cs.AI updates on arXiv.org 2025-10-01T06:01:09.000000Z
Frictional Q-Learning
cs.AI updates on arXiv.org 2025-09-25T05:50:19.000000Z
Incentivizing Safer Actions in Policy Optimization for Constrained Reinforcement Learning
cs.AI updates on arXiv.org 2025-09-12T04:19:11.000000Z
Categorical Policies: Multimodal Policy Learning and Exploration in Continuous Control
cs.AI updates on arXiv.org 2025-08-20T04:17:23.000000Z
Extending Group Relative Policy Optimization to Continuous Control: A Theoretical Framework for Robotic Reinforcement Learning
cs.AI updates on arXiv.org 2025-07-29T04:21:51.000000Z
Autonomous Control Leveraging LLMs: An Agentic Framework for Next-Generation Industrial Automation
cs.AI updates on arXiv.org 2025-07-11T04:03:54.000000Z