热点
"迭代优化" 相关文章
Provably Efficient Reward Transfer in Reinforcement Learning with Discrete Markov Decision Processes
cs.AI updates on arXiv.org 2025-10-21T04:29:53.000000Z
[DevOps] 内部做一个新的环境,在更新生产环境前先把生产服务等先在这个服务上过一遍,确认没问题后再上生产,这个环境叫什么环境?这种流程叫什么流程?
V2EX 2025-10-16T16:46:33.000000Z
[DevOps] 内部做一个新的环境,在更新生产环境前先把生产服务等先在这个服务上过一遍,确认没问题后再上生产,这个环境叫什么环境?这种流程叫什么流程?
V2EX 2025-10-16T11:04:35.000000Z
TGPR: Tree-Guided Policy Refinement for Robust Self-Debugging of LLMs
cs.AI updates on arXiv.org 2025-10-09T04:04:03.000000Z
Agent-in-the-Loop: A Data Flywheel for Continuous Improvement in LLM-based Customer Support
cs.AI updates on arXiv.org 2025-10-09T04:03:33.000000Z
Evolutionary Guided Decoding: Iterative Value Refinement for LLMs
cs.AI updates on arXiv.org 2025-10-07T04:19:06.000000Z
Eval playgrounds for faster, focused iteration
Braintrust Blog 2025-10-02T12:52:36.000000Z
What Machine Learning Can Teach Us About Life - 7 Lessons
https://eugeneyan.com/rss 2025-09-30T11:13:11.000000Z
The Only Prompt Generator Guide You’ll Ever Need
Yatter Blog 2025-09-25T10:02:18.000000Z
Another Turn, Better Output? A Turn-Wise Analysis of Iterative LLM Prompting
cs.AI updates on arXiv.org 2025-09-16T05:46:17.000000Z
[Solana] HODL 页面增加了历史持有量显示
V2EX 2025-07-18T07:12:11.000000Z
高级Prompt优化实战指南:用迭代优化将大模型输出质量提升200%的代码
掘金 人工智能 2025-05-16T07:38:03.000000Z
Model Compression Without Compromise: Loop-Residual Neural Networks Show Comparable Results to Larger GPT-2 Variants Using Iterative Refinement
MarkTechPost@AI 2025-04-16T06:52:29.000000Z