迭代优化_Fishai

热点

"迭代优化" 相关文章

Provably Efficient Reward Transfer in Reinforcement Learning with Discrete Markov Decision Processes

cs.AI updates on arXiv.org 2025-10-21T04:29:53.000000Z

[DevOps] 内部做一个新的环境，在更新生产环境前先把生产服务等先在这个服务上过一遍，确认没问题后再上生产，这个环境叫什么环境？这种流程叫什么流程？

V2EX 2025-10-16T16:46:33.000000Z

[DevOps] 内部做一个新的环境，在更新生产环境前先把生产服务等先在这个服务上过一遍，确认没问题后再上生产，这个环境叫什么环境？这种流程叫什么流程？

V2EX 2025-10-16T11:04:35.000000Z

TGPR: Tree-Guided Policy Refinement for Robust Self-Debugging of LLMs

cs.AI updates on arXiv.org 2025-10-09T04:04:03.000000Z

Agent-in-the-Loop: A Data Flywheel for Continuous Improvement in LLM-based Customer Support

cs.AI updates on arXiv.org 2025-10-09T04:03:33.000000Z

Evolutionary Guided Decoding: Iterative Value Refinement for LLMs

cs.AI updates on arXiv.org 2025-10-07T04:19:06.000000Z

Eval playgrounds for faster, focused iteration

Braintrust Blog 2025-10-02T12:52:36.000000Z

What Machine Learning Can Teach Us About Life - 7 Lessons

https://eugeneyan.com/rss 2025-09-30T11:13:11.000000Z

The Only Prompt Generator Guide You’ll Ever Need

Yatter Blog 2025-09-25T10:02:18.000000Z

Another Turn, Better Output? A Turn-Wise Analysis of Iterative LLM Prompting

cs.AI updates on arXiv.org 2025-09-16T05:46:17.000000Z

[Solana] HODL 页面增加了历史持有量显示

V2EX 2025-07-18T07:12:11.000000Z

高级Prompt优化实战指南：用迭代优化将大模型输出质量提升200%的代码

掘金人工智能 2025-05-16T07:38:03.000000Z

Model Compression Without Compromise: Loop-Residual Neural Networks Show Comparable Results to Larger GPT-2 Variants Using Iterative Refinement

MarkTechPost@AI 2025-04-16T06:52:29.000000Z

Copyright © 2019 FISHAI.All Rights Reserved