计算扩展_Fishai

热点

"计算扩展" 相关文章

How Well Does RL Scale?

少点错误 2025-10-22T13:48:55.000000Z

How Well Does RL Scale?

少点错误 2025-10-22T13:48:55.000000Z

Sigmoidal Scaling Curves Make Reinforcement Learning RL Post-Training Predictable for LLMs

MarkTechPost@AI 2025-10-18T02:42:51.000000Z

ParaThinker: Scaling LLM Test-Time Compute with Native Parallel Thinking to Overcome Tunnel Vision in Sequential Reasoning

MarkTechPost@AI 2025-09-09T09:28:27.000000Z

ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute

cs.AI updates on arXiv.org 2025-09-08T04:51:36.000000Z

Can LLMs Really Judge with Reasoning? Microsoft and Tsinghua Researchers Introduce Reward Reasoning Models to Dynamically Scale Test-Time Compute for Better Alignment

MarkTechPost@AI 2025-05-26T18:25:50.000000Z

The State of LLM Reasoning Models

Ahead of AI 2025-03-08T12:12:31.000000Z

o1: A Technical Primer

少点错误 2024-12-09T19:13:15.000000Z

Copyright © 2019 FISHAI.All Rights Reserved