Trending
Articles related to "Perplexity"
A Transformer-based Neural Architecture Search Method
cs.AI updates on arXiv.org 2025-11-03T05:18:34.000000Z
Rethinking GSPO: The Perplexity-Entropy Equivalence
cs.AI updates on arXiv.org 2025-10-28T04:14:35.000000Z
A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning
cs.AI updates on arXiv.org 2025-10-20T04:14:01.000000Z
What's the strongest AI model you can train on a laptop in five minutes?
https://www.seangoedecke.com/rss.xml 2025-10-02T12:53:42.000000Z
Slaves to the Law of Large Numbers: An Asymptotic Equipartition Property for Perplexity in Generative Language Models
cs.AI updates on arXiv.org 2025-09-15T08:35:45.000000Z
Efficiently Detecting Hidden Reasoning with a Small Predictor Model
LessWrong 2025-07-13T16:17:35.000000Z
The Weighted Perplexity Benchmark: Tokenizer-Normalized Evaluation for Language Model Comparison
LessWrong 2025-07-07T21:47:33.000000Z
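Fewer Tokens, Higher Accuracy! ByteDance and Fudan Introduce the Adaptive Reasoning Framework CAR
BAAI Community (智源社区) 2025-05-29T01:52:53.000000Z
Fewer Tokens, Higher Accuracy! ByteDance and Fudan Introduce the Adaptive Reasoning Framework CAR
QbitAI (量子位) 2025-05-27T04:36:10.000000Z
Long Text Gets Its Own Perplexity! Peking University, MIT, and Alibaba Introduce the New LongPPL Metric
Synced (机器之心) 2025-03-09T08:57:28.000000Z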
NeurIPS 2024 | SparseLLM: A Breakthrough Global Pruning Technique and a Revolution in Large Language Model Sparsification
Synced (机器之心) 2024-10-10T06:11:59.000000Z
This AI Paper from Databricks and MIT Propose Perplexity-Based Data Pruning: Improving 3B Parameter Model Performance and Enhancing Language Models
MarkTechPost@AI 2024-06-05T04:00:59.000000Z