Perplexity_Fishai

Articles related to "Perplexity"

A Transformer-based Neural Architecture Search Method

cs.AI updates on arXiv.org 2025-11-03T05:18:34.000000Z

Rethinking GSPO: The Perplexity-Entropy Equivalence

cs.AI updates on arXiv.org 2025-10-28T04:14:35.000000Z

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

cs.AI updates on arXiv.org 2025-10-20T04:14:01.000000Z

What's the strongest AI model you can train on a laptop in five minutes?

https://www.seangoedecke.com/rss.xml 2025-10-02T12:53:42.000000Z

Slaves to the Law of Large Numbers: An Asymptotic Equipartition Property for Perplexity in Generative Language Models

cs.AI updates on arXiv.org 2025-09-15T08:35:45.000000Z

Efficiently Detecting Hidden Reasoning with a Small Predictor Model

LessWrong 2025-07-13T16:17:35.000000Z

The Weighted Perplexity Benchmark: Tokenizer-Normalized Evaluation for Language Model Comparison

LessWrong 2025-07-07T21:47:33.000000Z

Low Tokens, High Accuracy! ByteDance and Fudan Introduce the Adaptive Reasoning Framework CAR

BAAI Community 2025-05-29T01:52:53.000000Z

Low Tokens, High Accuracy! ByteDance and Fudan Introduce the Adaptive Reasoning Framework CAR

QbitAI 2025-05-27T04:36:10.000000Z

Long Text Gets Its Own Perplexity! Peking University, MIT, and Alibaba Introduce the New LongPPL Metric

Synced (机器之心) 2025-03-09T08:57:28.000000Z

NeurIPS 2024 | SparseLLM: A Breakthrough Global Pruning Technique and a Revolution in Large Language Model Sparsification

Synced (机器之心) 2024-10-10T06:11:59.000000Z

This AI Paper from Databricks and MIT Propose Perplexity-Based Data Pruning: Improving 3B Parameter Model Performance and Enhancing Language Models

MarkTechPost@AI 2024-06-05T04:00:59.000000Z
