Trending
Articles related to "confidence"
Think Just Enough: Sequence-Level Entropy as a Confidence Signal for LLM Reasoning
cs.AI updates on arXiv.org 2025-10-10T04:16:12.000000Z
Is It Certainly a Deepfake? Reliability Analysis in Detection & Generation Ecosystem
cs.AI updates on arXiv.org 2025-09-23T05:24:36.000000Z
Mirror-Consistency: Harnessing Inconsistency in Majority Voting
cs.AI updates on arXiv.org 2025-09-18T04:58:18.000000Z
Meta AI Introduces DeepConf: First AI Method to Achieve 99.9% on AIME 2025 with Open-Source Models Using GPT-OSS-120B
MarkTechPost@AI 2025-08-27T17:19:29.000000Z
Overconfidence in LLM-as-a-Judge: Diagnosis and Confidence-Driven Solution
cs.AI updates on arXiv.org 2025-08-11T04:08:19.000000Z
New method assesses and improves the reliability of radiologists’ diagnostic reports
MIT News - Artificial intelligence 2025-04-04T04:12:06.000000Z
Presentation of uncertainty in geoscience Large Language Models (LLM)
A Geodyssey – Enterprise Search Discovery, Text Mining, Machine Learning 2024-11-28T10:37:08.000000Z