Articles related to "Model Reliability"
Charting the future of AI, from safer answers to faster thinking
MIT News - Computer Science and Artificial Intelligence Laboratory
2025-11-06T21:56:59.000000Z
BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents
cs.AI updates on arXiv.org
2025-10-28T04:14:36.000000Z
Neural Diversity Regularizes Hallucinations in Small Models
cs.AI updates on arXiv.org
2025-10-24T04:50:21.000000Z
Scalable multilingual PII annotation for responsible AI in LLMs
cs.AI updates on arXiv.org
2025-10-09T04:05:45.000000Z
A novel hallucination classification framework
cs.AI updates on arXiv.org
2025-10-08T04:09:53.000000Z
Our Experience Running Independent Evaluations on LLMs: What Have We Learned?
LessWrong
2025-10-03T19:22:11.000000Z
Confidence-Aware Routing for Large Language Model Reliability Enhancement: A Multi-Signal Approach to Pre-Generation Hallucination Mitigation
cs.AI updates on arXiv.org
2025-10-03T04:11:46.000000Z
Enhancing Safety in Diabetic Retinopathy Detection: Uncertainty-Aware Deep Learning Models with Rejection Capabilities
cs.AI updates on arXiv.org
2025-10-02T04:16:43.000000Z
Calibrating Verbalized Confidence with Self-Generated Distractors
cs.AI updates on arXiv.org
2025-10-01T06:00:32.000000Z
Anthropic: A postmortem of three recent issues
https://simonwillison.net/atom/everything
2025-09-30T11:10:53.000000Z
Can Large Language Models Express Uncertainty Like Human?
cs.AI updates on arXiv.org
2025-09-30T04:06:20.000000Z
EchoBench: Benchmarking Sycophancy in Medical Large Vision-Language Models
cs.AI updates on arXiv.org
2025-09-25T05:58:36.000000Z
Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM
cs.AI updates on arXiv.org
2025-09-23T06:11:36.000000Z
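From out-of-distribution detection to code generation, this PhD student wants to make AI both reliable and easy to use
MIT Technology Review - This Week's Top Stories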
2025-09-11T15:46:44.000000Z
Understanding the "shield" of large AI models in one article: all 283 LLM benchmarks from across the industry are here
BAAI Community
2025-09-02T19:29:42.000000Z
SycEval: Evaluating LLM Sycophancy
cs.AI updates on arXiv.org
2025-08-22T04:02:16.000000Z
Adversarial Attacks on VQA-NLE: Exposing and Alleviating Inconsistencies in Visual Question Answering Explanations
cs.AI updates on arXiv.org
2025-08-19T04:21:25.000000Z
Trustworthy Medical Imaging with Large Language Models: A Study of Hallucinations Across Modalities
cs.AI updates on arXiv.org
2025-08-12T04:39:19.000000Z