Articles related to "pre-training data"
Un-Attributability: Computing Novelty From Retrieval & Semantic Similarity
cs.AI updates on arXiv.org
2025-11-03T05:19:31.000000Z
All Code, No Thought: Current Language Models Struggle to Reason in Ciphered Language
cs.AI updates on arXiv.org
2025-10-14T04:13:27.000000Z
Reinforcement Learning on Pre-Training Data
cs.AI updates on arXiv.org
2025-09-26T04:24:09.000000Z
Extrinsic Hallucinations in LLMs
Lil'Log
2025-09-25T10:02:04.000000Z
Kling 2.0: uncanny valley crossed — video creation will never be the same
Coding with Intelligence
2025-09-25T10:01:24.000000Z
Analyzing Generalization in Pre-Trained Symbolic Regression
cs.AI updates on arXiv.org
2025-09-25T05:51:54.000000Z
Ads Inserted into Code: Are Tencent's Codebuddy and Its Peers "Taking the Blame"? After DeepSeek's "极你太美" Incident, Can Other Models Escape Either?
36kr
2025-08-27T07:50:21.000000Z
Aligning Instruction Tuning with Pre-training
cs.AI updates on arXiv.org
2025-08-12T04:39:25.000000Z
Topic Over Source: The Key to Effective Data Mixing for Language Models Pre-training
cs.AI updates on arXiv.org
2025-08-11T04:08:23.000000Z
Juru: Legal Brazilian Large Language Model from Reputable Sources
cs.AI updates on arXiv.org
2025-07-29T04:21:40.000000Z
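Scalpel-Style Denoising Breaks Through the LLM Capability Ceiling: Models Pre-Trained from Scratch Improve 7.2% on Average on Downstream Tasks | Chinese Academy of Sciences & Alibaba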
智源社区
2025-07-22T04:47:48.000000Z
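Scalpel-Style Denoising Breaks Through the LLM Capability Ceiling: Models Pre-Trained from Scratch Improve 7.2% on Average on Downstream Tasks | Chinese Academy of Sciences & Alibaba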
量子位
2025-07-21T17:37:00.000000Z
RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs
cs.AI updates on arXiv.org
2025-07-08T05:54:13.000000Z
ICLR 2025 | Zhejiang University and Qwen Release Pre-Training Data Manager DataMan, 53 Pages Packed with Details
机器之心
2025-02-28T05:40:33.000000Z
AI Learns to "Use Its Brain" on Math: UCL and Others Find "Procedural Knowledge" in LLMs, Showing Reasoning Is Not Memorizing Answers
36kr-科技
2024-12-02T07:06:37.000000Z
If Large Models Cannot Reason, Why Can They Still Find an Approach? Someone Has Worked Out the Principle
机器之心
2024-11-22T06:10:07.000000Z
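Multiple Chinese Teams Win EMNLP'24 Best Paper Awards! UCLA Chinese Scholar Lands Three Outstanding Papers; Next Year's Top Conference Heads to Suzhou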
智源社区
2024-11-16T10:53:04.000000Z
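Multiple Chinese Teams Win EMNLP'24 Best Paper Awards; UCLA Chinese Scholar Lands Three Outstanding Papers; Next Year's Top Conference Heads to Suzhou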
36氪 - 科技频道
2024-11-15T07:43:19.000000Z
Building Safer AI from the Ground Up: Steering Model Behavior via Pre-Training Data Curation
少点错误
2024-09-29T20:22:49.000000Z
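OpenAI's Lilian Weng Introduces "Extrinsic Hallucination" in Large Models: A 10,000-Character Blog Post Details Mitigation Methods, Causes of Hallucination, and Detection Approaches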
IT之家
2024-07-13T15:23:21.000000Z