Articles related to "Synthetic Datasets"
Ground-Truth Subgraphs for Better Training and Evaluation of Knowledge Graph Augmented LLMs
cs.AI updates on arXiv.org
2025-11-07T05:50:49.000000Z
GOAT: A Training Framework for Goal-Oriented Agent with Tools
cs.AI updates on arXiv.org
2025-10-15T04:39:52.000000Z
TalkPlayData 2: An Agentic Synthetic Data Pipeline for Multimodal Conversational Music Recommendation
cs.AI updates on arXiv.org
2025-09-15T08:13:36.000000Z
Region-to-Region: Enhancing Generative Image Harmonization with Adaptive Regional Injection
cs.AI updates on arXiv.org
2025-08-14T05:12:51.000000Z
CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation
cs.AI updates on arXiv.org
2025-08-08T04:17:48.000000Z
NYU Researchers Introduce WILDCHAT-50M: A Large-Scale Synthetic Dataset for Efficient LLM Post-Training
MarkTechPost@AI
2025-02-04T18:46:57.000000Z
SmolTalk Released: The Dataset Recipe Behind the Best-in-Class Performance of SmolLM2
MarkTechPost@AI
2024-11-21T17:05:15.000000Z
Breaking the Bottleneck of Video Multimodal Large Models! "Synthetic Data" Plays a Major Role, and the Project Is Open Source
机器之心
2024-10-21T08:11:33.000000Z
A "Bitter Lesson" Approach to Aligning AGI and ASI
LessWrong
2024-07-06T01:35:10.000000Z
Researchers at the University of Wisconsin-Madison Propose a Finetuning Approach Utilizing a Carefully Designed Synthetic Dataset Comprising Numerical Key-Value Retrieval Tasks
MarkTechPost@AI
2024-07-03T04:01:50.000000Z
Gretel AI Releases a New Multilingual Synthetic Financial Dataset on HuggingFace for AI Developers Tackling Personally Identifiable Information PII Detection
MarkTechPost@AI
2024-06-14T03:01:38.000000Z