TechCrunch News 10月10日 02:00
Datacurve获1500万美元融资,以“赏金猎人”模式解决AI训练数据难题
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

AI公司日益成熟,高质量训练数据成为行业竞争焦点。在此背景下,Datacurve凭借其创新的“赏金猎人”模式,吸引了1500万美元的A轮融资,领投方为Chemistry的Mark Goldberg。该公司专注于为软件开发提供高质量数据,通过支付高额赏金激励技术人员贡献稀缺数据集。与传统数据标注不同,Datacurve将用户体验置于首位,致力于打造一款吸引和留住高素质人才的“消费级产品”。随着AI模型对复杂数据需求的增长,Datacurve的模式有望在金融、营销、医疗等领域扩展应用。

🌟 **创新数据收集模式:** Datacurve采用“赏金猎人”系统,通过支付高额赏金吸引和激励高技能软件工程师贡献难以获取的训练数据集。该模式已成功分发超过100万美元的赏金,显示了其在解决AI数据稀缺性方面的有效性。

💡 **用户体验至上:** 公司联合创始人Serena Ge强调,Datacurve将数据收集视为“消费产品”而非简单的数据标注操作。通过优化平台的用户体验,旨在吸引并留住对平台感兴趣的顶尖人才,这对于需要高价值服务(如软件开发)的数据收集至关重要。

🚀 **应对AI数据需求增长:** 随着AI模型从早期简单的训练数据集转向更复杂的RL环境,对数据在数量和质量上的要求日益提高。Datacurve的高质量数据收集策略,特别是在软件开发领域,使其能够满足这些不断增长和复杂化的需求,为公司在行业内建立竞争优势。

🌐 **未来扩展潜力:** 目前Datacurve主要聚焦于软件工程领域,但联合创始人Serena Ge表示,其创新的数据收集模式具有广泛的应用潜力,可以轻松扩展到金融、营销甚至医疗等其他高价值领域,为不同行业的AI发展提供关键数据支持。

As AI companies mature, the fight for high-quality data has become one of the most competitive areas in the industry, launching companies like Mercor, Surge, and, most prominently, Alexandr Wang’s ScaleAI. But now that Wang has moved to run AI at Meta, many funders see an opening — and are willing to fund companies with compelling new strategies for collecting training data.

The Y Combinator graduate Datacurve is one such company, focusing on high-quality data for software development. On Thursday, the company announced a $15 million Series A round, led by Mark Goldberg at Chemistry with participation from employees at DeepMind, Vercel, Anthropic, and OpenAI. The Series A comes after a $2.7 million seed round, which drew investment from former Coinbase CTO Balaji Srinivasan.

Datacurve uses a “bounty hunter” system to attract skilled software engineers to complete the hardest-to-source datasets. The company pays for those contributions, distributing over $1 million in bounties so far.

But co-founder Serena Ge (pictured above with co-founder Charley Lee) says the biggest motivation isn’t financial. For high-value services like software development, the pay will always be far lower for data work than conventional employment — so the company’s most important edge is a positive user experience.

“We treat this as a consumer product, not a data labeling operation,” Ge said. “We spend a lot of time thinking about: How can we optimize it so that the people we want are interested and get onto our platform?”

That’s particularly important as the needs of post-training data grow more complex. While earlier models were trained on simple datasets, today’s AI products rely on complex RL environments, which need to be constructed through specific and strategic data collection. As the environments grow more sophisticated, the data requirements become both more intense for both quantity and quality — a factor that could give high-quality data collection companies like Datacurve an edge.

As an early-stage company, Datacurve is focused on software engineering, but Ge says the model could apply just as easily to fields like finance, marketing, or even medicine.

Techcrunch event

San Francisco | October 27-29, 2025

“What we’re doing right now is we’re creating an infrastructure for post training data collection that attracts and retains highly competent people in their own domains,” Ge says.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Datacurve AI训练数据 数据收集 软件开发 赏金系统 融资 AI Machine Learning Data Collection Software Development Bounty System Funding
相关文章