Interconnects, September 25
Chinese Models Dominate Open Source; Western Organizations Rely on Eastern Technology

 

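Chinese models dominate the open-model space: Western organizations release the largest share of models (about 60%), but most of those build on Chinese models such as Qwen. Beyond the raw counts, Chinese labs have driven the open ecosystem forward with high-performing, permissively licensed models. This post analyzes the geographic distribution and influence of models and points to the advantages Chinese models hold at the technical frontier and in license openness. It also introduces several new open releases, including Common Pile, Virtuoso-Large, moondream2, Qwen3-Embedding-0.6B, and MiniMax-M1-80k, which show new trends in reasoning capability and openness.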

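🔍 Chinese models dominate the open-model space. Western organizations release the largest share of models (about 60%), but most of those build on Chinese models such as Qwen, which points to the central role Chinese models play in the open ecosystem.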

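📈 Beyond the raw counts, Chinese labs have driven the open ecosystem forward with high-performing, permissively licensed models. The Qwen series, for example, has had a far-reaching effect on the ecosystem because it sits near the technical frontier and ships under open licenses.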

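🌐 Most models released by Western organizations are fine-tunes of Chinese models such as Qwen. The trend reflects how much the global open ecosystem depends on Chinese technology, and the lead Chinese models hold in both capability and openness.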

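📚 The post introduces several new open releases, including Common Pile, Virtuoso-Large, moondream2, Qwen3-Embedding-0.6B, and MiniMax-M1-80k, which show new trends in reasoning capability and openness and bring fresh energy to the open ecosystem.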

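🔬 The post also analyzes the geographic distribution and influence of models, pointing to the advantages Chinese models hold at the technical frontier and in license openness, a useful reference for where open models go next.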

In previous posts, we've noted how Chinese models currently dominate the space of open models. To quantify it, we analyzed the geographic distribution of all models from past Artifacts collections. It turns out that most of the artifacts are released by Western organizations (~60%), but most of these rely on Chinese models (i.e., Qwen). Crucially, beyond the raw counts, Chinese models have also been qualitatively more impactful on the direction of the open ecosystem by releasing models closest to the frontier of performance with the most permissive licenses.

We present only a selection of models in the Artifacts series, based on our perception of their immediate and long-term impact. Our analysis is broader than just text-only language models, including image/video generation models where Chinese labs are more dominant.

For attribution, we count fine-tunes according to the team that released them, i.e., a Qwen fine-tune published by a Western company is marked as Western.

When looking into the models' heritage (as reported by HuggingFace), the picture is as expected: Qwen is the first choice for anyone who fine-tunes their model.
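The two views above (who released an artifact versus what it was built on) can be kept as separate tallies. The following is a minimal sketch of that bookkeeping, not the authors' actual analysis: the `Artifact` record, its field names, and the toy entries are all hypothetical, and the region labels are assumed to have been assigned by hand.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Artifact:
    name: str                    # hypothetical artifact name
    releaser_region: str         # region of the team that published it
    base_model: Optional[str]    # parent model, or None if trained from scratch
    base_region: Optional[str]   # region of the team behind the parent model


def tally(artifacts):
    """Return (releases per releaser region, fine-tunes per base-model region)."""
    # Attribution rule: a fine-tune counts for the team that released it.
    by_releaser = Counter(a.releaser_region for a in artifacts)
    # Heritage view: only artifacts with a parent model are counted here.
    by_base = Counter(a.base_region for a in artifacts if a.base_model is not None)
    return by_releaser, by_base


# Toy records invented for illustration only; not data from the post.
EXAMPLES = [
    Artifact("org-a/finetune-1", "Western", "Qwen", "Chinese"),
    Artifact("org-b/finetune-2", "Western", "Qwen", "Chinese"),
    Artifact("org-c/base-model", "Western", None, None),
    Artifact("org-d/base-model", "Chinese", None, None),
]

by_releaser, by_base = tally(EXAMPLES)
print(dict(by_releaser))
print(dict(by_base))
```

Keeping the two counters separate is what lets a release be "Western" under the attribution rule while still registering as built on a Chinese base model in the heritage view.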

Also, RL and reasoning are now part of the training pipeline for many model releases, so we no longer treat reasoning as its own category. Together with links being broken out into their own series, these posts should have a more streamlined structure: our picks, then models, then datasets.


Our Picks

Models

Flagship

