A Quick Roundup of Recent Large Model and AI Developments

三花AI (Sanhua AI), October 11, 11:38

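This roundup covers recent developments in large models and AI. On the tooling side, it includes Google's Data Commons MCP, the public preview of GitHub Copilot CLI, Kimi's OK Computer, Perplexity's Search API, and Ollama's Web Search API. It also notes Elon Musk's Grokipedia knowledge base plan, the production availability of Google Gemini 2.5 Flash Image, OpenAI's AgentKit toolkit and Sora 2 prompting guide, and plugin support in Claude Code. On the model side, Meta, iFlytek, Alibaba, Liquid AI, Kuaishou, Ant Group, Tencent, Stability AI, DeepSeek, Anthropic, OpenAI, Zhipu, ByteDance, and Microsoft released or open-sourced new models and updates spanning code, chemistry, image, 3D generation, video, and multimodal capabilities.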

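🚀 **Tool and platform updates:** Several practical tools and platform updates have appeared recently. Google's Data Commons MCP makes public datasets queryable in natural language, GitHub Copilot CLI brings coding assistance to the command line, Kimi's OK Computer is positioned as an AI full-stack team, Perplexity launched a Search API backed by its native AI search engine, and Ollama offers a Web Search API with a limited free tier. These updates aim to improve the experience of building and using AI applications.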

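💡 **Open-source models and expanded capabilities:** Several companies continued to open-source large models. Meta open-sourced Code World Model, iFlytek open-sourced a chemistry model and a text-to-sound-effects model, Alibaba released Wan2.5-Preview, Liquid AI open-sourced the Liquid Nanos family of lightweight models, Kuaishou open-sourced KAT-Dev-32B and released KAT-Coder, Ant Group open-sourced the Ring v2 series, and Tencent Hunyuan open-sourced the Hunyuan3D-Omni and Hunyuan3D-Part 3D generation models, the HunyuanImage 3.0 image model, and the Hunyuan-Vision-1.5 vision-language model. In addition, Stability AI released SD3.5-Flash and DeepSeek open-sourced DeepSeek-V3.2-Exp.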

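✨ **Frontier model and service releases:** Closely watched models and services also received major updates. OpenAI released the AgentKit toolkit for building Agents and published an official Sora 2 prompting guide. Anthropic released the Claude Sonnet 4.5 model and announced plugin support in Claude Code. Google made Gemini 2.5 Flash Image available for production use and released Gemini CLI extensions and the Gemini 2.5 Computer Use model. Zhipu released its flagship GLM-4.6 model, and ByteDance released the Doubao 1.6-vision visual model. Elon Musk also announced plans to build Grokipedia, an open-source knowledge base.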

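🌐 **A continuously evolving AI ecosystem:** The field is iterating quickly and its ecosystem is expanding, from command-line tools to multimodal models to domain-specific applications such as chemistry, 3D generation, and video. The steady stream of open-source models speeds up adoption and innovation, while continued commercial investment in models and services keeps pushing the limits of what AI can do.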

Original post by 小茸茸, 2025-10-11 10:58, Chongqing

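After a properly long holiday, I spent today reading through recent news and large model releases the old-fashioned way, by hand, and picked out the ones worth following. Take a look at whichever interest you.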

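Worth Watching

Google releases Data Commons MCP[1]

❝Query public datasets in natural language

GitHub Copilot CLI[2] enters public preview

❝Now everyone has a CLI

Kimi launches OK Computer[3]

❝Your AI full-stack team

Perplexity releases its Search API[4] and related SDKs

❝A native AI search engine API

Ollama cloud offers a free Web Search API[5]

❝Free within limits, of course

Elon Musk announces Grokipedia, an open-source knowledge base[6]

❝The Grok universe

Nano Banana leaves preview: Google Gemini 2.5 Flash Image[7] is officially released

❝Ready for production use

OpenAI releases the AgentKit[8] toolkit

❝Build, deploy, and optimize your Agent in one place

OpenAI publishes the official Sora 2 prompting guide[9]

❝Time to study, folks

Google launches the Gemini CLI extensions[10] ecosystem

Claude Code now supports plugins[11]

❝Coding CLIs enter the era of extensions and plugins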

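Large models:

Meta open-sources Code World Model[12]

iFlytek open-sources a chemistry model and a text-to-sound-effects model[13]

Alibaba releases Wan2.5-Preview[14]

Liquid AI open-sources the Liquid Nanos[15] family of lightweight models

Kuaishou open-sources KAT-Dev-32B[16] and releases KAT-Coder[17]

Ant Group open-sources the Ring v2 series[18] of models

Tencent Hunyuan open-sources the Hunyuan3D-Omni[19] and Hunyuan3D-Part[20] 3D generation models

Stability AI releases SD3.5-Flash[21]

Tencent Hunyuan open-sources the HunyuanImage 3.0[22] model

DeepSeek open-sources DeepSeek-V3.2-Exp

Anthropic releases the Claude Sonnet 4.5[23] model

OpenAI releases the Sora 2[24] video model

Zhipu releases the flagship GLM-4.6 model

ByteDance releases the Doubao 1.6-vision visual model

Google releases the Gemini 2.5 Computer Use[25] model

Tencent Hunyuan open-sources the Hunyuan-Vision-1.5[26] vision-language model

Microsoft releases UserLM-8b[27], a user-role model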

References

[1] Data Commons MCP: https://developers.googleblog.com/en/datacommonsmcp/

[2] GitHub Copilot CLI: https://github.com/github/copilot-cli

[3] OK Computer: https://www.kimi.com

[4] Search API: https://www.perplexity.ai/hub/blog/introducing-the-perplexity-search-api

[5] Web Search API: https://ollama.com/blog/web-search

[6] Elon Musk announces Grokipedia, an open-source knowledge base: https://x.com/elonmusk/status/1972992095859433671

[7] Google Gemini 2.5 Flash Image: https://developers.googleblog.com/en/gemini-2-5-flash-image-now-ready-for-production-with-new-aspect-ratios/

[8] AgentKit: https://openai.com/index/introducing-agentkit/

[9] Sora 2 official prompting guide: https://cookbook.openai.com/examples/sora/sora2_prompting_guide

[10] Gemini CLI extensions: https://geminicli.com/extensions/

[11] Claude Code plugin support: https://www.anthropic.com/news/claude-code-plugins

[12] Code World Model: https://ai.meta.com/research/publications/cwm/

[13] iFlytek chemistry model and text-to-sound-effects model: https://modelscope.cn/organization/iflytek

[14] Wan2.5-Preview: https://x.com/Alibaba_Wan/status/1970697244740591917

[15] Liquid Nanos: https://huggingface.co/collections/LiquidAI/liquid-nanos-68b98d898414dd94d4d5f99a

[16] KAT-Dev-32B: https://huggingface.co/Kwaipilot/KAT-Dev

[17] KAT-Coder: https://kwaipilot.github.io/KAT-Coder/

[18] Ring v2 series: https://huggingface.co/collections/inclusionAI/ring-v2-68db3941a6c4e984dd2015fa

[19] Hunyuan3D-Omni: https://github.com/Tencent-Hunyuan/Hunyuan3D-Omni

[20] Hunyuan3D-Part: https://github.com/Tencent-Hunyuan/Hunyuan3D-Part

[21] SD3.5-Flash: https://hmrishavbandy.github.io/sd35flash/

[22] HunyuanImage 3.0: https://github.com/Tencent-Hunyuan/HunyuanImage-3.0

[23] Claude Sonnet 4.5: https://www.anthropic.com/news/claude-sonnet-4-5

[24] Sora 2: https://openai.com/index/sora-2/

[25] Gemini 2.5 Computer Use: https://blog.google/technology/google-deepmind/gemini-computer-use-model/

[26] Hunyuan-Vision-1.5: https://github.com/Tencent-Hunyuan/HunyuanVision

[27] UserLM-8b: https://huggingface.co/microsoft/UserLM-8b
