Artificial Ignorance 10月07日
OpenAI DevDay 2024:新品发布与未来展望
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

OpenAI的DevDay 2024活动发布了一系列重要更新,旨在拓展AI应用场景并简化开发者体验。重点包括全新的ChatGPT Apps,引入了基于模型上下文协议(MCP)的交互式应用,期望吸引更多开发者构建生态;AgentKit则集成了Agent Builder、ChatKit、Evals等工具,为构建和部署AI智能体提供一站式解决方案,可能对现有AI初创公司构成挑战。Codex云编码代理现已通用可用,并新增Slack集成和SDK,增强了协作和集成能力。最后,压轴发布了Sora 2、GPT-5 Pro及更高效的gpt-realtime-mini和gpt-image-1-mini模型。本次大会显示OpenAI正从开发者社区扩展到更广泛的市场,其成功取决于新应用商店模式的接受度和企业级解决方案的吸引力。

🚀 **ChatGPT Apps的演进与生态构建**:OpenAI第三次尝试推出ChatGPT Apps,此次基于模型上下文协议(MCP),允许应用内嵌交互式UI元素,并支持与ChatGPT对话的双向上下文传递。早期合作伙伴包括Booking.com、Canva等知名企业。此举旨在通过ChatGPT的广泛用户基础,为开发者提供一个强大的分发渠道,但其成功与否仍取决于开发者对经济模型和分发机制的信任。

🛠️ **AgentKit赋能AI智能体开发**:AgentKit整合了Agent Builder(无代码可视化构建工具)、ChatKit(可嵌入的对话式AI聊天组件)、Evals(评估与优化工具)和Connector Registry(数据连接中心)等,为开发者提供构建、部署和优化AI工作流的一站式平台。这标志着OpenAI正构建一个强大的AI开发基础设施,可能对提供类似功能的初创公司带来显著竞争压力。

💻 **Codex增强开发者工具链**:Codex云编码代理现已全面可用,并新增了Slack集成,允许在Slack频道中直接使用Codex进行任务委托和代码生成,同时提供SDK以便集成到第三方工具和工作流中。此外,还加入了面向企业用户的管理功能,如环境控制和使用分析,显示出OpenAI在提升开发者效率和企业级支持方面的持续投入。

🧠 **模型更新与性能提升**:在发布会的最后,OpenAI公布了多款新模型,包括下一代视频生成模型Sora 2、更强大的推理模型GPT-5 Pro,以及为实时应用设计的更快速、更经济的gpt-realtime-mini和gpt-image-1-mini模型。这些模型预示着AI在内容生成和复杂任务处理能力上的进一步飞跃。

For the second time in three years (I missed last year), I had the chance to attend OpenAI’s DevDay in person - a slick, stylish event at San Francisco’s Fort Mason.

Two years ago, DevDay was all about making developers’ lives easier with better APIs and the Assistants framework. And in some ways, OpenAI is still pursuing many of the themes from two years ago; in others, it appears that the focus has expanded beyond “just” developers.

Let’s break down what was announced and what it all means.

Subscribe now

ChatGPT Apps (Take Three)

Remember plugins? Remember GPTs? Well, OpenAI is trying again with ChatGPT Apps. Third time’s the charm?

What’s new:

The MCP foundation is a smart move. Rather than fighting the momentum behind MCP, OpenAI is embracing it while adding its own layer: OAuth flows and HTML scaffolding for richer UIs. This feels much more pragmatic than the proprietary plugin architecture from 2023. Based on some of the demos, this is going to be a handy way to add richer interaction; however, I’m sure we’ll soon feel the limits of the APIs.

But the billion-dollar question remains: Will developers actually build apps en masse? You’re essentially becoming a headless data/action endpoint in exchange for ChatGPT’s distribution. That’s a compelling trade-off if the distribution is real and the monetization is fair - both of which are TBD.

Also, some practical concerns:

AgentKit: The No-Code Agent Stack

OpenAI is packaging up everything you need to build, deploy, and optimize “agentic” workflows into a single offering called AgentKit.

What’s included:

Look: a ton of headlines are about to proclaim that OpenAI is “killing hundreds of startups.” And that might be true - Agent Builder certainly looks like a lightweight n8n or Zapier clone. But it’ll likely need a decent amount of additional engineering and product investment before it becomes a serious competitor to those tools. So while those startups might not be “dead,” they’re certainly on notice.

But it’s good to see acknowledgement that the drag-and-drop interface isn’t everything you need. ChatKit is clever. It reminds me of the conversation equivalent of Stripe’s payment widget - you get a battle-tested UI without building it yourself. For B2B SaaS companies wanting to add AI chat without reinventing the wheel, this is a no-brainer. And the evals expansion will hopefully get more people into the camp of testing their LLM apps more rigorously, instead of relying on vibes every time there’s a prompt update.

Codex: Now With Slack and an SDK

Codex (OpenAI’s cloud coding agent) has been in preview for a while, but today it hit general availability with some developer-friendly additions.

The latest:

Codex has been iterating quickly, and it shows1. The Slack integration is the kind of thing that’ll make non-engineer managers very happy - delegate a task, get a link back when it’s done. For teams with less technical PMs, this lowers the barrier to leveraging Codex significantly.

I can’t currently speak to how good the Codex model is versus, say, Anthropic’s Claude or even Cursor. But it’s clear OpenAI is investing heavily in the developer experience and enterprise controls. The admin dashboards and environment management are exactly the kind of thing that’ll convince larger orgs to roll this out org-wide.

But perhaps the most interesting piece was the SDK. It turns Codex from “a tool” into “a platform component” that you can plug into your own internal tools. In the demos they showed, apps had the ability to “edit themselves” using the Codex SDK and live-reload code changes. I’m still wrapping my brain around what types of products would benefit the most from that.

New Models: The Last-Minute Bombshells

In the final 10 minutes of the keynote, OpenAI dropped four new models into the API:

I’m perhaps most excited about these - and judging from Twitter, the Sora 2 announcement is going to dominate viral moments for the next week.

Final Thoughts

OpenAI really, really wants an app store. This is their third attempt (plugins → GPTs → Apps), and they’re bringing heavier artillery each time. MCP as the foundation helps, but the success of this platform hinges entirely on whether developers trust the economics.

AgentKit is (probably) a startup killer. From eval tools to agent builders to embeddable chat widgets, OpenAI just bundled features that previously required multiple vendors. At this point, if you’re building tooling for almost anything in the LLM ecosystem, you need to ask: “What happens when OpenAI builds this into their platform?”

The terminology wars continue. “Agent” means different things to different people, and OpenAI’s usage here (essentially, workflow automation) is likely to frustrate those who define agents more narrowly. We were just starting to find a little consensus on what constitutes an “agent2” - OpenAI has made things a bit murkier now.

Two years ago, DevDay seemed far more focused on developers. This year, the focus has unmistakably broadened - the Slack integrations, no-code builders, and embeddable widgets are as much for PMs and managers as they are for engineers. OpenAI is chasing a bigger market than “just” developers.

Whether that strategy pays off depends on questions they still haven’t answered: Will the app store work this time? Will enterprises trust an all-in-one platform? And can they avoid the GPT Store’s fate of quietly fading into irrelevance?

See you in 2027 for DevDay 3.0 - and probably Apps 4.0.

Thanks for reading Artificial Ignorance! This post is public so feel free to share it.

Share

1

Though also worth noting that it’s not necessarily groundbreaking - Claude and Devin (among others) have had similar capabilities for a while.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

OpenAI DevDay ChatGPT Apps AgentKit Codex AI模型 人工智能 Sora 2 GPT-5 Pro
相关文章