https://simonwillison.net/atom/everything 10月17日 05:39
Claude Skills:AI能力扩展新范式
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Anthropic发布了Claude Skills,一种扩展AI模型新能力的新模式。Skills以文件夹形式存在,包含指令、脚本和资源,Claude在需要时加载。这使得Claude在处理Excel、遵循品牌指南等专业任务时表现更佳。Claude的文档创建和代码解释功能已通过Skills实现,支持多种文件格式。Skills机制依赖于文件系统和命令执行环境,与MCP和ChatGPT Plugins不同,它更高效且易于共享,预示着AI能力将迎来爆发式增长。

💡 **Claude Skills的引入与机制**:Claude Skills是一种全新的AI能力扩展模式,以文件夹形式封装了指令、脚本和资源。Claude仅在任务相关时加载并使用这些Skills,从而提升了其在特定领域的表现,例如处理Excel文件或遵循企业品牌规范。

📄 **Skills在文档处理及代码解释中的应用**:Claude新推出的文档创建能力和代码解释功能,均是基于Skills机制实现的。这使得Claude能够处理.pdf、.docx、.xlsx和.pptx等多种文件格式,极大地扩展了其应用范围。

⚙️ **Skills的依赖与优势**:与MCP和ChatGPT Plugins不同,Claude Skills依赖于文件系统和命令执行环境。这种设计使其在处理任务时更为高效,并且能利用模型自行理解的CLI工具帮助。Skills的易于创建和共享特性,预示着AI能力将迎来快速发展。

🚀 **Skills带来的通用代理能力**:Claude Skills的出现,使得Claude能够作为通用代理,自动化执行计算机上的各种任务。通过组合不同的Skills,可以构建出强大的“数据新闻代理”等,极大地拓展了AI在实际应用中的潜力。

Claude Skills are awesome, maybe a bigger deal than MCP

16th October 2025

Anthropic this morning introduced Claude Skills, a new pattern for making new abilities available to their models:

Claude can now use Skills to improve how it performs specific tasks. Skills are folders that include instructions, scripts, and resources that Claude can load when needed.

Claude will only access a skill when it’s relevant to the task at hand. When used, skills make Claude better at specialized tasks like working with Excel or following your organization’s brand guidelines.

Their engineering blog has a more detailed explanation. There’s also a new anthropic/skills GitHub repo.

(I inadvertently preempted their announcement of this feature when I reverse engineered and wrote about it last Friday!)

Skills are conceptually extremely simple: a skill is a Markdown file telling the model how to do something, optionally accompanied by extra documents and pre-written scripts that the model can run to help it accomplish the tasks described by the skill.

Claude’s new document creation abilities, which accompanied their new code interpreter feature in September, turned out to be entirely implemented using skills. Those are now available Anthropic’s repo covering .pdf, .docx, .xlsx, and .pptx files.

There’s one extra detail that makes this a feature, not just a bunch of files in disk. At the start of a session Claude’s various harnesses can scan all available skill files and read a short explanation for each one from the frontmatter YAML in the Markdown file. This is very token efficient: each skill only takes up a few dozen extra tokens, with the full details only loaded in should the user request a task that the skill can help solve.

Trying out the slack-gif-creator skill

Here’s that metadata for an example slack-gif-creator skill that Anthropic published this morning:

Toolkit for creating animated GIFs optimized for Slack, with validators for size constraints and composable animation primitives. This skill applies when users request animated GIFs or emoji animations for Slack from descriptions like “make me a GIF for Slack of X doing Y”.

I just tried this skill out in the Claude mobile web app, against Sonnet 4.5. First I enabled the slack-gif-creator skill in the settings, then I prompted:

Make me a gif for slack about how Skills are way cooler than MCPs

And Claude made me this GIF:

Click to hide the GIF

(OK, this particular GIF is terrible, but the great thing about skills is that they’re very easy to iterate on to make them better.)

Here are some noteworthy snippets from the Python script it wrote, comments mine:

# Start by adding the skill's directory to the Python pathimport syssys.path.insert(0, '/mnt/skills/examples/slack-gif-creator')from PIL import Image, ImageDraw, ImageFont# This class lives in the core/ directory for the skillfrom core.gif_builder import GIFBuilder# ... code that builds the GIF ...# Save it to disk:info = builder.save('/mnt/user-data/outputs/skills_vs_mcps.gif',                     num_colors=128,                     optimize_for_emoji=False)print(f"GIF created successfully!")print(f"Size: {info['size_kb']:.1f} KB ({info['size_mb']:.2f} MB)")print(f"Frames: {info['frame_count']}")print(f"Duration: {info['duration_seconds']:.1f}s")# Use the check_slack_size() function to confirm it's small enough for Slack:passes, check_info = check_slack_size('/mnt/user-data/outputs/skills_vs_mcps.gif', is_emoji=False)if passes:    print("✓ Ready for Slack!")else:    print(f"⚠ File size: {check_info['size_kb']:.1f} KB (limit: {check_info['limit_kb']} KB)")

This is pretty neat. Slack GIFs need to be a maximum of 2MB, so the skill includes a validation function which the model can use to check the file size. If it’s too large the model can have another go at making it smaller.

Skills depend on a coding environment

The skills mechanism is entirely dependent on the model having access to a filesystem, tools to navigate it and the ability to execute commands in that environment.

This is a common pattern for LLM tooling these days—ChatGPT Code Interpreter was the first big example of this back in early 2023, and the pattern later extended to local machines via coding agent tools such as Cursor, Claude Code, Codex CLI and Gemini CLI.

This requirement is the biggest difference between skills and other previous attempts at expanding the abilities of LLMs, such as MCP and ChatGPT Plugins. It’s a significant dependency, but it’s somewhat bewildering how much new capability it unlocks.

The fact that skills are so powerful and simple to create is yet another argument in favor of making safe coding environments available to LLMs. The word safe there is doing a lot of work though! We really need to figure out how best to sandbox these environments such that attacks such as prompt injections are limited to an acceptable amount of damage.

Claude as a General Agent

Back in January I made some foolhardy predictions about AI/LLMs, including that “agents” would once again fail to happen:

I think we are going to see a lot more froth about agents in 2025, but I expect the results will be a great disappointment to most of the people who are excited about this term. I expect a lot of money will be lost chasing after several different poorly defined dreams that share that name.

I was entirely wrong about that. 2025 really has been the year of “agents”, no matter which of the many conflicting definitions you decide to use (I eventually settled on "tools in a loop").

Claude Code is, with hindsight, poorly named. It’s not purely a coding tool: it’s a tool for general computer automation. Anything you can achieve by typing commands into a computer is something that can now be automated by Claude Code. It’s best described as a general agent. Skills make this a whole lot more obvious and explicit.

I find the potential applications of this trick somewhat dizzying. Just thinking about this with my data journalism hat on: imagine a folder full of skills that covers tasks like the following:

    Where to get US census data from and how to understand its structureHow to load data from different formats into SQLite or DuckDB using appropriate Python librariesHow to publish data online, as Parquet files in S3 or pushed as tables to Datasette CloudA skill defined by an experienced data reporter talking about how best to find the interesting stories in a new set of dataA skill that describes how to build clean, readable data visualizations using D3

Congratulations, you just built a “data journalism agent” that can discover and help publish stories against fresh drops of US census data. And you did it with a folder full of Markdown files and maybe a couple of example Python scripts.

Skills compared to MCP

Model Context Protocol has attracted an enormous amount of buzz since its initial release back in November last year. I like to joke that one of the reasons it took off is that every company knew they needed an “AI strategy”, and building (or announcing) an MCP implementation was an easy way to tick that box.

Over time the limitations of MCP have started to emerge. The most significant is in terms of token usage: GitHub’s official MCP on its own famously consumes tens of thousands of tokens of context, and once you’ve added a few more to that there’s precious little space left for the LLM to actually do useful work.

My own interest in MCPs has waned ever since I started taking coding agents seriously. Almost everything I might achieve with an MCP can be handled by a CLI tool instead. LLMs know how to call cli-tool --help, which means you don’t have to spend many tokens describing how to use them—the model can figure it out later when it needs to.

Skills have exactly the same advantage, only now I don’t even need to implement a new CLI tool. I can drop a Markdown file in describing how to do a task instead, adding extra scripts only if they’ll help make things more reliable or efficient.

Here come the Skills

One of the most exciting things about Skills is how easy they are to share. I expect many skills will be implemented as a single file—more sophisticated ones will be a folder with a few more.

Anthropic have Agent Skills documentation and a Claude Skills Cookbook. I’m already thinking through ideas of skills I might build myself, like one on how to build Datasette plugins.

Something else I love about the design of skills is there is nothing at all preventing them from being used with other models.

You can grab a skills folder right now, point Codex CLI or Gemini CLI at it and say “read pdf/SKILL.md and then create me a PDF describing this project” and it will work, despite those tools and models having no baked in knowledge of the skills system.

I expect we’ll see a Cambrian explosion in Skills which will make this year’s MCP rush look pedestrian by comparison.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Claude Skills AI LLM 能力扩展 通用代理 Claude Anthropic
相关文章