← 回總覽

XSkill:通过持续学习和技能蒸馏增强 AI 智能体

📅 2026-03-15 05:15 elvis 人工智能 2 分鐘 1482 字 評分: 88
AI 智能体 持续学习 XSkill MCP LLM 工具使用
📌 一句话摘要 作者分析了 XSkill 论文,该论文提出了一种双流框架,通过经验蒸馏帮助智能体提升工具使用和规划能力。 📝 详细摘要 这条推文对“XSkill”进行了技术深度解析,它是一种专为 AI 智能体设计的双流持续学习框架。推文解释了智能体如何从历史轨迹中提取可复用知识:将“经验”用于动作级工具选择,将“技能”用于任务级规划和工作流。作者分享了将这些技能与 MCP 和命令行界面 (CLIs) 结合以改进编码智能体的个人见解。文中还强调了关键指标,例如 Gemini-3-Flash 上的成功率从 33.6% 提升至 40.3%,以及工具错误率的显著降低。 📊 文章信息 AI 评分:

// Continual Learning from Experience and Skills // Skills are so good when you combine them properly with MCP & CLIs.

I have found that Skills can significantly improve tool usage of my coding agents.

The best way to improve them is to regularly document improvements, patterns, and things to avoid.

Self-improving skills don't work that well (yet).

Check out this related paper on the topic:

It introduces XSkill, a dual-stream continual learning framework.

Agents distill two types of reusable knowledge from past trajectories: experiences for action-level tool selection, and skills for task-level planning and workflows.

Both are grounded in visual observations.

During accumulation, agents compare successful and failed rollouts via cross-rollout critique to extract high-quality knowledge. During inference, they retrieve and adapt relevant experiences and skills to the current visual context.

Evaluated across five benchmarks with four backbone models, XSkill consistently outperforms baselines. On Gemini-3-Flash, the average success rate jumps from 33.6% to 40.3%. Skills reduce overall tool errors from 29.9% to 16.3%.

Agents that accumulate and reuse knowledge from their own trajectories get better over time without parameter updates.

I have now seen two papers this week with similar ideas.

Paper: arxiv.org/abs/2603.12056

Learn to build effective AI agents in our academy: academy.dair.ai

查看原文 → 發佈: 2026-03-15 05:15:05 收錄: 2026-03-15 08:01:00

🤖 問 AI

針對這篇文章提問,AI 會根據文章內容回答。按 Ctrl+Enter 送出。