Self-Evolving Agents

Definition

Self-evolving agents are AI systems that improve their own capabilities through deployment experience, treating failures as training signals rather than just retry opportunities. The model weights stay frozen; improvement happens through an evolving external skill/memory layer.

自进化智能体是一类通过部署经验提升自身能力的人工智能系统，它们将失败视为训练信号，而非单纯的重试机会。模型权重保持冻结；改进是通过不断演化的外部技能/记忆层实现的。

Details

The Evolution Spectrum

Static tools — manually added, never change
Learnable skills — agent creates skills from successful runs (SkillCraft)
Self-rewriting skills — agent rewrites failing skills (Memento-Skills)
Meta-level improvement — agent improves its own improvement methods (HyperAgents / Facebook FAIR)

1. **静态工具** —— 人工添加，永不改变 2. **可习得技能** —— 智能体从成功运行中创建技能（SkillCraft） 3. **自重写技能** —— 智能体重写失败的技能（Memento-Skills） 4. **元级改进** —— 智能体改进自身的改进方法（HyperAgents / Facebook FAIR）

Memento-Skills: Read-Execute-Reflect-Write Loop

The core loop:

Read — retrieve relevant skill from memory
Execute — attempt the task using the skill
Reflect — analyze outcome, attribute success/failure to specific skills
Write — update, rewrite, or create new skill based on reflection

核心循环：1. **读取** — 从记忆中检索相关技能 2. **执行** — 使用该技能尝试任务 3. **反思** — 分析结果，将成功或失败归因于特定技能 4. **写入** — 基于反思更新、重写或创建新技能

When a task fails, the system records the state, updates a utility score for the responsible skill, and triggers a reflection pass that may rewrite or replace the skill entirely.

当任务失败时，系统记录状态，更新负责技能的效用分数，并触发反思过程，该过程可能会重写或彻底替换该技能。

SkillCraft Findings

Enabling “Skill Mode” reduces token consumption by up to 80% on repetitive tasks
Stronger models are better at discovering and reusing skills within the same task
Deep skill nesting (skills calling skills) increases error propagation rather than helping

- 启用“技能模式”最多可减少 **80%** 的重复任务 Token 消耗 - 更强的模型更擅长在同一任务中发现并复用技能 - **深层技能嵌套（技能调用技能）会增加错误传播**，而并非有所助益

Limitations

When agents rewrite their own skills, they optimize for aggregate error rates rather than the specific edge cases users care about
Self-optimization performs hidden value judgments through loss functions
No guaranteed convergence — skills can degrade

- 当智能体重写自身技能时，它们优化的是聚合错误率，而非用户关心的特定边缘情况 - 自我优化通过损失函数执行隐性的价值判断 - 无法保证收敛——技能可能会退化

Connections

Related to: Skills, Skill Lifecycle
Mentioned in: Memento-Skills, SkillCraft, Skills Agent Research

- 相关于：[[ai-agent-architecture/concepts/skills|技能]]，[[ai-agent-architecture/concepts/skill-lifecycle|技能生命周期]] - 提及于：[[ai-agent-architecture/sources/memento-skills-framework|Memento-Skills]]，[[ai-agent-architecture/sources/skillcraft-paper|SkillCraft]]，[[ai-agent-architecture/sources/skills-agent-research|技能智能体研究]]

LLM Wiki

探索

Self-Evolving Agents

Self-Evolving Agents

Definition

Details

The Evolution Spectrum

Memento-Skills: Read-Execute-Reflect-Write Loop

SkillCraft Findings

Limitations

Connections

关系图谱

目录

反向链接