AI Agent Architecture

Summary

AI Agent architecture has undergone a fundamental shift from 2024 to 2026. The industry has moved beyond asking “can we build agents?” to asking “can we make agents work reliably?” This wiki captures the full landscape — from foundational design patterns and scaling laws to enterprise deployment strategies and security considerations.

从2024年到2026年，AI智能体架构经历了根本性变革。业界关注的焦点已从“我们能否构建智能体？”转变为“我们能否让智能体可靠地运行？”。本知识库涵盖了完整的技术版图——从基础设计模式和扩展定律，到企业部署策略与安全考量。

The architecture of modern AI agents can be understood through three interrelated layers: the reasoning core (LLM with cognitive depth adaptation), the harness (the controlled boundary between thinking and acting), and the skill layer (modular, reusable, evolvable capabilities). The harness has emerged as the most critical infrastructure component, serving as a mandatory firewall between LLM reasoning and production system execution. Meanwhile, Skills — standardized capability units defined by the agentskills.io open specification — are crystallizing as the fundamental building blocks of agent capability.

现代人工智能智能体的架构可以通过三个相互关联的层级来理解：**推理核心**（具备认知深度适应能力的LLM）、**管控层**（思考与行动之间的受控边界）以及**技能层**（模块化、可复用、可演进的能力）。管控层已成为最关键的基础设施组件，充当LLM推理与生产系统执行之间必不可少的防火墙。与此同时，技能——由 agentskills.io 开放规范定义的标准化能力单元——正逐渐确立为智能体能力的基础构建模块。

Key empirical findings challenge common assumptions: multi-agent systems do NOT always outperform single agents (performance drops up to 70% on sequential reasoning tasks), a small 7B model with proper cognitive depth adaptation can outperform GPT-4o by 40%, and 79% of multi-agent failures originate in the orchestration layer. These findings underscore that architecture decisions matter more than raw model capability.

关键实证发现挑战了普遍假设：多智能体系统并不总是优于单智能体（在序列推理任务中性能下降高达 70%），一个经过适当认知深度适配的 7B 小模型可以比 GPT-4o 高出 40%，且 79% 的多智能体故障源于编排层。这些发现强调了架构决策比模型原始能力更为重要。

Key Concepts

Harness - The controlled boundary between agent reasoning and real-world execution
Multi-Agent Architectures - Five topology patterns and when to use each
Agent Scaling Laws - Empirical laws governing multi-agent system performance
Cognitive Depth Adaptation - Dynamic reasoning depth allocation per step
Agent Memory - Memory as a core harness function, not a plugin
Skills - The fundamental unit of agent capability
Skill Lifecycle Management - Create, evaluate, connect, and evolve skills
Self-Evolving Agents - Agents that learn from deployment experience
Sandbox Architectures - Isolation patterns for agent execution
Agent Security - Attack vectors and defenses for agent systems
Agentic Problem Frames - Engineering framework for reliable agent design
AI Infrastructure Stack - The layered architecture powering AI systems

Key Entities

Anthropic - Creator of Claude and the agentskills.io standard
LangChain - Agent framework ecosystem (LangGraph, Deep Agents)
NVIDIA - GPU ecosystem and OpenShell harness
Letta AI - Memory-first agent architecture (formerly MemGPT)
SkillNet - Unified skill ontology platform (Zhejiang University)
agentskills.io - Open standard for portable agent skills
Memento-Skills - Self-evolving agent framework
Uber LangEffect - Enterprise agent case study

Open Questions

At what scale does the index-based approach to wiki/knowledge management break down, requiring embedding-based RAG?
Can self-evolving skill frameworks (Memento-Skills) reliably improve without introducing regressions?
How do you prevent skill poisoning in open skill ecosystems?
What is the right balance between agent autonomy and human oversight for enterprise deployment?
Will the ~45% capability ceiling for multi-agent benefit hold as models improve?

- 基于索引的百科/知识管理方法在何种规模下会失效，从而需要转向基于嵌入的RAG？ - 自我进化的技能框架（Memento-Skills）能否在不引入回退的情况下可靠地改进？ - 在开放式技能生态系统中，如何防止技能投毒？ - 对于企业级部署，智能体自主性与人工监督之间的恰当平衡是什么？ - 随着模型的改进，多智能体约45%的能力增益上限能否保持？

LLM Wiki

探索

AI Agent Architecture

AI Agent Architecture

Summary

Key Concepts

Key Entities

Open Questions

Sources

关系图谱

目录

反向链接