Tier 1 — Discovery
At startup, the agent loads only names and descriptions of available capabilities. This costs approximately ~80 tokens per skill (median). For Anthropic’s 17 built-in skills, the total discovery cost is only ~1,700 tokens — a fraction of the context window.
Tier 2 — Activation
When the model determines a skill is relevant to the current task, the full instruction body loads (275 to 8,000 tokens depending on complexity). This is a file read, not an LLM call, so latency is minimal.
Tier 3 — Execution
Supporting scripts, reference materials, and detailed examples load only during active task execution. These are the heaviest assets and are evicted when the task completes.
Key insight: This three-tier approach mirrors how human experts work: you know what you’re capable of (discovery), you pull up the relevant playbook when needed (activation), and you consult reference materials only during execution. The agent never carries the full weight of all its capabilities at once.