Architecture
Sandboxing limits what each agent can affect: file system access, network calls, database writes, and spending limits. Run code-executing agents in isolated containers with resource caps. Use budget envelopes: each task gets a token budget and dollar cap; the system halts if exceeded. Blast radius analysis: if this agent goes rogue, what is the worst it can do? Design so the answer is bounded and reversible.
Pattern
Container: isolated execution
Budget: token + dollar cap
Blast radius: worst case bounded
// Reversible > irreversible actions
Key insight: Ask “if this agent goes rogue, what’s the worst case?” — then make that case survivable.