6
“The best multi-agent system is one where doing the right thing is also the easy thing.”
- Even “cooperative” agents can defect if incentives are misaligned — design for it with mechanism design.
- Reputation only works if agents have persistent identity — anonymous agents can’t be held accountable.
- Same model weights = same biases = easy collusion. Diversify your agent pool across models and prompts.
7
“Learn the patterns — frameworks change fast.”
- Role specialization: if you cannot unit-test a role’s expected output, the role definition is too loose.
- Tool access is your permission model — treat it like IAM, not a free buffet. Log every call.
- Match conversation patterns to decision types: debate for judgment, hierarchy for execution, reflection for quality.
8
“Agent-level tests passing does not guarantee system-level success — test the interactions.”
- Evaluate at three layers: agent capability, interaction quality, and end-to-end task success.
- Always report cost alongside accuracy — a 2% accuracy gain at 5× cost is rarely worth it.
- Ablation studies: remove agents one at a time. No effect = remove it. System crash = add redundancy.
9
“Safety is not a feature you add at the end — it is an architecture decision you make at the start.”
- The danger is a chain of agents trusting each other’s mistakes — cascading failures are the top risk.
- Guardrails must run at every agent boundary, not just the user-facing edge.
- An untested kill switch is a false sense of security — drill it regularly.
10
“Multi-agent systems are distributed systems with language interfaces — treat them with that level of rigor.”
- The demo-to-production gap is engineering discipline: cost controls, latency SLAs, graceful degradation.
- The cheapest token is the one you never send — prune context, tier models, cache tool results.
- Every agent needs an on-call owner — “the AI did it” is not an incident response.
Section takeaway: Production MAS requires game-theoretic design, rigorous evaluation at every layer, safety as architecture, and distributed-systems engineering discipline. Principles outlast frameworks.