Go/No-Go Criteria
Before any agent goes to production, it must pass a production readiness review covering all the patterns from this chapter. This isn't a formality — it's the gate that separates pilots that impress demos from agents that survive real-world conditions. The checklist covers observability (all five pillars implemented), resilience (circuit breakers, retries, fallbacks tested), guardrails (input, output, and execution guardrails active), cost controls (per-request, per-user, per-agent budgets set), scaling plan (load tested at 3x expected peak), compliance (audit trails, documentation, human oversight), and runbook (incident response procedures documented and rehearsed). An agent that passes all seven categories is production-ready. Anything less is a risk.
Readiness Checklist
Production readiness review:
Observability:
□ Traces, metrics, logs, evals, review
Resilience:
□ Circuit breakers configured
□ Retries with backoff
□ Fallback chain defined
Guardrails:
□ Input, output, execution guards
Cost controls:
□ Per-request/user/agent budgets
Scaling:
□ Load tested at 3x peak
Compliance:
□ Audit trails, docs, oversight
Runbook:
□ Incident response documented
All 7 categories = production ready
Key insight: The production readiness review should be a recurring event, not a one-time gate. Run it quarterly, because the agent's environment changes (model updates, new integrations, scaling) even when the agent's code doesn't.