The ReAct Loop Problem
AI agents using the ReAct pattern (Reason + Act) accumulate context with every tool call. Each cycle adds the tool result (hundreds or thousands of tokens), the model’s reasoning, and the action taken. Without intervention, a 10-step agent task can consume the entire context window — pushing out the system instructions and early task context the model needs to reason well.
Critical in AI: OpenAI recommends fewer than 20 tools per agent, with accuracy degrading past 10. Connect a few MCP servers and you might reach 90+ tool definitions — over 50,000 tokens of schemas before the model starts reasoning. This is the tool management crisis that Chapter 7 addresses.
Multi-Agent Amplification
In multi-agent systems, each agent has its own context window. Information must be selectively routed between agents — a billing agent doesn’t need the onboarding knowledge base, and a code review agent doesn’t need the deployment logs. Without routing, every agent carries every piece of context, multiplying waste.
What’s Next
The remaining chapters of this course address each dimension of this challenge: progressive disclosure (Ch 3) controls what loads and when, compression (Ch 4) shrinks accumulated history, routing (Ch 5) directs queries to the right source, retrieval (Ch 6) fetches external knowledge on demand, tool management (Ch 7) controls the capability surface, and token budgeting (Ch 8) ties it all together economically.