Ch 5 — Multi-Agent Planning & Negotiation

Decomposition, private info, auctions, plan merging, and adaptive replanning
Foundations
Goal → Decompose → Observe → Negotiate → Agree → Execute
Why Multi-Agent Planning Is Different
Interaction, privacy, and joint action spaces
The Challenge
Single-agent planning searches over one action space. Multi-agent planning must handle interleaved actions, shared resources, and agents whose private information limits what can be centrally optimized. The joint action space grows exponentially with the number of agents. Practical systems cope by decomposing problems, communicating partial plans, and negotiating where conflicts arise. Without structure, planning degenerates into brute-force enumeration or unconstrained chat that never converges.
Pattern
Joint space = A1 × A2 × … × An
Decompose to keep tractable
// Negotiate only at interfaces
Key insight: Decompose first, negotiate at boundaries — never plan everything jointly.
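The tractability argument above can be made concrete with a quick back-of-envelope sketch (the numbers and function names are illustrative, not from the chapter): planning jointly over all agents multiplies action spaces, while planning per team only sums much smaller per-team spaces and negotiates at the interfaces.

```python
def joint_space(actions_per_agent: int, n_agents: int) -> int:
    # Full joint planning: |A|^n joint actions.
    return actions_per_agent ** n_agents

def decomposed_space(actions_per_agent: int, team_sizes: list[int]) -> int:
    # Each team plans jointly; teams coordinate only at interfaces.
    return sum(actions_per_agent ** k for k in team_sizes)

print(joint_space(10, 8))            # 100_000_000 joint actions
print(decomposed_space(10, [4, 4]))  # 20_000 — smaller by a factor of 5,000
```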
Plan Decomposition & Partial-Order Plans
Breaking goals into assignable pieces
Technique
Hierarchical task networks (HTN) and partial-order planning let you break a goal into subtasks with ordering constraints but flexibility in scheduling. Each agent receives a subplan with preconditions and effects that must mesh with neighbors. In LLM stacks, the orchestrator can produce a DAG of subtasks (JSON), assign owners, and track status. Partial ordering allows parallelism where no dependency exists.
Pattern
HTN: goal → methods → subtasks
POP: ordering constraints only
// Parallel where no edge
Key insight: Expose the DAG to all agents so each knows what it blocks and what blocks it.
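A minimal sketch of the partial-order idea (the DAG contents and function name are hypothetical): given a subtask DAG like the JSON an orchestrator might emit, any task whose dependencies are all complete is ready, and all ready tasks can run in parallel.

```python
def ready_tasks(dag: dict[str, list[str]], done: set[str]) -> list[str]:
    """Partial-order view: a task is ready once all its dependencies are done."""
    return sorted(t for t, deps in dag.items()
                  if t not in done and all(d in done for d in deps))

# Hypothetical subtask DAG; keys are tasks, values are dependencies.
dag = {
    "gather_sources": [],
    "draft_report": ["gather_sources"],
    "run_benchmarks": ["gather_sources"],
    "final_review": ["draft_report", "run_benchmarks"],
}
print(ready_tasks(dag, {"gather_sources"}))  # → ['draft_report', 'run_benchmarks']
```

Because every agent can evaluate `ready_tasks` over the shared DAG, each one knows what it blocks and what blocks it.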
Incomplete Information & Private Goals
What agents don’t share
Reality
In many settings, agents have private beliefs or goals they cannot or should not reveal (competitive bidding, user privacy, proprietary data). Planning under incomplete information requires either communication protocols that elicit just enough to coordinate, or robust plans that work across plausible private states. In LLM systems, this maps to scoped context: each agent sees only what policy allows, and the orchestrator merges sanitized summaries.
Pattern
Agent A: knows X, hides Y
Agent B: knows Y, hides X
Merge: sanitized summaries
// Least-privilege context
Key insight: Design for least-privilege context — agents should see only what they need to act.
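Scoped context can be as simple as a policy table keyed by agent (the policy entries and key names here are made up for illustration): the orchestrator filters the full context down to what each agent is allowed to see before every call.

```python
# Hypothetical policy: which context keys each agent may see.
POLICY = {
    "research_agent": {"public_docs", "market_summary"},
    "finance_agent": {"market_summary", "internal_costs"},
}

def scoped_context(agent: str, full_context: dict) -> dict:
    """Least-privilege filter: an agent receives only the keys policy allows."""
    allowed = POLICY.get(agent, set())
    return {k: v for k, v in full_context.items() if k in allowed}

ctx = {"public_docs": "...", "market_summary": "...", "internal_costs": "..."}
print(sorted(scoped_context("research_agent", ctx)))
```

An unknown agent gets an empty context by default, which is the safe failure mode for least privilege.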
Negotiation Basics
Proposals, counter-proposals, and concession
Framework
Negotiation lets agents resolve conflicts over resources, schedules, or design choices through iterative proposals. A simple protocol: Agent A proposes, Agent B accepts or counter-proposes, repeat until agreement or deadline. Key parameters: reservation value (walk-away point), concession rate, and deadline pressure. For LLM agents, bound the number of rounds and require structured proposals (JSON with changed fields highlighted) so progress is measurable.
Pattern
propose → accept | counter
max_rounds, deadline
// Log concession trajectory
Key insight: Without a deadline or max rounds, LLM negotiation loops can burn tokens indefinitely.
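The protocol's parameters — reservation value, concession rate, and deadline — can be sketched as a numeric alternating-concession loop (the price framing and numbers are an assumption for illustration; real LLM agents would exchange structured JSON proposals instead of scalars):

```python
def negotiate(ask: float, offer: float, seller_floor: float, buyer_cap: float,
              concession: float = 0.1, max_rounds: int = 10):
    """Alternating concessions: both sides move toward each other each round
    until the proposals cross, or the deadline (max_rounds) expires."""
    for round_no in range(1, max_rounds + 1):
        if offer >= ask:                                   # proposals crossed: agree
            return round_no, (ask + offer) / 2
        ask = max(seller_floor, ask * (1 - concession))    # seller concedes down
        offer = min(buyer_cap, offer * (1 + concession))   # buyer concedes up
    return None                                            # deadline: no deal

print(negotiate(ask=100.0, offer=60.0, seller_floor=70.0, buyer_cap=90.0))
```

If the reservation values never overlap (floor above cap), the loop returns `None` at the deadline instead of spinning forever — the bounded-rounds guarantee the chapter calls for.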
Auctions & Market Mechanisms
Allocating scarce resources efficiently
Mechanisms
Auctions are structured negotiation: agents bid, a rule picks winners and prices. First-price sealed-bid, Vickrey (second-price), and combinatorial auctions each have different incentive properties. Vickrey encourages truthful bidding (dominant strategy). For multi-agent task allocation, auction rounds can assign jobs to the agent with the lowest cost or highest capability score. Implement as a microservice the orchestrator calls, not as free-form chat.
Pattern
Vickrey: bid true value
Winner: highest bid
Price: second-highest bid
// Truthful in theory
Key insight: Use Vickrey-style rules when you want agents to reveal true capabilities without gaming.
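The Vickrey rule itself fits in a few lines — the kind of deterministic microservice the orchestrator should call instead of running the auction as free-form chat (agent names and scores below are hypothetical):

```python
def vickrey_award(bids: dict[str, float]) -> tuple[str, float]:
    """Second-price sealed-bid: the highest bidder wins but pays the
    second-highest bid, making truthful bidding a dominant strategy."""
    ranked = sorted(bids.items(), key=lambda kv: kv[1], reverse=True)
    winner = ranked[0][0]
    price = ranked[1][1] if len(ranked) > 1 else ranked[0][1]
    return winner, price

# Hypothetical task-allocation round with capability scores as bids.
print(vickrey_award({"agent_a": 0.9, "agent_b": 0.7, "agent_c": 0.4}))
# → ('agent_a', 0.7)
```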
Plan Merging & Conflict Resolution
Combining partial plans safely
Process
After agents plan locally, a merge step checks for resource conflicts (two agents booking the same API slot), causal threats (one agent undoing another’s precondition), and ordering violations. Resolution strategies: priority rules, temporal separation, resource duplication, or re-planning the conflicting subgraph. Automate conflict detection with a constraint checker over the merged DAG before execution begins.
Pattern
merge(planA, planB)
detect: resource & causal conflicts
resolve: priority | replan
// Never skip the check
Key insight: Merge-then-check is cheaper than joint planning — but you must actually check.
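A minimal constraint checker for the simplest conflict class — two agents claiming the same resource in the same slot — might look like this (the plan tuples and names are illustrative; causal-threat and ordering checks would be added alongside):

```python
def resource_conflicts(merged_plan: list[tuple[str, str, str]]):
    """Scan (agent, slot, resource) steps in a merged plan and flag any
    slot/resource pair claimed by two different agents."""
    claims: dict[tuple[str, str], str] = {}
    conflicts = []
    for agent, slot, resource in merged_plan:
        key = (slot, resource)
        if key in claims and claims[key] != agent:
            conflicts.append((claims[key], agent, slot, resource))
        else:
            claims[key] = agent
    return conflicts

merged = [("agent_a", "t1", "search_api"),
          ("agent_b", "t1", "search_api"),   # conflict: same slot, same API
          ("agent_b", "t2", "search_api")]   # fine: temporal separation
print(resource_conflicts(merged))
```

Running this before execution is the "actually check" step: an empty list means the merge is safe for this conflict class; anything else goes to a priority rule or a subgraph replan.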
Replanning & Adaptive Execution
When the world invalidates the plan
Resilience
Plans fail: tools error, APIs change, teammates crash. Replanning triggers when a monitor detects deviation beyond a threshold. Options: local patch (swap one step), subgraph replan (redo the failing branch), or full replan (expensive, last resort). In LLM multi-agent systems, keep a plan version counter and broadcast invalidation events so all agents know the current truth. Stale plans cause cascading failures.
Pattern
monitor → deviation?
patch | replan_branch | replan_all
// Broadcast version bump
Key insight: Broadcast plan version changes — agents acting on stale plans are worse than agents with no plan.
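The version counter can be sketched as a shared plan board (the `PlanBoard` class and its fields are hypothetical names for this sketch): publishing bumps the version, and the bump is the invalidation broadcast — any agent holding an older number knows its plan copy is stale.

```python
class PlanBoard:
    """Shared plan store with a version counter; agents compare versions
    instead of trusting whatever plan copy they last read."""
    def __init__(self):
        self.version = 0
        self.plan = None

    def publish(self, plan) -> int:
        # The version bump is the invalidation event for all agents.
        self.version += 1
        self.plan = plan
        return self.version

    def is_stale(self, seen_version: int) -> bool:
        return seen_version < self.version

board = PlanBoard()
v1 = board.publish({"steps": ["a", "b"]})
v2 = board.publish({"steps": ["a", "b_patched"]})  # local patch after a failure
print(board.is_stale(v1), board.is_stale(v2))  # → True False
```

An agent that checks `is_stale` before each step refuses to act on an invalidated plan, cutting off the cascading failures the paragraph warns about.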
Chapter Summary
From decomposition to adaptive execution
Takeaways
Multi-agent planning decomposes goals, assigns via negotiation or auction, merges partial plans with conflict checks, and replans when reality drifts. Next chapter: emergence, game theory, and incentives — what happens when agents have their own utility functions and the system’s behavior is more than the sum of its parts.
Pattern
Decompose → Assign → Merge → Execute
Monitor → Replan
// Ch 6: game theory & emergence
Key insight: Good multi-agent planning is iterative: plan, check, negotiate, merge, execute, monitor, repeat.