precision_manufacturing

Harness Engineering

The discipline of designing systems that make AI coding agents reliable — constraints, feedback loops, documentation, linting, review pipelines, and orchestration at scale

Co-Created by Kiran Shirol and Claude

Topics CLAUDE.md Constraints Review Pipelines Orchestration Entropy Management Agent Governance

home Learning Portal play_arrow Start Learning summarize Key Insights dictionary Glossary 8 chapters · 3 sections

Section 1

Foundations — Why the Harness Matters

What harness engineering is, why the model is commodity, and the evidence that the harness determines agent performance.

pets

What Is Harness Engineering?

The horse-tack metaphor. Formal definition: constrain, inform, verify, correct. Coined by Elvis Saravia in early 2026. Why the model is commodity but the harness is competitive advantage.

arrow_forward Learn

compare

The Model vs. The Harness

LangChain’s jump from 52.8% to 66.5% on Terminal Bench 2.0 by changing only the harness. Nate B Jones’s 78% vs 42% finding. AutoHarness from Google DeepMind (ICLR 2026).

arrow_forward Learn

description

Constraint Documents & CLAUDE.md

CLAUDE.md, AGENTS.md, .cursorrules files. The 3-tier architecture (root rules, skills, agent guides). Writing effective constraint documents that agents actually follow.

arrow_forward Learn

Section 2

Core Mechanics — Constraints, Review & Memory

Architectural enforcement, multi-agent review pipelines, feedback loops, and entropy management.

architecture

Architectural Constraints & Linting

Dependency layering enforced by CI. Deterministic linters and LLM-based auditors. “Vibecoded lints” targeting agent-specific mistakes. Pre-commit hooks and structural tests.

arrow_forward Learn

rate_review

Review Pipelines & Feedback Loops

Multi-agent review workflows. Self-verification loops and pre-completion checklists. Loop detection to prevent “doom loops.” The reasoning sandwich pattern.

arrow_forward Learn

memory

Memory & Entropy Management

Episodic memory for few-shot learning. Documentation consistency agents. Constraint violation scanners. Preventing codebase entropy drift over time.

arrow_forward Learn

Section 3

Scale & Future — Orchestration, Risks & Governance

Production orchestration systems, the four maturity levels, agent sprawl, and the future of the discipline.

account_tree

Orchestration at Scale

OpenAI Symphony (Elixir-based, dispatches Codex agents). Stripe Minions (1,000+ merged PRs/week). Basis ($200M revenue, zero human-written code). The four maturity levels.

arrow_forward Learn

shield

Risks, Governance & The Future

Agent sprawl (73% unmonitored “shadow agents”). Vendor lock-in at the workflow level. Security surface area. Building rippable harnesses. The future of the discipline.

arrow_forward Learn

Harness Engineering

Foundations — Why the Harness Matters

Core Mechanics — Constraints, Review & Memory

Scale & Future — Orchestration, Risks & Governance

Related Courses