Loop Engineering: build the loop that prompts your agents

Awesome Loop Engineering

Learn loop engineering: the practice of designing systems that prompt, check,
remember, and re-run AI coding agents for you, instead of prompting them by hand, turn after turn.

Website · Learn · Prompts · Build your own loop · Cheat sheet

This is a teaching repo, not a link dump. It has four parts:

Learn — in-depth, plain-language chapters that explain loop engineering from first principles.
Read — a large, annotated reading list (~100 sources): the essays that named the field, the papers it stands on, the evals/observability canon, the talks, and the skeptics.
Practice — a library of real, copy-paste loop prompts (Ralph loops, /goal and /loop commands, Codex automations, maker/checker sub-agents, verifier rubrics, and full loop contracts).
Build — a portable agent skill that walks you through designing and scaffolding a loop for your own workflow, plus a small runnable example in plain Python.

Fast paths:

🌐 Website (the one-page tour)
🛑 When not to build a loop (the decision rule + readiness scorecard)
🧩 Build a loop for your workflow (the skill)
📄 Cheat Sheet (the whole field on one page)
📊 Case studies (real loops, real numbers)
❓ FAQ (is this just cron? is prompting dead?)
🛠 Hands-on: Claude Code / Codex / Cursor / OpenCode
🧠 Field Notes (13 hard-won lessons)

What is loop engineering, in one paragraph

For two years, getting work out of a coding agent meant writing a prompt, reading the reply, and typing the next prompt — you held the tool the whole time. Loop engineering replaces you as the thing that prompts the agent. You build a small system that discovers work, hands it to an agent, checks the result with a separate verifier, writes down what's done, decides what's next, and re-runs on a schedule. As Addy Osmani put it in the essay that named the field (June 2026): "Build the loop. But build it like someone who intends to stay the engineer, not just the person who presses go."

The two quotes that kicked it off:

"You shouldn't be prompting coding agents anymore. You should be designing loops that prompt your agents." — Peter Steinberger

"I don't prompt Claude anymore. I have loops running. They're the ones prompting Claude and figuring out what to do. My job is to write loops." — Boris Cherny, creator of Claude Code

Who this is for

You use a coding agent daily (Claude Code, Codex, Cursor) and you're tired of being its full-time hand-cranker.
You keep re-typing the same prompts ("fix the failing tests", "triage the new issues") and suspect a machine should be doing the typing.
You want the leverage without losing the plot — to automate the inner loop while staying the engineer who owns verification, comprehension, and intent.

No prior "agent" theory required. If you can write a shell while loop and read a diff, you can start.

Where it sits: the lineage

Loop engineering is the newest floor in a four-story stack. Each layer makes the one below it more useful:

Layer	The question it answers
Prompt engineering	What should I say to the model?
Context engineering (Karpathy)	What state and knowledge should the model see?
Harness engineering	What tools, permissions, tests, and sandboxes should surround one agent run?
Loop engineering	What recurring system should discover, delegate, verify, persist, and re-run — when the human is no longer in the inner loop?

Prompt, context, and harness make one run better. The loop makes recurring work reliable, observable, and governable. → Full explanation in docs/02 — The Lineage.

The shape of a loop

Objective
  │
  ▼
Trigger / cadence            ← an automation fires the loop (the "heartbeat")
  ▼
Discover / intake work       ← read CI failures, issues, alerts, a task list
  ▼
Delegate to an agent         ← the "maker" proposes a change
  ▼
Act in an isolated workspace ← a git worktree, so parallel agents don't collide
  ▼
Verify with a SEPARATE checker  ← tests, lint, an independent reviewer agent (the "/goal")
  ├── fail → feed the evidence back and retry
  └── pass → persist state, then decide what's next
  ▼
Repeat — or stop on: goal met · budget spent · stalled · needs-a-human

A loop is not a cron job. A cron job repeats blindly; a loop discovers, verifies, persists, and decides what's next — and stops for an honest reason. → docs/03 — Anatomy of a Loop.

Start here (a reading path)

In a hurry? The Cheat Sheet is the whole field on one page, and the FAQ clears up the common confusions. Otherwise, read in order:

docs/01 — What Is Loop Engineering — the idea, plainly.
docs/02 — The Lineage — prompt → context → harness → loop.
docs/03 — Anatomy of a Loop — the lifecycle and the "loop contract."
docs/04 — The Five Building Blocks — automations, worktrees, skills, connectors, sub-agents (+ memory).
Build your own: run the loop-engineering skill, or copy a prompt from prompts/ into Claude Code, Codex, Cursor, or OpenCode, or follow the hands-on docs/10 guide.
Go deeper: verification, memory, stacking loops.
Stay honest: token economics and the risks.
Steal the lessons: Field Notes — 13 cross-cutting things that took everyone else a while to learn.

The docs (learn it in depth)

#	Chapter	What you'll learn
01	What Is Loop Engineering	The core idea and why it appeared in 2026
02	The Lineage	Prompt → Context → Harness → Loop
03	Anatomy of a Loop	The lifecycle + the 11-part loop contract
04	The Five Building Blocks	Automations · Worktrees · Skills · Connectors · Sub-agents · Memory
05	Stop Conditions & Verification	`/loop` vs `/goal`, maker/checker, LLM-as-judge
06	Memory & State	Ralph, session-as-event-log, brain/hands/session
07	Stacking Loops	Loopcraft, the 4-loop stack, going UP vs DOWN
08	Token Economics	O(N²) cost, the 4×/15× figures, budgets, the orchestration tax
09	Risks & the Debt Trilogy	Comprehension, cognitive, and intent debt; cognitive surrender
10	Building Loops in Claude Code, Codex, Cursor & OpenCode	Hands-on: `claude -p`, `codex exec`, `cursor-agent`, `opencode run`, the brakes per tool
11	Case Studies	Real loops with real numbers — and the loops that bit back
12	Security & Failure Modes	Reward hacking, the lethal trifecta, doom loops, the threat catalog
13	Evaluating the Loop	Evals as the regression suite, trajectory vs outcome, proving the verifier
00	When Not to Build a Loop	The decision rule: when a loop is the wrong tool
—	Glossary	Every term in one place
—	Field Notes	Hard-won synthesis: 13 cross-cutting lessons
—	Cheat Sheet	The whole field distilled to one page
—	FAQ	The questions everyone asks first

Practice (copy-paste loop prompts)

The prompts/ library collects real prompts you can adapt today:

Ralph loops — the while :; do cat PROMPT.md \| agent; done technique and its refined descendants.
/goal & /loop — Claude Code stop-condition and cadence prompts.
Automations — OpenAI Codex scheduled-automation prompts (triage, CI repair, PR babysitting).
Sub-agents (maker/checker) — explorer, implementer, and independent-verifier skills.
Verification rubrics — LLM-as-judge graders that return PASS/FAIL with evidence.
Loop contracts — full specs (PR babysitter, CI repair, incident response, dependency triage, PR review, doc-drift repair, flaky-test hunt).

Build a loop for your own workflow (the skill)

The loop-engineering skill is a portable agent skill you can drop into Claude Code, Codex, Cursor, or OpenCode. Point your agent at it and it will interview you about the work you keep doing by hand, fill in an 11-part loop contract, choose honest stop conditions and budgets, and scaffold the files so you finish with a loop you can actually run — and trust.

Install it with one command — it asks which agent you use and where to put it:

npx github:invincible04/awesome-loop-engineering

Or copy the folder yourself (a skill is just a folder with a SKILL.md):

cp -r skill/loop-engineering .claude/skills/loop-engineering   # Claude Code
cp -r skill/loop-engineering .opencode/skills/loop-engineering  # OpenCode

Then ask your agent: "Use the loop-engineering skill to help me design a loop for <the thing you keep doing by hand>." See skill/loop-engineering/README.md for every install path and tool.

A runnable example

examples/runnable/ is a dependency-free Python implementation of the loop machinery — engine, budget, memory, a separate verifier, and the four honest stop conditions — with a test suite. Read it to see the ideas execute, not just described.

cd examples/runnable
python -m loop_cookbook                    # watch a maker/checker loop converge and stop
python -m unittest discover -s tests -v    # 10 tests, standard library only

Contributing

New chapters, sources, and (especially) real attributed prompts are welcome — see CONTRIBUTING.md. Released under the MIT License: use anything here freely.

Spread the loop

If this helped you stop hand-cranking your agents, star the repo and pass it on. The best way to share it is the one-page website — it teaches the whole idea in a single scroll.

A note on dates: loop engineering is young (the term took off in June 2026). Perishable claims — token multipliers, tool features, model behavior — carry an "as of" note, because they expire. What changed between revisions is tracked in the CHANGELOG. Corrections via PR are very welcome.