The Loop Is the Unit of Work — Convergent Loop Engineering Across Claude Code and Agentic Frameworks

Source: wiki synthesis: Loop Engineering (Cobus Greyling), Loop Engineering (Addy Osmani’s Essay), Reflecting on a Year of Claude Code (Cherny & Wu), Dynamic Workflows, 12-Factor Agents, Clawd Chief, The Verification Frontier

Across 2026 the wiki kept documenting the same shift from two unconnected directions. From inside Anthropic, Boris Cherny describes a career that went source code → agent → loop: “I don’t talk to an agent anymore. I talk to a loop… and it prompts Claude for me.” From the tool-agnostic OSS world, Cobus Greyling and Addy Osmani name the identical move — “replacing yourself as the person who prompts the agent; design the system that does it instead” — and ship it as a pattern catalog with primitives, a readiness ladder, and a failure-mode index. This article connects those threads: the loop has become the unit of agentic work, the leverage point migrated prompt → harness → loop, and the single primitive that turns a loop from reckless to shippable is the maker/checker verifier — which is the verification frontier thesis applied one level down.

Key Takeaways

One pattern, two vocabularies. Claude-Code-native practice (/loop, /goal, Routines, Dynamic Workflows) and tool-agnostic frameworks (loop-engineering, 12-Factor Agents, Clawd Chief’s “agents are cron jobs and markdown files”) are describing the same control system. loop-engineering is the explicit naming/generalization of what Anthropic’s team already does — its own sources.md cites Cherny and Osmani as the seed.
The leverage ladder is prompt → harness → loop. Prompt engineering tunes one turn; harness engineering sets up one session; loop engineering schedules + verifies + persists across many sessions. Each rung is a strictly larger unit of leverage.
There is a shared loop anatomy. Every source assembles the same parts: a scheduler, a triage skill, durable state, isolated worktrees, a maker/checker split, connectors, and a human gate. The vocabulary differs; the wiring doesn’t.
Verification is the gate that unlocks autonomy. loop-engineering’s L0→L3 readiness ladder is the same trust gradient the verification frontier describes: “no verification → no autonomy.” Boris earned trust in auto-mode the same way loop-engineering demands a verifier — by turning red-team attacks into evals.
The failure modes are now catalogued. What used to be folklore (infinite fix loops, verifier theater, token burn, notification fatigue, cognitive surrender) is a documented S1/S2/S3 catalog you can design against — the operational complement to the verification thesis.

The leverage ladder: prompt → harness → loop

The connecting spine is a migration of what an engineer interacts with, and both clusters draw the same three rungs:

Rung	Unit	Claude-Code framing	Tool-agnostic framing
Prompt	one turn	the message you type	(the thing loop engineering replaces)
Harness	one session	minimal system prompt + tools; “be a context minimalist”	Osmani’s agent harness engineering — the single-session sandbox
Loop	many sessions over time	”talk to a loop / routine, it prompts Claude” (Cherny’s 2nd leap)	loop-engineering: schedule + state + verification chain

Cherny frames it as two leaps — source code → agent, then agent → loop. Osmani’s vocabulary makes the harness/loop boundary explicit: Harness = single session setup; Loop = harness + schedule + state + verification. Same ladder, two namings. Karpathy’s vibe-coding → agentic-engineering is the third independent statement of the same progression.

Two namings of one pattern

The payoff of connecting these articles is seeing that the Claude-native and tool-agnostic vocabularies are a one-to-one map:

The move	Claude-Code-native (claude-ai)	Tool-agnostic (agents-agentic-systems)
Self-prompting heartbeat	scheduled tasks / [[claude-ai/claude-code-routines	Routines]] / `/loop`
Encode intent once, read every run	”write the fix to `CLAUDE.md` or a skill” so Claude “runs forever”	Skills primitive (pays down intent debt)
Safe parallelism	desktop app auto work-tree cloning	Worktrees (`isolation: worktree`)
Don’t grade your own homework	auto-mode classifier; “can the agent run the thing?”	Sub-agents maker/checker split; verifier on a stronger model
Durable spine outside the chat	repo state / issue trackers	Memory / State primitive (`STATE.md`)
Reach real tools	MCP connectors	Plugins & Connectors (MCP) primitive
Orchestrate many at once	[[claude-ai/dynamic-workflows-claude-code	Dynamic Workflows]] (hundreds–thousands of agents)

Cherny’s “agents are cron jobs and markdown files” twin — Ryan Carson’s Clawd Chief — sits exactly in the middle of this table: two load-bearing markdown files (priority-map, auto-resolver) + a 15-minute schedule is a loop with the scheduling, skills, and state primitives and a deferred verifier.

Where `/loop`, `/goal`, Routines, and Dynamic Workflows Precisely Differ (research-agenda drain, 2026-07-03)

This article’s own Open Questions section named this gap: the first-party sources use “loop” and “routine” loosely, and the crisp boundaries live scattered across scheduled tasks, Routines, goal` walkthrough, and Dynamic Workflows rather than in one place. Here they are side by side:

Primitive	Unit of work	Durability	Trigger	Cost profile
`/loop` (scheduled tasks)	Re-runs the same prompt/command on an interval, inside one session	Session-scoped — dies with the session; restored on `--resume` only if unexpired; recurring tasks auto-expire 7 days after creation	Fixed interval, Claude-paced interval, or the built-in maintenance prompt	Cheap per iteration at sane intervals (5-30 min); cost is roughly linear in iteration count
`/goal`	One completion-condition contract; the session keeps working turn-after-turn until a separate, fast checker model confirms the condition holds	Session-scoped, same lifetime as `/loop` — but goal-convergent rather than schedule-driven; stops when done, not when time’s up	Manual invocation with a natural-language condition (up to 4,000 characters)	Scales with convergence time — “might run 10 minutes, might run 10 hours”; each turn re-bills accumulated context, so cost grows with turns the way [[agent-loops/loop-economics-and-security
Routines	One recurring or event-triggered job, cloud-hosted, stateless per run	Durable across restarts and a closed laptop; each run clones fresh (no memory carries between runs — persist via Connectors or files); the saved Routine itself does not expire	Cron schedule (1-hour minimum), API POST with a bearer token, or a GitHub webhook	Bounded by per-plan daily caps (Pro 5/day, Max 15/day, Team/Enterprise 25/day) — the only one of the four with a hard, predictable ceiling
Dynamic Workflows	Many subtasks fanned out across tens-to-hundreds of parallel subagents within one session, cross-checked adversarially until findings converge	Resumable by design — checkpointed continuously; an interrupted run picks up rather than restarts; can extend hours to days	The `ultracode` trigger word, or an explicit “create a workflow” ask	”Meaningfully more usage” than a normal session — a confirm-before-first-run gate exists; one operator’s unofficial measurement put hard caps at 16 concurrent agents and 1,000 agents total per run

The axis that actually separates them: /loop and /goal are session primitives — they need a live, open session and (mostly) die with it. Routines are a cloud primitive — no session, no open laptop, durable, but stateless per run and capped by daily quota rather than by time or tokens directly. Dynamic Workflows is not a scheduling primitive at all — it’s an in-session fan-out mechanism: many parallel subagents attacking one task inside a single run, explicitly designed to compose with /goal and /loop rather than replace them (Anthropic’s own guidance: “combine with /goal and /loop for autonomous, repeat-until-done behavior”). Boris Cherny’s autonomous-Opus playbook stacks all of this rather than picking one: auto mode, dynamic workflows, /goal or /loop, cloud-hosted sessions, and a self-verification harness, together.

One operator’s mental shorthand — “/goal is depth (re-runs until done, can run 24h+), a Workflow is width (N agents fan out once, results synthesize once, no per-iteration convergence check)” — is a useful simplification but not the official line: Anthropic’s announcement explicitly frames Dynamic Workflows as self-checking (agents refute each other’s findings until they converge), so treat “width, no loop” as an approximation, not a contradiction of the depth/width split.

Reach for which, by the question you’re actually asking:

If you’re asking…	Reach for
”Keep this session working until X is true”	`/goal`
”Poll something at an interval while I’m in this session”	`/loop`
”Run this every night at 2am, no session, no laptop needed”	Routines
”This one task is too big for one context window — fan it out”	Dynamic Workflows

Verification is what makes a loop shippable

This is where the connection closes back onto the verification frontier. That article’s law — AI loops compound where verification is cheap and stall where it’s expensive — is the theory; loop engineering is the operating manual for the cheap-verification case:

The readiness ladder is a verification gradient. L1 report-only needs no verifier (it doesn’t act). L2 assisted requires a separate verifier + worktree + max-attempts. L3 unattended requires proven verification and cost observability. You climb the ladder exactly as fast as your verification surface grows — Jeremy’s “no verification → no autonomy” stated as a checklist.
Maker/checker is cheap verification at the loop level. loop-engineering’s “the implementer must never grade its own homework” is the same adversarial-verification pattern Dynamic Workflows bakes into its orchestrator (a separate agent checks each finding) and that AutoAgent reduces to a reward file. “Verifier Theater” (verifier approves, CI fails) is the named failure when this verification is fake.
Anthropic earned auto-mode trust the loop-engineering way. Cherny’s team “collected thousands of agent trajectories, brought in red-teamers, turned attacks into evals, iterated until all were denied.” That is building the verifier before granting autonomy — the same discipline loop-engineering’s checklist enforces, at frontier scale.

The corollary: as generation gets cheap, the loop’s bottleneck moves to review (Amdahl’s law), which is why loop-engineering treats comprehension debt and cognitive surrender as first-class S2 failures — the human-judgment frontier the verification thesis says stays expensive to verify.

One branch of the agentic-systems cluster runs the loop on the harness itself: Browserbase Autobrowse, Reflexio, and AutoAgent all hill-climb on a cheap criterion, then graduate a successful run into a reusable SKILL.md/playbook. That’s the Skills primitive being written by the loop instead of by you — the same converge-then-graduate ratchet Karpathy’s AutoResearch walkthrough describes, and the mechanism by which a loop pays down its own intent debt over time.

What combining them enables

A shared checklist for any loop, on any tool. loop-engineering’s design checklist + failure catalog are tool-agnostic, so they apply equally to a Claude Code /loop, a Codex Automation, or a GitHub Action — a portable rubric the Claude-native docs never wrote down.
A cost lens the AIOS pattern lacks. The AIOS pattern gets you context + skills + cadence; loop engineering adds the operating layer the AIOS leaves implicit — token budgets, cadence-as-multiplier math, kill switches, run logs.
A vocabulary that survives tool switches. Because the primitives map cleanly across Grok / Claude Code / Codex / Actions, a team can move runtimes without re-learning the pattern — the same portability bet 12-Factor Agents makes for LLM apps generally.

Try It

Place every automation on the readiness ladder. For each /loop, Routine, or scheduled agent you run, label it L1/L2/L3 and check it has the verification its rung requires. Anything acting at L2+ without a separate verifier is a Verifier-Theater risk — fix that first.
Run the failure catalog as a pre-mortem. Before enabling a new loop, walk the S1/S2/S3 list (infinite fix loop, token burn, notification fatigue, over-reach, escalation failure) and name the mitigation for each. This is the cheapest insurance loop engineering offers.
Invest in the verifier, not the generator. Per the verification frontier: the biggest unlock on a stuck loop is usually a cheaper verification surface — a test suite, an eval, an adversarial-review subagent — not a better prompt.
Encode the fix, not the correction. Adopt Cherny’s rule across your loops: when the agent errs, write the lesson into CLAUDE.md or a skill (the Skills primitive) so the loop stops re-making it — the difference between a loop that drifts and one that compounds.

Loop Engineering — Addy Osmani’s Essay — the canonical primary essay this synthesis names as a seed; “replace yourself as the prompter,” five primitives across Claude Code + Codex.
Loop Engineering (Cobus Greyling) — the tool-agnostic pattern catalog at the center of this synthesis: six primitives, seven patterns, readiness ladder, failure modes.
Verifier-First Loops — the maker/checker verifier discipline this article calls the shippability gate, stated as a pre-flight checklist.
Should You Build a Loop? — the cost/security operating layer (four-condition test, token math, security tax) that complements the readiness ladder.
Reflecting on a Year of Claude Code — Cherny’s “source → agent → loop” two-leaps framing and the first-party “my job is to write loops.”
The Verification Frontier — the law beneath the readiness ladder: loops compound only where verification is cheap.
Dynamic Workflows — the Claude-native orchestrate-many-agents mechanism with adversarial verification built in.
loop`) and Routines — the session-scoped vs cloud-hosted scheduling primitives this article’s comparison table pins down precisely.
goal` Walkthrough — the completion-condition primitive contrasted against /loop, Routines, and Dynamic Workflows above.
12-Factor Agents — “own your control flow / context / prompts”; the same discipline for LLM apps, portable across tools.
Clawd Chief — “agents are cron jobs and markdown files”; the loop pattern at solo-founder scale.
The 2026 Claude Code AIOS Pattern — the surrounding OS (context + skills + cadence); loops are its orchestration layer.
From Vibe Coding to Agentic Engineering — Karpathy’s independent statement of the same source → agent → loop progression.
Agent Loops (topic) — the hands-on learning path for this material; the Write Loops, Not Prompts explainer traces the ReAct → AutoGPT → Ralph Loop → /goal lineage behind the prompt → harness → loop shift.

Open Questions

Independent convergence or shared lineage? loop-engineering explicitly cites Cherny and Osmani, so the OSS naming is partly downstream of Anthropic practice rather than a fully independent discovery. Clawd Chief and 12-Factor look more parallel. The exact dependency graph is fuzzy.
~~Where do /loop, /goal, and “routine” precisely differ?~~ — resolved 2026-07-03: see “Where /loop, /goal, Routines, and Dynamic Workflows Precisely Differ” above for the four-way comparison table (unit of work, durability, trigger, cost profile).
Does the loop become the unit at team scale? Every source here is single-operator or single-repo. Whether “the team talks to loops” holds for multi-operator orgs is the same open question the AIOS pattern flags.

Jonathon's AI Wiki

Explorer

The Loop Is the Unit of Work — Convergent Loop Engineering Across Claude Code and Agentic Frameworks

Key Takeaways

The leverage ladder: prompt → harness → loop

Two namings of one pattern

Where `/loop`, `/goal`, Routines, and Dynamic Workflows Precisely Differ (research-agenda drain, 2026-07-03)

Verification is what makes a loop shippable

What combining them enables

Try It

Open Questions

Graph View

Table of Contents

Backlinks

Jonathon's AI Wiki

Explorer

The Loop Is the Unit of Work — Convergent Loop Engineering Across Claude Code and Agentic Frameworks

Key Takeaways

The leverage ladder: prompt → harness → loop

Two namings of one pattern

Where /loop, /goal, Routines, and Dynamic Workflows Precisely Differ (research-agenda drain, 2026-07-03)

Verification is what makes a loop shippable

A related sub-pattern: loops that improve themselves

What combining them enables

Try It

Related

Open Questions

Graph View

Table of Contents

Backlinks

Where `/loop`, `/goal`, Routines, and Dynamic Workflows Precisely Differ (research-agenda drain, 2026-07-03)