Should You Build a Loop? The Four-Condition Test, Cost Math & Security Tax

Source: ai-research/plutos-eth-loops-vs-fable5-gap-2026-06-14.md — Author: plutos (@plutos_eth) · URL: https://x.com/plutos_eth/status/2064470776611504188 (X Article: “I Stopped Prompting Claude and Started Building Loops. Here’s the Gap Fable 5 Opened.”) · Posted: 2026-06-09 · 7.7K views. Mythos capabilities + costs are attributed to Anthropic’s Frontier Red Team and VentureBeat; token-cost estimates are the author’s standalone assertions; failure modes are credited to Geoffrey Huntley and Addy Osmani. 2026-07-03 research-agenda drain added: ai-research/augmentcode-agent-loop-token-cost-2026-07-03.md (mathematical cost model + measured benchmarks), ai-research/finout-claude-code-pricing-2026-07-03.md (named spike patterns + real-dollar incidents), ai-research/morphllm-ai-coding-costs-2026-07-03.md (per-plan usage-tier breakdown).

The six building blocks of a loop are by now well-covered (Osmani’s essay is the origin; the Cobus reference is the catalog). This article captures @plutos_eth’s distinct contribution: the decision and risk layer — whether you should build a loop at all, what it actually costs, how it fails silently, and the security tax nobody budgets for. Its blunt headline: “most developers don’t need a loop yet,” and a loop pointed at the wrong task “costs more than it returns, forever.”

Key Takeaways

The four-condition test — miss one box, keep it a manual prompt: (1) the task repeats at least weekly (a loop amortizes its setup across runs); (2) verification is automated (a test/type-check/linter/build that can fail the work without you in the room); (3) your token budget can absorb the waste (“obvious to people with effectively free tokens, reckless to people on a $20 consumer plan — both groups are right”); (4) the agent has a senior engineer’s tools (logs, repro env, the ability to run what it writes).
The economics are unforgiving. A single-agent loop on a medium task burns 50k–200k tokens; a fleet with an orchestrator + 3 specialists 500k–2M; a daily-scheduled loop millions a week. “Loops re-read context, retry, and explore — they spend whether or not the run ships anything.”^[author’s standalone estimates]
The only metric that matters: cost per accepted change — not tokens spent, not tasks attempted. “If fewer than half the loop’s outputs survive your review unchanged, you’re doing the review work the loop was supposed to remove, and the loop is losing.”
Four silent failure modes, all fixed by an objective gate: the Ralph Wiggum loop (Geoffrey Huntley — emits “done” early, exits on half-done work, keeps spending), self-preferential bias (maker grades its own homework), agentic laziness (“done enough” at partial completion), and goal drift (constraints evaporate by turn 47 as summarization loses them). “Not a verifier with an opinion” — a pass/fail test, a build, a zero/non-zero linter. Goal drift’s fix is a standing VISION.md re-read every run.
The security tax sharpens as the loop speeds up. Unattended loops merge insecure code on autopilot; community skills are injection vectors; long-running loops scatter secrets into debug logs; permission scope creep — scopes added “temporarily” and never re-audited (see the checklist below) — is the quiet killer.
The skill gap of 2026: a prompt engineer writes better instructions and is the feedback loop; a loop engineer writes the VISION.md, the gate, and the stop condition, then walks away and trusts the verifier. “The tools are identical. The mindset isn’t.”

The Security Tax — 30-Day Loop Checklist

The article’s most reusable artifact (re-run every 30 days):

Gate includes SAST + dependency audit + secret scanning
No skill auto-install — read the source before adding one
Verbose logging OFF in production loops; sanitize what’s logged
Permissions re-audited; remove every scope added “temporarily”
Human approval gate on merge / deploy / dependency change

The skill-injection point corroborates the wiki’s skill-security thread: “one audit found credentials leaking in hundreds of public skills out of seventeen thousand” — the same risk surface SkillSpector scans for. A loop that auto-installs community skills inherits every prompt injection in their descriptions.

The Mythos Framing (and a wiki clarification)

The article’s hook: Anthropic’s unreleased Mythos Preview red-team model autonomously found a 27-year-old OpenBSD DoS bug, a 16-year-old FFmpeg flaw (on a line fuzzers had hit 5M times), and a Linux-kernel root chain — and was deemed too dangerous to release (Project Glasswing restricted access). Per VentureBeat, the discovery campaign cost ~ $20, 000 * * w hi l e t h es p ec i f i c w innin g r u n cos t * * u n d er$ 50 — “the expensive part wasn’t intelligence, it was the system around the model — the loop.” That cost asymmetry is the article’s argument for why loop engineering, not model access, is the leverage you can actually pick up this month.

Fable 5 ≠ Mythos Preview

The title says “Fable 5” but the body describes the unreleased Mythos Preview red-team model. The wiki’s record: Fable 5 (Mythos-class) did ship 2026-06-09, while Mythos Preview is the separate internal frontier-reference model. The cyber findings + costs map to the Mythos cyber-capabilities story, not to the shipped Fable 5. Treat the article’s “best model locked in a vault” as the red-team model, not the product.^[wiki clarification — the source conflates the two]

Real Token-Cost Benchmarks, Beyond One Author’s Estimates (research-agenda drain, 2026-07-03)

The research agenda asked for empirical token-cost profiles of the three starter loops named in Write Loops, Not Prompts (issue-backlog, front-end-verification, code-review/babysit) on a Claude Max or Codex CLI plan specifically, and where the no-progress blowups actually occur. No public source benchmarks those three named loops individually — this remains genuinely unmeasured, and this article’s original token-cost ranges (50k-200k / 500k-2M / millions-per-week) stay flagged as the author’s estimates, not measured data. But general loop-cost benchmarking has caught up since this article was written, and it materially answers the “where do blowups occur” half of the question:

The cost mechanism is quadratic, not linear, and it’s now formalized. A naive N-step loop rebills the entire accumulated conversation history on every turn, so total input tokens grow as N×S + u×N(N+1)/2 rather than scaling linearly with N. A worked 10-iteration file-reading loop on Claude Sonnet 4.6 hit 43.3x the cost of a single pass (472,500 vs 9,000 input tokens) purely from re-billing prior context — before any retries or wasted exploration. This is the load-bearing reason a loop can blow past a budget even when every individual step looks cheap.
Blowups concentrate in three named places: (1) context resubmission — a long session with several retries can burn 50,000-300,000 tokens on a single prompt, reportedly the root cause behind most “my Max plan evaporated in under two hours” reports; (2) tool-output bloat — one SWE-bench measurement found 30,400 of 48,400 total tokens in a run came from tool results alone, and 40-60% of that was removable waste; (3) uncapped subagent fan-out — the single largest real-world spike pattern: one operator’s 49-subagent, 2.5-hour run was estimated at $8, 000 -$ 15,000; a financial-services team left 23 subagents running unattended for three days and reported a $47,000 bill.
Multi-agent loops cost more than single-agent loops by a known, wide multiplier — reported at roughly 4x a chat interaction for a single agent and ~15x for a multi-agent system, with an unoptimized verified multi-agent run measured at 850K tokens versus 100K for a single agent (8.5x). Coordinator-specialist architectures that isolate each subagent’s context measurably claw back ~54% of that overhead.
Real per-plan usage bands exist, even without a per-loop breakdown: published June 2026 figures put light usage (2-3 tasks/day) at ~ $36/ m o A P I - e q u i v a l e n t (co v er e d b y P r o^{'} s$ 20), a full workday at ~ $178/ m o (co v er e d b y M a x 5 x^{'} s$ 100), and continuous full-day agent use at ~ $594/ m o (co v er e d b y M a x 20 x^{'} s$ 200) — with Anthropic’s own reported enterprise average at ~$13/developer/active day.

What this resolves and what it doesn’t: the general shape of loop-cost risk — quadratic context growth, where blowups concentrate, and the order of magnitude real operators hit — is now well-evidenced rather than one author’s anecdote. The specific ask (issue-backlog vs front-end-verification vs code-review, broken out separately, on Claude Max or Codex CLI) has no published source anywhere and stays open below.

Try It

Run the four-condition test before building anything. If the task doesn’t recur weekly, has no automated gate, would blow your token budget, or the agent lacks real tooling — keep it a prompt. This is the cheapest decision in the whole topic.
Instrument cost-per-accepted-change from run one. It’s the single number that tells you whether the loop is winning; tokens-spent and tasks-attempted are vanity metrics.
Build in order, not all at once: one manual run that works → a skill → an automation → a state file → a gate → then schedule it. “Skip ahead and you’re paying for a swarm before you have a single run that works.”
Put the security checklist on a 30-day calendar reminder the moment a loop goes unattended.

Loop Engineering — Addy Osmani’s Essay — the primitives this article assumes and openly builds on (comprehension debt, cognitive surrender, maker/checker are Osmani’s).
Loop Engineering — Cobus Greyling’s Reference — the token-economics tooling (loop-cost), the L0→L3 ladder, and the severity-rated failure catalog this article’s four failure modes slot into.
Verifier-First Loops — the objective-gate discipline that fixes every failure mode named here.
Write Loops, Not Prompts — the topic entry point; its three cost controls are the beginner version of the four-condition test.
SkillSpector — the skill-security scanner for the credential-leak / prompt-injection risk this article flags.
Epoch AI — Mythos Cyber Capabilities and Mythos Preview — the actual record behind the OpenBSD/FFmpeg/kernel framing.
The Verification Frontier — why the “automated gate” condition is the load-bearing one.

Open Questions

The token-cost ranges (50k–200k / 500k–2M / millions-per-week) are the author’s experience-based estimates, not measured benchmarks — treat as directional. Update 2026-07-03: general loop-cost benchmarks now exist (see “Real Token-Cost Benchmarks” above) and corroborate the order of magnitude here, but don’t validate these specific numbers.
7.7K views, author not an established authority; the cited claims (Mythos costs/finds via VentureBeat + Red Team) are verifiable, the prescriptive economics are opinion. Weighted accordingly (confidence: medium).
Still open (researched 2026-07-03): empirical token-cost profiles for the three specific starter loops named in Write Loops, Not Prompts (issue-backlog, front-end-verification, code-review/babysit) on a Claude Max or Codex CLI plan specifically. No public source breaks these three loop types out individually — every available benchmark (see above) measures generic loop shapes (N-step file-reading agents, multi-agent fan-outs) rather than these named task types. Re-check on a future drain cycle if a practitioner publishes a like-for-like comparison.

Jonathon's AI Wiki

Explorer

Should You Build a Loop? The Four-Condition Test, Cost Math & Security Tax

Key Takeaways

The Security Tax — 30-Day Loop Checklist

The Mythos Framing (and a wiki clarification)

Real Token-Cost Benchmarks, Beyond One Author’s Estimates (research-agenda drain, 2026-07-03)

Try It

Open Questions

Graph View

Table of Contents

Backlinks

Jonathon's AI Wiki

Explorer

Should You Build a Loop? The Four-Condition Test, Cost Math & Security Tax

Key Takeaways

The Security Tax — 30-Day Loop Checklist

The Mythos Framing (and a wiki clarification)

Real Token-Cost Benchmarks, Beyond One Author’s Estimates (research-agenda drain, 2026-07-03)

Try It

Related

Open Questions

Graph View

Table of Contents

Backlinks