Source: wiki synthesis: joshpocock-vault, synthadoc, build-llm-wiki-for-business-walkthrough, karpathy-vault-additions-from-synthadoc, pipeline-diagram, arscontexta-skill-graphs, ben-five-skill-aios-setup, nate-herk-claude-code-operating-systems-course
Karpathy’s “the LLM should be able to maintain a wiki for you” gist (viral X tweet + GitHub gist follow-up) is now an entire ecosystem. Six distinct third-party implementations of the pattern have landed in this wiki across 2026-04 through 2026-05-17 — across two topic folders (karpathy-pattern for community starters/engines, claude-ai for skill/plugin packagings of the same shape). This article maps the spectrum, surfaces the design choices each implementation made, and pulls out what this vault has already adopted from each. It exists because the cluster of six is now too large to navigate ad-hoc from individual topic indexes, and because the meta-pattern (raw → schema → wiki, with CLAUDE.md as the routing layer) is converging fast across previously-disconnected creators.
Key Takeaways
Six implementations, three form-factors
| Implementation | Form factor | License / stars | Article |
|---|---|---|---|
| joshpocock/karpathy-obsidian-vault (Stride starter) | Minimal Obsidian template — ships empty, 4 ops | MIT / 9 ★ | joshpocock-vault |
| axoviq-ai/synthadoc | Python engine + Obsidian plugin — ~10 ops, SQLite audit DB | AGPL-3.0 / 231 ★ | synthadoc |
| Dream Labs AI walkthrough | 10-min YouTube tutorial — how to adopt for a business | n/a (educational) | build-llm-wiki-for-business-walkthrough |
| agenticnotetaking/arscontexta (Heinrich) | Claude Code plugin — 6-phase conversational setup, 16+ commands, 4 hooks, 249 research claims | MIT / ~3,300 ★ | arscontexta-skill-graphs |
| Ben’s 5-Skill AIOS | Claude Code skill bundle — /os-setup + /os-operator + /os-optimizer + /team-os + /os-mcp | Plugin ZIP / paid community | ben-five-skill-aios-setup |
| Nate Herk’s AIS-OS | Claude Code skill bundle (3 skills) + Three-Ms / Four-Cs frameworks | MIT / 75 ★ | nate-herk-claude-code-operating-systems-course |
The three form-factors are: template (Stride), engine (synthadoc), and plugin-bundle (arscontexta, Ben’s AIOS, Nate’s AIS-OS). The Dream Labs AI walkthrough is form-factor-agnostic — it’s a tour of how to spin up the pattern with no specific tooling.
Spectrum endpoints
- Minimalist end: Stride starter. 4 operations (Ingest, Research, Query, Lint). No frontmatter, no manifest, no contradiction handling, no provenance markers. Ships empty. “Hello world” of the pattern.
- Architectural end: synthadoc. ~10 operations. SQLite audit DB. 5-pass IngestAgent with explicit contradiction flagging + auto-resolve at ≥85% confidence.
frontmatter status: active|contradicted|archived. Query decomposition. Per-domainAGENTS.md. Background job queue. Multi-LLM (7 providers + 2 CLI subscriptions). - This vault sits between them — closer to synthadoc on operational discipline (11 ops, 17-check lint, manifest, contradiction callouts, provenance, confidence decay) but closer to Stride on form factor (Obsidian-native, no Python engine, no SQLite job queue until 2026-05-05 when
bin/build-audit-dbshipped).
The three plugin packagings converge on the same shape
Ars Contexta, Ben’s AIOS, and Nate Herk’s AIS-OS all ship the Karpathy pattern as a Claude Code skill bundle. The shape is consistent:
- Onboarding skill — conversational setup that derives folder structure +
CLAUDE.mdper subfolder. Arscontexta calls it/arscontexta:setup. Ben’s calls it/os-setup. Nate’s calls it/onboard. All three ask a small number of domain questions (2-4 / 12-section / 7-question respectively) and scaffold from the answers. - Audit/lint skill — recurring “is the brain healthy?” check. Arscontexta has
/arscontexta:health+/reflect. Ben’s has/os-optimizer(combines Anthropic’s CLAUDE.md best practices + caveman compression + Chroma context-route + this Karpathy LLM-wiki pattern). Nate’s has/audit(Four-Cs gap report). - Operator/level-up skill — recurring “what should I work on?” function check. Ben’s
/os-operatorpulls daily real-time context from connectors (Fireflies, Slack, Calendar). Nate’s/level-updoes a Three-Ms interview producing one shipped artifact. Arscontexta uses/reduce+/reflect+/reweave(the “6 Rs” processing pipeline).
Ben’s /os-optimizer explicitly cites the Karpathy LLM-wiki as one of its named frameworks. That’s the single most direct vault-citation in any of the six — it names this very pattern as one of its compression/audit references.
The Dream Labs AI walkthrough closes the loop
The third-party Dream Labs AI tutorial (10 minutes, YouTube FAWm7DuFSPc, 2026-05-17) demonstrates how an operator with no prior wiki sets up the pattern in ~9 minutes:
- Copy Karpathy’s improved gist (his “v2” follow-up to the viral tweet) into a fresh Obsidian welcome page.
- Voice-prompt Claude Code: “I just pasted in an idea file from Andrej Karpathy. Can you please use the vault in Obsidian called [name] to build this out for me?”
- Claude scaffolds
raw/(withassets/subfolder),wiki/concepts/entities/sources/synthesis/four-folder layout,CLAUDE.md,index.md,log.md. Welcome page transforms into a user-guide. - Drag raw business data into
raw/assets/— PDFs, transcripts, MDs, goals docs, competitor research. - Compile.
The walkthrough is a tour of this exact same machine, applied to a business-second-brain context — Karpathy’s idea file → Obsidian vault → raw/ → wiki/. Distinct from Karpathy’s own framing in from-vibe-coding-to-agentic-engineering (Sequoia talk) by being business-applied rather than developer-focused.
Two patterns surfaced in the walkthrough that this vault should track:
- The Alex Hormozi 12-question self-interview — used as seed content for
raw/when the business doesn’t have rich existing documentation. Claude conducts the interview, the operator answers, output goes intoraw/and gets ingested. Useful for net-new operators with no document corpus yet. - The gap-detection query — “Tell me where the gaps in my knowledge are for my business.” Same effect as this vault’s question-log gap scoring but conversationally accessible to a non-developer. Worth wiring as a one-liner shortcut.
Convergent architectural choices
Every implementation in the six landed on these design decisions independently:
- Raw → schema → wiki as the three-layer pipeline. Universal across all six. Karpathy’s gist is the schema source; raw/ is the immutable inbox; wiki/ is the LLM-curated output. (Synthadoc adds an explicit ingest job queue between raw and wiki, but the layering is the same.)
CLAUDE.mdas the routing/schema file. Universal across all six. Ben’s bundle even ships per-subfolderCLAUDE.mdfiles for nested context routing — this vault adopted the same pattern via topic_index.mdfiles + the optional per-topicAGENTS.mdborrowed from synthadoc.- Obsidian as the visualization layer, folder as the source of truth. Universal across the implementations that name a visualization tool. Synthadoc ships an Obsidian plugin natively; Stride is an Obsidian template; arscontexta ships a Claude Code plugin that targets Obsidian-compatible markdown; Ben’s bundle requires Obsidian + the Relay community plugin for team sync. The folder, not the app, is canonical.
- Append-only operation log (
log.md). Universal across the implementations that document persistence. Synthadoc layers JSON-lines + SQLite on top; this vault layers JSON manifest + SQLite (viabin/build-audit-db) on top. - Scale awareness at the same thresholds. Stride’s README and this vault both flag the ~500-article scale ceiling explicitly. Beyond that, the master index stops being a reliable retrieval layer and a search tool becomes mandatory.
Divergence points
Where the implementations differ is more diagnostic of design choices than where they agree:
- Frontmatter discipline. This vault and synthadoc both require YAML frontmatter on every article (title, topic, sources, created, updated, confidence, provenance, type, status). Stride is frontmatter-free. Arscontexta uses frontmatter but doesn’t enforce a fixed schema. Ben’s and Nate’s bundles are silent on schema.
- Contradiction handling. Synthadoc’s
[!contradiction]+status: contradicted+ auto-resolve at ≥85% is the most discipline. This vault adopted the callout + status field but not auto-resolve (every contradiction stays for human review). Stride flags conflicts under## Open Questionsonly. The plugin bundles are silent on contradiction handling. - Manifest / delta-based ingest. Synthadoc + this vault both track sha256-keyed delta ingestion. Stride and the plugin bundles re-process everything on each compile.
- Live publishing. This vault publishes via Quartz → Cloudflare Worker. Synthadoc, Stride, and the plugin bundles ship no publishing primitive — they’re local-only. The Dream Labs AI walkthrough mentions a published version is possible but doesn’t demonstrate it.
- What gets adopted by the LLM that maintains it. This vault has explicitly adopted 5 features from synthadoc (see karpathy-vault-additions-from-synthadoc) and 3 from arscontexta (per the article’s “Why this matters” section: promote
bin/scripts to Claude Code commands, fresh-context-per-phase in/compile, hooks-enforced quality boundary — pending). Stride contributed nothing this vault adopted because it’s strictly a subset. Ben’s, Nate’s, and the Dream Labs walkthrough contributed naming + a few queries (gap-detection, Hormozi 12-questions) but no structural changes.
What this vault should adopt next
From cross-reading the six side-by-side, three concrete additions surface as load-bearing:
- Wire
bin/scripts as Claude Code slash commands. The arscontexta article explicitly enumerates this as the most adoptable pattern: promotebin/post-ingest,bin/refresh,bin/lint-stale-sources,bin/output-reviewfrom hidden shell scripts to visible/post-ingest,/refresh,/lint-stale,/output-reviewslash commands. Same engine; better discoverability for any other operator who clones this vault. - Adopt fresh-context-per-phase pattern for
/compile. Currently/compileruns all triage decisions in a single session context. Arscontexta’s/pipelineand/ralphpatterns reset context between phases — Vision → Analysis → Candidate search → Decision → Write — to keep each phase’s prompt focused. Worth piloting on a multi-source batch. - Adopt the Hormozi 12-question + gap-detection conversational shortcuts. Both are zero-engineering additions. Hormozi 12-questions becomes a
raw/seed/template the operator can fill via voice transcription. Gap-detection becomes a one-liner/gapsslash command wrapping the existing question-log + topic_research-agenda.mdsynthesis.
Karpathy’s improved gist (open thread)
The Dream Labs AI walkthrough references Karpathy’s improved/updated gist as the “v2” of the original viral tweet (linked in the video description). This vault’s vault schema is descended from the original gist, but no direct ai-research pull of the improved version exists yet. Worth a primary-source pull to verify whether Karpathy himself has refined the schema in ways that should propagate to this vault.
Try It
- If starting fresh: clone the Stride starter (
joshpocock/karpathy-obsidian-vault). It’s the lowest-overhead entry point and ships ready-to-use. See joshpocock-vault § Try It. - If you need a Python engine + multi-LLM ingest + SQLite audit: install synthadoc (
pip install synthadocper the README). See synthadoc § Try It. - If you want a Claude Code plugin with conversational onboarding: install arscontexta (
/plugin install agenticnotetaking/arscontexta) and run/arscontexta:setup. See arscontexta-skill-graphs § Try It. - If you want a paid AIOS bundle with team permissions + Railway MCP: Ben’s 5-skill bundle (plugin ZIP link in his YouTube description). See ben-five-skill-aios-setup.
- If you want the tool-agnostic framework (Three Ms + Four Cs): Nate Herk’s free AIS-OS repo + 2-hour course. See nate-herk-claude-code-operating-systems-course.
- If you want a 10-minute hands-on walkthrough: the Dream Labs AI YouTube tutorial (
FAWm7DuFSPc). See build-llm-wiki-for-business-walkthrough. - If you’re maintaining this vault: action the three “what this vault should adopt next” items above. Each is a small, focused commit.
Related
- _index — topic root for community implementations
- Andrej Karpathy — Karpathy’s own talks/tutorials, including the original gist source
- from-vibe-coding-to-agentic-engineering — Karpathy’s Sequoia talk that endorses LLM knowledge bases
- pipeline-diagram — visual architecture of how this specific vault implements the pattern
- karpathy-vault-additions-from-synthadoc — the specific 5 improvements this vault adopted from synthadoc
- claude-code-best-practices — Anthropic’s primary-source best-practices doc, anchors the meta-pattern these adoptions implement
- seven-claude-skills-run-my-business — operator-curated skill catalog using Obsidian-as-persistence (the same pattern at a different scale)
- 2026-claude-code-aios-pattern — convergent-evidence cousin article documenting AIOS adoption across three other practitioners
Open Questions
- Karpathy’s improved gist — referenced in the Dream Labs AI walkthrough as v2 of the viral tweet; no primary-source pull yet. Action: ai-research pull, then compare against this vault’s
CLAUDE.mdfor adopt-worthy refinements. - Quantitative cross-implementation comparison — no benchmark exists yet of compile speed, retrieval quality, or contradiction-detection accuracy across the six implementations. A standardized eval (same
raw/corpus run through each) would be a meaningful contribution. - Plugin ecosystem maturity threshold — at what point do the plugin bundles (arscontexta, Ben’s, Nate’s) replace the hand-rolled vault as the recommended entry point? Today, hand-roll wins for operators who already own the vault discipline; plugins win for cold-start operators with no Claude Code experience. The threshold likely shifts as plugin onboarding gets smoother.
- Role-model-mentor “skills plugin” concept (from Dream Labs AI) — packaging a public mentor’s body of work as a
.claude/skills/<mentor-name>/folder withSKILL.mdorienting answer framing in that mentor’s voice. The creator says they’re personally working on it; no shipped artifact yet. Worth watching.