Jonathon's AI Wiki

Tag: alignment

5 items with this tag.

Jun 16, 2026
Claude Fable 5 and Claude Mythos 5
Jun 15, 2026
Claude Opus 4.8 — Anthropic Release + System Card (May 28, 2026)
Jun 11, 2026
Claude Mythos Preview — Anthropic System Card (April 7 2026)
Jun 10, 2026
When AI Builds Itself — Anthropic on Recursive Self-Improvement
May 08, 2026
Translating Claude's Thoughts Into Language — Activation-to-Text Interpretability

Created with Quartz v4.5.2 © 2026
