Jonathon's AI Wiki

Tag: benchmarks

6 items with this tag.

Jun 22, 2026
GLM-5 / GLM-5.2 — Z.ai's Open-Weight Agentic-Coding Frontier
Jun 15, 2026
Claude Opus 4.8 — Anthropic Release + System Card (May 28, 2026)
Jun 10, 2026
When AI Builds Itself — Anthropic on Recursive Self-Improvement
May 31, 2026
agentmemory — Persistent Memory for AI Coding Agents (rohitg00)
May 29, 2026
The AI Paradox — Dan Shipper on Why More Automation Means More Humans (Lenny's Podcast)
Apr 14, 2026
Stanford HAI AI Index Report 2026

Created with Quartz v4.5.2 © 2026

✦ Explore the graph in 3D