Jonathon's AI Wiki

Tag: benchmarks

6 items with this tag.

  • Jun 22, 2026

    GLM-5 / GLM-5.2 — Z.ai's Open-Weight Agentic-Coding Frontier

    • glm
    • zai
    • zhipu
    • open-weights
    • agentic-coding
    • claude-code
    • long-horizon
    • benchmarks
    • moe
  • Jun 15, 2026

    Claude Opus 4.8 — Anthropic Release + System Card (May 28, 2026)

    • opus-4-8
    • claude
    • model-release
    • system-card
    • anthropic
    • benchmarks
    • alignment
    • safety
    • agentic
    • long-context
    • pricing
    • mythos-preview
  • Jun 10, 2026

    When AI Builds Itself — Anthropic on Recursive Self-Improvement

    • anthropic
    • anthropic-institute
    • recursive-self-improvement
    • ai-accelerating-ai
    • mythos-preview
    • benchmarks
    • agentic-coding
    • research-automation
    • alignment
    • ai-safety
    • capability-curve
    • jack-clark
  • May 31, 2026

    agentmemory — Persistent Memory for AI Coding Agents (rohitg00)

    • agent-memory
    • persistent-memory
    • mcp
    • claude-code
    • codex
    • retrieval
    • rrf
    • knowledge-graph
    • benchmarks
    • open-source
  • May 29, 2026

    The AI Paradox — Dan Shipper on Why More Automation Means More Humans (Lenny's Podcast)

    • dan-shipper
    • every
    • ai-agents
    • super-agent
    • cowork
    • codex
    • forward-deployed-engineer
    • saas
    • ai-product-management
    • benchmarks
    • podcast
  • Apr 14, 2026

    Stanford HAI AI Index Report 2026

    • industry-report
    • stanford-hai
    • ai-adoption
    • compute
    • responsible-ai
    • ai-economy
    • benchmarks
    • policy
    • epoch-ai
    • annual-report

Created with Quartz v4.5.2 © 2026

  • ✦ Explore the graph in 3D