Jonathon's AI Wiki

Tag: claudes-constitution

1 item with this tag.

  • Jul 02, 2026

    Refusal Calibration and Constitutional AI — Shaping Claude's Refusals Without Raising Jailbreak Risk

    • constitutional-ai
    • claudes-constitution
    • refusals
    • over-refusal
    • jailbreak
    • safety
    • system-prompt
    • anthropic

Created with Quartz v4.5.2 © 2026

  • ✦ Explore the graph in 3D