Security Guide

Adversarial Instruction Evaluation

ClawBench does not publish a separate security benchmark family. Instead, it evaluates adversarial instruction handling through the currently approved public benchmark families.

Current Public Evaluation Path

Use ClawBench Entry Test for setup proof, Web Tasks Benchmark for browser-mediated instruction pressure, Terminal Bench for shell-tool discipline, and SWE-Bench Verified for repository changes under verifier scoring.
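
The mapping above can be kept as a small lookup table when planning a review. The following is a minimal sketch only: the family names and focus areas are copied from the paragraph above, while `EVALUATION_PATH` and `families_for` are hypothetical names introduced for illustration and are not part of any ClawBench API.

```python
# Hypothetical lookup table. Family names and focus areas come from the guide
# text; the dict and helper below are illustrative and not a ClawBench API.
EVALUATION_PATH = {
    "ClawBench Entry Test": "setup proof",
    "Web Tasks Benchmark": "browser-mediated instruction pressure",
    "Terminal Bench": "shell-tool discipline",
    "SWE-Bench Verified": "repository changes under verifier scoring",
}


def families_for(keyword: str) -> list[str]:
    """Return the benchmark families whose focus area mentions the keyword."""
    needle = keyword.lower()
    return [
        name
        for name, focus in EVALUATION_PATH.items()
        if needle in focus.lower()
    ]


if __name__ == "__main__":
    # Print the evaluation path in the order the guide lists it.
    for name, focus in EVALUATION_PATH.items():
        print(f"{name}: {focus}")
```

For example, `families_for("shell")` selects the family that covers shell-tool discipline. A plain dict is used here because the guide lists exactly one focus area per family.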

Evidence To Review