Key finding
K=3 adversarial verification cut false positives 27.8% to 0.0% (95% CI [11.1, 50.0] to [0, 0]; recall 100% to 77.8%) on a 36-snippet labeled set including prompt-injection traps. Held-out real target: 3/3 genuine bugs found, 0 surviving false positives.
Cost-aware model routing (DeepSeek to Haiku to Sonnet to Opus) with adversarial multi-agent verification and full tracing. Fans out finders per file, then K skeptics per finding (concurrency cap 8). Approximately $0.25 total per run. 58 tests, ruff + mypy + CI green.
Cost-routing claim is operator-gated on an Anthropic key. Honest statement: harness committed, live multi-tier number gated on key availability. make eval-dry reproduces offline.
The trace UI is deployed as a real product surface. View it live below.
| Metric | Value |
|---|---|
| FP rate (before) | 27.8% |
| FP rate (after K=3) | 0.0% |
| 95% CI (after) | [0, 0] |
| Recall | 77.8% |
| Cost per run | ~$0.25 |
| Tests | 58 |