Benchmarks

ThreatPrint — methodology & results

The numbers on the landing page come from a reproducible benchmark published in the Pulse-Proxy repository. This page summarizes the methodology, environment, and key tables from the April 13, 2026 run.

Detection rate: 96% (attack prompts correctly flagged)
False positives: 0% (clean prompts falsely flagged)
Median Worker CPU: 1.41 ms (Cloudflare Analytics p50)

Environment

Proxy: https://proxy.orionslock.com
Endpoint: POST /v1/threatprint/test
Package version: 1.0.0 (package.json)
Rule bundle: RULES_VERSION = 2026.04.01
Scanner: SCANNER_VERSION = 1.0.0

Methodology

  1. Driver: scripts/extended-benchmark.ps1 (PowerShell 5.1+).
  2. Corpus: 100 attack prompts (20 × 5 categories) and 100 clean prompts (20 × 5).
  3. Transport: sequential POSTs with X-Pulse-Key header and a 30 ms delay between calls.
  4. Scoring: the live Worker's ThreatPrint rules, together with the authenticated key's trust_level, determine the block/warn/log thresholds.
  5. Attack pass: blocked or warned counts as detected; logged-only is a miss.
  6. Clean pass: logged-only passes; blocked or warned is a false positive.
  7. Artifacts: results/benchmark-results.json and results/benchmark-summary.md (the JSON file is gitignored).
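The transport and pass/fail rules above can be sketched in a few lines. This is an illustrative Python sketch, not the PowerShell driver itself. The JSON request body with a "prompt" field and the response field "action" (with values "block", "warn", "log") are assumptions; the actual wire format is defined by the driver script and the Worker.

```python
import json
import time
import urllib.request

URL = "https://proxy.orionslock.com/v1/threatprint/test"
DELAY_SECONDS = 0.030  # 30 ms pause between sequential calls


def build_request(prompt: str, pulse_key: str) -> urllib.request.Request:
    """Build one POST that carries the X-Pulse-Key header."""
    # Assumption: the endpoint accepts a JSON body with a "prompt" field.
    body = json.dumps({"prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        URL,
        data=body,
        method="POST",
        headers={"Content-Type": "application/json", "X-Pulse-Key": pulse_key},
    )


def send(request: urllib.request.Request) -> str:
    """Send one request and return the action reported by the Worker."""
    # Assumption: the response is JSON with an "action" field whose value
    # is "block", "warn", or "log".
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)["action"]


def run_corpus(prompts, pulse_key, send_fn=send, delay=DELAY_SECONDS):
    """POST prompts one at a time, sleeping `delay` seconds between calls."""
    actions = []
    for index, prompt in enumerate(prompts):
        if index > 0:
            time.sleep(delay)
        actions.append(send_fn(build_request(prompt, pulse_key)))
    return actions


def is_flagged(action: str) -> bool:
    """Blocked or warned counts as flagged; logged-only does not.

    For an attack prompt, flagged means detected.
    For a clean prompt, flagged means false positive.
    """
    return action in ("block", "warn")
```

Passing `send_fn` as a parameter keeps the loop testable without network access: a stub that returns a fixed action can stand in for the live proxy.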

Executive summary

Attack prompts: 100
Clean prompts: 100
Attacks blocked: 40
Attacks warned: 56
Attacks missed (logged only): 4
Detection rate: 96%
False positives on clean: 0
False positive rate: 0%
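As a quick check of the arithmetic, the two headline rates follow directly from the counts in this summary. The variable names below are illustrative only.

```python
# Counts taken from the executive summary.
attacks_blocked = 40
attacks_warned = 56
attacks_missed = 4
clean_flagged = 0
clean_total = 100

# An attack is detected when it is blocked or warned.
attack_total = attacks_blocked + attacks_warned + attacks_missed
detection_rate = (attacks_blocked + attacks_warned) / attack_total

# A clean prompt is a false positive when it is blocked or warned.
false_positive_rate = clean_flagged / clean_total

print(f"Detection rate: {detection_rate:.0%}")            # Detection rate: 96%
print(f"False positive rate: {false_positive_rate:.0%}")  # False positive rate: 0%
```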

Attack detection by category

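| Category | Blocked | Warned | Missed | Detection |
| --- | --- | --- | --- | --- |
| prompt_injection | 4 | 16 | 0 | 100% |
| jailbreak | 11 | 8 | 1 | 95% |
| data_exfiltration | 15 | 5 | 0 | 100% |
| cost_abuse | 3 | 17 | 0 | 100% |
| encoded_obfuscation | 7 | 10 | 3 | 85% |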
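The per-category rows should add up to the executive-summary totals (40 blocked, 56 warned, 4 missed), with 20 prompts per category. A short sketch of that cross-check follows. The dictionary layout is an assumption made for illustration, not the format of results/benchmark-results.json.

```python
# (blocked, warned, missed) per attack category, from the table above.
ROWS = {
    "prompt_injection": (4, 16, 0),
    "jailbreak": (11, 8, 1),
    "data_exfiltration": (15, 5, 0),
    "cost_abuse": (3, 17, 0),
    "encoded_obfuscation": (7, 10, 3),
}


def detection(blocked: int, warned: int, missed: int) -> float:
    """Share of prompts in a category that were blocked or warned."""
    return (blocked + warned) / (blocked + warned + missed)


# Column sums across all categories: (blocked, warned, missed).
totals = tuple(sum(column) for column in zip(*ROWS.values()))

for category, counts in ROWS.items():
    print(f"{category}: {sum(counts)} prompts, {detection(*counts):.0%} detected")
print(f"totals (blocked, warned, missed): {totals}")
```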

Clean prompts: false positives by category

| Category | Flagged (blocked or warned) |
| --- | --- |
| coding | 0 |
| business_writing | 0 |
| technical_explanation | 0 |
| data_analysis | 0 |
| creative_writing | 0 |

Reproducibility

Every prompt in the corpus is published verbatim in the Pulse-Proxy/BENCHMARKS.md document alongside the exact driver script. Given a valid Pulse key, any reviewer can re-run the full 200-prompt suite and publish their own numbers.

  1. Clone the Pulse-Proxy repo.
  2. Set the PULSE_KEY environment variable to your Pulse key (in PowerShell: $env:PULSE_KEY = "<your key>").
  3. Run .\scripts\extended-benchmark.ps1.
  4. Inspect results/benchmark-summary.md.

Benchmark run date: April 13, 2026. Corpus size: 100 attack + 100 clean. Numbers above match results/benchmark-summary.md from that run.