Endpoint
Request body
The text to classify. Max 32,000 characters; longer inputs are truncated from the right.
Where the text came from. Tool-origin content (API responses, email bodies, HTML pages, docs) is the dominant attack vector for agents and has a dedicated classifier path.
Controls the returned verdict. In block mode, injections return "verdict": "block". In warn mode, they return "verdict": "warn" (non-injections return "pass" in both modes). Does not affect the HTTP response; the API never throws on injection. Your client decides what to do. Note: the SDK exposes zero-latency behavior for warn mode by running the scan in the background. If you call this HTTP endpoint directly, you always wait for the response.
Optional label for the tool / source that produced this text. Shows up in the dashboard as a per-source breakdown.
Optional label for which of your agents is making the call. Shows up in the dashboard as a per-agent breakdown.
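The length limit and the two modes described above can be checked client-side before a request is sent. The following is a minimal sketch, not part of the API or SDK: `prepare_scan_input`, `MAX_CHARS`, and `VALID_MODES` are hypothetical names, and it assumes that "truncated from the right" means the first 32,000 characters are kept and the tail is dropped. It also assumes the server counts characters the same way Python's `len` does, which may not hold for all inputs.

```python
from typing import Tuple

MAX_CHARS = 32_000  # limit stated in the request body description
VALID_MODES = ("block", "warn")  # the two modes the docs describe


def prepare_scan_input(text: str, mode: str) -> Tuple[str, bool]:
    """Return (text_as_it_would_be_classified, was_truncated).

    Hypothetical helper: lets a caller notice, before sending, that part
    of the input would not be seen by the classifier.
    """
    if mode not in VALID_MODES:
        raise ValueError(f"mode must be one of {VALID_MODES}, got {mode!r}")
    was_truncated = len(text) > MAX_CHARS
    # Assumption: truncation keeps the start of the text and drops the tail.
    return text[:MAX_CHARS], was_truncated
```

A caller that cares about the dropped tail could split long inputs into several scans instead of relying on truncation.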
Response
UUIDv4 identifier for this scan. Use it to correlate with dashboard logs.
true if the score crosses our injection threshold (score ≥ 0.5).
Model confidence between 0.0 and 1.0. Values near 0 = confidently benign; values near 1 = confidently injection.
"block": injection detected, mode was "block". Client should abort.
"warn": injection detected, mode was "warn". Client should log but proceed.
"pass": benign.
The classifier version that produced the score (e.g. classifier-v3). Stable for a deployment; changes when we roll a new model.
Server-side inference latency in milliseconds. Useful for debugging slow calls separate from your network RTT.
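The relationship between score, mode, and verdict described in this section can be sketched as two small functions. This is an illustration for mocking or client-side handling, not the service's implementation: `expected_verdict` and `client_action` are hypothetical names, and the action strings are placeholders for whatever your client actually does.

```python
INJECTION_THRESHOLD = 0.5  # threshold stated in the response description


def expected_verdict(score: float, mode: str) -> str:
    """Model the documented mapping: injections take the mode's name, the rest pass."""
    if mode not in ("block", "warn"):
        raise ValueError(f"unknown mode: {mode!r}")
    is_injection = score >= INJECTION_THRESHOLD
    return mode if is_injection else "pass"


def client_action(verdict: str) -> str:
    """Translate a verdict into the client behavior the docs recommend."""
    actions = {
        "block": "abort",            # injection detected, stop the agent step
        "warn": "log_and_proceed",   # injection detected, log but continue
        "pass": "proceed",           # benign
    }
    if verdict not in actions:
        raise ValueError(f"unexpected verdict: {verdict!r}")
    return actions[verdict]
```

Because the API returns 200 regardless of the verdict, this branching has to live in the client; checking only the HTTP status would miss every detected injection.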
Response codes
| Code | Meaning |
|---|---|
| 200 | Classification returned |
| 400 | Malformed request body |
| 401 | Missing / invalid / revoked API key |
| 429 | Rate limit exceeded |
| 5xx | Transient server error. Retry with backoff. |
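One way to act on the table above is a small retry wrapper with exponential backoff. This is a sketch under stated assumptions: the docs only say to retry 5xx responses, so treating 429 as retryable is an assumption here, as are the delay values. `send` is a hypothetical callable that performs one HTTP request and returns `(status_code, parsed_body)`; the endpoint URL and payload are deliberately left out.

```python
import time
from typing import Callable, List, Tuple


def is_retryable(status: int) -> bool:
    # 5xx is documented as transient. Retrying 429 is an assumption.
    return status == 429 or 500 <= status <= 599


def backoff_delays(retries: int, base: float = 0.5, cap: float = 8.0) -> List[float]:
    """Exponential delays in seconds: base, 2*base, 4*base, ... capped at `cap`."""
    return [min(cap, base * (2 ** i)) for i in range(retries)]


def call_with_retry(
    send: Callable[[], Tuple[int, dict]],
    retries: int = 3,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, dict]:
    """Call `send`, retrying up to `retries` times on retryable statuses."""
    status, body = send()
    for delay in backoff_delays(retries):
        if not is_retryable(status):
            break
        sleep(delay)
        status, body = send()
    return status, body
```

400 and 401 are returned to the caller immediately, since resending the same malformed body or rejected key would not change the outcome. The `sleep` parameter is injectable so the wrapper can be exercised in tests without real waiting.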