Which LLM vendors does this cover?+
Anthropic Messages, OpenAI Chat Completions, OpenAI Responses, and Gemini generateContent. Streaming and non-streaming on every vendor. Add a vendor by pointing your agent SDK at the proxy base URL instead of the vendor base URL; vendor body shapes are walked natively.
How does this differ from agent monitoring or LLM observability products?+
Observability watches what already happened. DataDam sits inline and changes the request before the vendor sees it. PII gets redacted, custom patterns get redacted, image attachments get black rectangles painted over detected regions. No post-hoc alert about a leak that already occurred.
Does this catch Claude Desktop or the ChatGPT app?+
No, not today. Those apps connect directly to their vendor and do not expose a base URL override, so no proxy can insert itself in the path. We cover agent-driven traffic where you control the SDK (your in-house agents, Cursor, Continue, Cline, custom workflows). Coverage for Claude Desktop and the ChatGPT web app is on the roadmap via the managed browser extension (DataDam Endpoint) plus an MDM enterprise pack that routes consumer traffic through the same DataDam proxy and audit pipeline. Honest framing: this product slice covers agents, not consumer-AI shadow IT, until the extension ships.
Does the proxy hold our vendor API keys?+
Yes, on your proxy host. The agent presents its DataDam key; the proxy strips that and attaches your vendor key on forward. Your vendor key lives in your environment's secrets layer (env vars, AWS Secrets Manager, 1Password, HashiCorp Vault). The control plane never sees your vendor key.
What gets scanned inside images?+
Text inside the image gets the same PII detection that runs on prompts: emails, phone numbers, SSNs, payment cards, secrets, and your operator-authored custom patterns. Visual threats that text extraction misses are flagged separately: credit cards, ID cards, faces, signatures. The detection class list is operator-extensible.
What does the vendor actually receive when an image gets redacted?+
The vendor still receives an image. Detected regions are painted over with black rectangles; the rest of the image passes unchanged. The agent sees a vendor response generated against the redacted image. If image encoding ever fails, the proxy strips the image entirely and substitutes a clear sentinel so the request still parses.
Can we choose how strict the gate is?+
Yes. Per-vendor mode picks one of three: warn (forward as-is, attach a response header), redact (substitute detected spans inline before forwarding, the default), or block (return 403 to the agent, no upstream call). Set org-wide defaults and override per vendor. Mode lives on the Settings page.
What happens to streaming responses?+
Streaming is supported on every vendor. Request scanning runs before the upstream call. Response scanning is optional: when on, each streaming delta gets the same per-delta detection and forwards inline. Per-delta detection has gaps for entities that split across deltas; the request-side scan is your primary defense.
CPU or GPU version, how do I choose?+
CPU version is the default and runs on standard pods. Full text and visual detection coverage. Suits almost every deployment. GPU version requires a CUDA-capable GPU pod and runs detection in a single multimodal pass with sub-second per-image latency. Pick GPU when your agent workload is image-heavy and latency budget is tight, or when you already provision GPU pods for other AI workloads.
Can we author our own detection rules?+
Yes. The console supports operator-authored regex patterns scoped per-vendor. Internal project codes, customer record IDs, anything the built-in detection does not know about. Rules apply alongside the built-in detection on every request.
What goes in the audit row?+
Per-vendor lineage event with entity counts, per-detector attribution (text vs visual), mode, outcome, and latency. The detection paths (which message, which content part) are included; the detected values themselves never are. The row flows to your configured SIEM sink alongside every other DataDam audit row.
Does this work in air-gapped environments?+
Yes. Detection runs entirely inside the proxy host. No third-party API calls, no model weights downloaded at runtime (the container ships with weights pre-baked, or you mount them once from your own volume). The proxy reaches your configured LLM vendor over your own network egress, not ours.
How do we turn it off if it breaks something?+
Three knobs. Per-vendor mode flip to warn (forwards everything, attaches a header). Org-wide image attachment toggle on the Settings page (image scanning off, text scanning on). Or kill switch on the LLM egress endpoint entirely. All three propagate to every proxy in your fleet via /sync within five minutes.