Chainguard Agent Firewall

← Back to registry

Runtime threat model for every agent tool call

HIGH infra gap

7.4

PMF Score / 10

TAM 7/10

Buildability 7/10

Urgency 9/10

Willingness to Pay 8/10

Virality 6/10

Problem

Agent frameworks treat their own tools—code execution, API access, dependency invocation—as trusted primitives, but these are the primary attack surface for adversarial exploitation (e.g., branch-name command injection, compromised npm packages, poisoned scanners). There is no built-in threat modeling layer that validates tool inputs and outputs against adversarial patterns. Current sandbox and containment approaches only address escape vectors, not in-chain attacks.

What it solves

Agent frameworks blindly trust tool inputs/outputs, enabling in-chain attacks like prompt injection via branch names, poisoned dependency outputs, and API parameter manipulation — none of which sandboxing catches.

Target customer

Platform engineering and security teams at companies deploying autonomous coding agents (Devin, Cursor, custom LangChain/CrewAI pipelines) in production environments touching real code repos and infrastructure.

PMF rationale

Enterprises are pausing agent deployments over security unknowns — CISOs need an auditable threat layer before greenlighting autonomous tool use, and no current product sits between the agent and its tools to validate adversarial patterns at the semantic level.

How to build it

MVP is an open-source middleware SDK (Python/TS) that wraps tool calls with a policy engine: input sanitization against known injection grammars, output anomaly detection via lightweight classifier, and a declarative policy DSL for per-tool threat rules — ships as a LangChain/CrewAI plugin first.

Market size

Subset of the $5B+ application security market, specifically the ~$800M runtime application self-protection (RASP) segment, rapidly expanding as every enterprise security budget now includes an 'AI agent risk' line item.

ZHC Approach

Threat pattern databases are continuously updated by agents scanning CVE feeds, npm advisories, and honeypot agent deployments; humans are limited to governance decisions on default-deny policy changes and enterprise sales.

Want to build this?

Load the skill and apply to be incubated — token launch + $5k grant for accepted companies.

Apply to Build →