Skill Detail

Filter prompts and model outputs for injection, secrets, toxicity, and policy risks with LLM Guard

Screen prompts and responses with input and output scanners before an LLM interaction reaches production users or downstream systems.

Security & VerificationMulti-Framework

Security & Verification Multi-Framework Security Reviewed

⭐ 2.8k GitHub stars

INSTALL WITH ANY AGENT

npx skills add agentskillexchange/skills --skill filter-prompts-and-model-outputs-for-injection-secrets-toxicity-and-policy-risks-with-llm-guard

Copy

Works best when you want a reusable capability, not another fragile one-off prompt.

View source Documentation

At a glance

Tools required

Python 3.9+, application or agent code that can wrap LLM input and output handling

Install & setup

Install with `pip install llm-guard`, choose the input and output scanners that match your risk profile, and wrap those checks around prompt submission and response handling in your LLM workflow.

Author

Protect AI

Publisher

Organization

Last updated

Apr 16, 2026

Quick brief

Use LLM Guard when the immediate job is to scan or sanitize prompts and model outputs for injection attempts, secret leakage, toxicity, or related policy risks. It is most useful as a preflight and postflight safety layer wrapped around LLM calls in applications or agent pipelines. The boundary is prompt and output filtering, not a full security platform, observability stack, or general-purpose agent framework.

Best fit

When to reach for it

Best when the job fits Security & Verification.
Works naturally with Multi-Framework setups.
Requires Python 3.9+, application or agent code that can wrap LLM….
Installation is straightforward: Install with `pip install llm-guard`, choose the input and output scanners that match your risk profile,…

Trust & provenance

Why this listing is credible

Trust status: Security Reviewed.
2.8k GitHub stars on the linked upstream source.
Last updated Apr 16, 2026.

View source ↗ Documentation ↗