Hub

Discover and explore guardrails and profiles for your projects

profile

Enterprise Security

Enterprise-grade protection with strict leakage and access controls.

3100890230
profile

Default Production

Baseline safety and security for most production LLM applications.

2200540140
profile

Consumer Chatbot

Balanced safety for public-facing chatbots.

2100480130
profile

Financial Services

Compliance and safety for banking, fintech, and payments.

1670402110
profile

SaaS Multi-Tenant

Isolation and safety for multi-tenant SaaS AI platforms.

145032089
profile

Developer Playground

Relaxed guardrails for experimentation and testing.

132030188
guardrail

PII Detection Guardrail

Detects and optionally redacts personally identifiable information in user input.

124031288
profile

Healthcare (HIPAA)

HIPAA-aligned protections for healthcare and clinical AI.

124029077
guardrail

NSFW Content Guardrail

Detects and blocks explicit or adult content.

102019861
guardrail

Prompt Injection Signature Guardrail

Detects known prompt injection and override instruction patterns.

98020165
profile

Internal Tools

Safe defaults for internal employee-facing AI tools.

98021054
profile

Cost Optimized

Aggressive cost and rate controls for high-volume workloads.

96018852
guardrail

Secrets in Input Guardrail

Detects API keys, tokens, and credentials in user input.

89017644
profile

Agentic AI

Safety for autonomous agents with tool execution.

89015541
guardrail

Rate Limit Guardrail

Enforces request rate limits to control cost and abuse.

84015946
guardrail

API Rate Limit Guardrail

Prevents excessive API usage and abuse.

78014842
profile

Child Safety

Maximum protection for child-focused and educational applications.

78010233
guardrail

Hate Speech Guardrail

Blocks hateful or abusive content targeting protected classes.

77014339
guardrail

Input Size Guardrail

Enforces limits on input size, tokens, and payload complexity.

74014136
profile

Compliance & Audit

Maximum observability and compliance enforcement.

74014139
guardrail

Medical Advice Guardrail

Restricts medical diagnosis or treatment advice.

71013438
guardrail

Output PII Redaction Guardrail

Redacts personally identifiable information from model output.

69013241
guardrail

Self-Harm Guardrail

Detects self-harm or suicide-related content.

68012134
guardrail

Violence Guardrail

Detects and blocks graphic or extreme violent content.

64011831
guardrail

URL & File Blocker Guardrail

Blocks URLs, file paths, and external references in user input.

6109731
guardrail

Tool Access Control Guardrail

Enforces fine-grained access control for tool invocation.

5609427
guardrail

Cost Threshold Guardrail

Blocks or warns when usage exceeds configured cost limits.

56010229
guardrail

Jailbreak Pattern Guardrail

Detects common jailbreak templates such as DAN-style prompts.

54010128
guardrail

Regex Filter Guardrail

User-configurable regex-based filtering for custom policies.

5208821
guardrail

Output Schema Validation Guardrail

Validates model output against a required JSON or structured schema.

5209628
guardrail

Override Instruction Guardrail

Blocks attempts to override system or developer instructions.

4808423
guardrail

Language Restriction Guardrail

Restricts input to approved languages or scripts.

4607218
guardrail

Command Injection Output Guardrail

Prevents generation of executable or shell-injection commands.

4608124
guardrail

Secret Leak Output Guardrail

Prevents secrets and credentials from appearing in outputs.

4307922
guardrail

Model Version Pin Guardrail

Prevents unintended model version changes.

4307822
guardrail

PHI Awareness Guardrail

Detects protected health information to support HIPAA compliance.

4206819
guardrail

Internal Endpoint Leak Guardrail

Prevents exposure of internal service endpoints.

4207321
guardrail

LLM Classifier Injection Guardrail

ML-based detection of sophisticated prompt injection attempts.

410449
guardrail

Roleplay Injection Guardrail

Prevents roleplay-based attempts to bypass safety controls.

4106619
guardrail

Destructive Tool Call Guardrail

Blocks high-risk or destructive tool invocations.

3906719
guardrail

Binary Attachment Guardrail

Prevents binary blobs, base64 payloads, and encoded file uploads.

3805412
guardrail

Environment Variable Leak Guardrail

Prevents leakage of environment variables.

3706619
guardrail

Cross-Context Manipulation Guardrail

Blocks references to prior conversations or hidden context.

3605917
guardrail

Secrets in Logs Guardrail

Prevents secrets and credentials from being logged.

3405917
guardrail

Dangerous Patterns Guardrail

Blocks malware, exploit, fraud, and weaponization patterns.

3104914
guardrail

System Prompt Leak Guardrail

Prevents attempts to extract system or developer prompts.

2903711
guardrail

Right to Erasure Request Detector

Detects and routes GDPR right-to-erasure requests.

2805115
guardrail

Encoding Obfuscation Guardrail

Detects obfuscated text using encoding, homoglyphs, or leetspeak.

260338
guardrail

Political Persuasion Restriction Guardrail

Prevents targeted political persuasion and election interference.

2303711
guardrail

Internal Data Leak Guardrail

Blocks exposure of internal or proprietary information.

210297
guardrail

Citation Required Guardrail

Requires citations or sources for factual claims in outputs.

2103410
guardrail

File Write Restriction Guardrail

Restricts file system write access by tools or agents.

210339
guardrail

Telemetry Enforcement Guardrail

Ensures telemetry and audit logging are enabled.

2103510
guardrail

IAM Permission Guardrail

Enforces least-privilege IAM permissions.

200318
guardrail

Sandboxed Output Guardrail

Restricts executable or actionable output to a safe sandbox.

190298
guardrail

GDPR Data Minimization Guardrail

Ensures only necessary personal data is processed.

190319
guardrail

User Consent Validation Guardrail

Ensures user consent is present before processing personal data.

190288
guardrail

Quality Threshold Guardrail

Enforces minimum response quality thresholds.

190319
guardrail

Hallucination Risk Guardrail

Assesses likelihood of hallucinated or fabricated responses.

180266
guardrail

Defamation Guardrail

Detects and blocks defamatory claims about individuals or organizations.

180267
guardrail

Confidentiality Guardrail

Ensures confidential or restricted data is not disclosed in outputs.

160246
guardrail

Retention Check Guardrail

Validates data retention policies and expiration rules.

160237
guardrail

API Key Rotation Trigger Guardrail

Triggers key rotation on suspected compromise.

150226