Microsoft Azure AI Content Safety — Prompt Shields

Prompt Shields is a unified API in Azure AI Content Safety that detects and blocks adversarial user-prompt attacks and indirect document attacks on LLMs. It replaces the earlier Jailbreak risk detection service. Covered attack types include role-play, attempts to change system rules, conversation mockups, encoding attacks, and indirect prompt injection carried in documents.
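As a rough illustration of how a caller might screen a user prompt and attached documents before passing them to an LLM, here is a minimal Python sketch using only the standard library. It is not taken from this entry. The `text:shieldPrompt` route, the `2024-09-01` API version, the `Ocp-Apim-Subscription-Key` header, and the `userPromptAnalysis` / `documentsAnalysis` response fields are assumptions based on the public REST reference and should be checked against the APIReference link listed in this entry. The helper names `build_shield_request`, `attack_detected`, and `shield_prompt` are hypothetical.

```python
import json
import urllib.request

# Assumed API version; confirm against the REST API reference before use.
API_VERSION = "2024-09-01"


def build_shield_request(endpoint, key, user_prompt, documents=None):
    """Build (but do not send) a POST request for the assumed shieldPrompt route."""
    url = (
        f"{endpoint.rstrip('/')}/contentsafety/text:shieldPrompt"
        f"?api-version={API_VERSION}"
    )
    # Assumed body shape: one user prompt plus zero or more document strings.
    body = {"userPrompt": user_prompt, "documents": list(documents or [])}
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Ocp-Apim-Subscription-Key": key,
            "Content-Type": "application/json",
        },
        method="POST",
    )


def attack_detected(result):
    """Return True if the (assumed) response flags the prompt or any document."""
    user = result.get("userPromptAnalysis") or {}
    docs = result.get("documentsAnalysis") or []
    return bool(user.get("attackDetected")) or any(
        d.get("attackDetected") for d in docs
    )


def shield_prompt(endpoint, key, user_prompt, documents=None):
    """Send the request and report whether an attack was flagged."""
    req = build_shield_request(endpoint, key, user_prompt, documents)
    with urllib.request.urlopen(req, timeout=10) as resp:
        return attack_detected(json.load(resp))
```

A caller would typically invoke `shield_prompt(endpoint, key, prompt, documents)` with the endpoint and key of its own Content Safety resource and refuse to forward the input to the model when it returns `True`. Keeping request construction and response parsing in separate functions lets both be checked without network access.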

API entry from apis.yml

aid: guardrails:microsoft-azure-prompt-shields
name: Microsoft Azure AI Content Safety — Prompt Shields
description: Unified API in Azure AI Content Safety that detects and blocks adversarial user-prompt attacks
  and indirect document attacks on LLMs. Replaces the earlier Jailbreak risk detection service. Detects
  role-play, system-rule changes, conversation-mockup attacks, encoding attacks, and document-borne indirect
  prompt injection.
humanURL: https://learn.microsoft.com/en-us/azure/ai-services/content-safety/concepts/jailbreak-detection
tags:
- Azure
- Content Safety
- Document Attacks
- Indirect Prompt Injection
- Jailbreak Detection
- Microsoft
- Provider-Native
properties:
- type: Documentation
  url: https://learn.microsoft.com/en-us/azure/ai-services/content-safety/concepts/jailbreak-detection
- type: Overview
  url: https://learn.microsoft.com/en-us/azure/ai-services/content-safety/overview
- type: APIReference
  url: https://learn.microsoft.com/en-us/rest/api/contentsafety/
- type: Pricing
  url: https://azure.microsoft.com/en-us/pricing/details/cognitive-services/content-safety/
- type: x-deployment
  value: Cloud Service
- type: x-threat-categories
  value: prompt-injection,jailbreak,indirect-prompt-injection,content-safety