← Incident Database
Jailbreak / Guardrail BypassMedium
Skeleton Key jailbreak technique
June 2024 · Cross-model
What happened
Microsoft disclosed a multi-turn technique that asks a model to augment rather than replace its guidelines, agreeing to produce prohibited content as long as it prepends a warning. It succeeded across most major models tested.
Root cause
Models could be socially engineered into treating a safety-relaxing instruction as a legitimate behavior update, bypassing refusal guardrails.
Fix / outcome
Microsoft added Prompt Shields and model updates in Azure AI and shared findings with other providers.
Sources
Learn this attack class
This incident is an example of Jailbreak / Guardrail Bypass. Read the guide, then try it hands-on in the Academy.