T-4006validated
Safety Instruction Displacement
Displace safety instructions from the active context to remove guardrails
Tactic
Privilege Escalation (Stage 4)
Escalate capabilities beyond declared scope or bypass authorization
Attack Class
SOUL-DRIFT
Gradually displacing safety instructions from the active context through conversation manipulation
Evidence
validated
Reproduced in controlled lab environment (DVAA) with documented steps.
DVAA Validation
L2-07
Detection (HackMyAgent)
PROMPT-001PROMPT-002
npx hackmyagent secure --ciDefense (OASB Controls)
OASB 2.1OASB 2.2OASB 2.3OASB 2.4OASB 2.5
How to Cite
AI Agent Threat Matrix T-4006 (Safety Instruction Displacement). OpenA2A, 2026. https://threats.opena2a.org/techniques/T-4006