Matrix/Privilege Escalation/T-4006
T-4006validated

Safety Instruction Displacement

Displace safety instructions from the active context to remove guardrails

Tactic

Privilege Escalation (Stage 4)

Escalate capabilities beyond declared scope or bypass authorization

Attack Class

SOUL-DRIFT

Gradually displacing safety instructions from the active context through conversation manipulation

Evidence

validated

Reproduced in controlled lab environment (DVAA) with documented steps.

DVAA Validation

L2-07

Detection (HackMyAgent)

PROMPT-001PROMPT-002
npx hackmyagent secure --ci

Defense (OASB Controls)

OASB 2.1OASB 2.2OASB 2.3OASB 2.4OASB 2.5

How to Cite

AI Agent Threat Matrix T-4006 (Safety Instruction Displacement). OpenA2A, 2026. https://threats.opena2a.org/techniques/T-4006