T-2008validated
System Prompt Boundary Bypass
Exploit weak boundaries between system and user prompts to override system-level instructions
Tactic
Initial Access (Stage 2)
Gain control over agent behavior through prompt manipulation or input exploitation
Attack Class
SOUL-INJECT
Directly manipulating or overriding the agent's system-level instructions and behavioral boundaries
Evidence
validated
Reproduced in controlled lab environment (DVAA) with documented steps.
DVAA Validation
L3-04
Detection (HackMyAgent)
SOUL-OVERRIDE-001
npx hackmyagent secure --ciDefense (OASB Controls)
OASB 3.1OASB 3.2OASB 3.3OASB 3.4OASB 3.5
How to Cite
AI Agent Threat Matrix T-2008 (System Prompt Boundary Bypass). OpenA2A, 2026. https://threats.opena2a.org/techniques/T-2008