November 5, 2025
ChatGPT launch • Mass consumer adoption
Understanding risk through system architecture
Breaking guardrails
Getting models to do what they're trained not to do
Hijacking control flow
Extracting privileged data or taking unauthorized actions
Not connected to external data or tools
Retrieval-Augmented Generation: Connected to knowledge bases
Meta AI publicly endorsed this framework (October 31, 2025)
Yes, through architectural design patterns
Private Data
Customer PII, transactions, credentials
Untrusted Content
Web, emails, user uploads, third-party feeds
Exfiltration/Action
External APIs, email, payments, case writes
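As a minimal sketch of the three conditions listed above, the following Python models a deployment as three boolean flags and reports whether all three are present at once. The class, field, and function names are hypothetical and chosen only for illustration; they are not part of the framework itself, and the sketch only detects the combination without prescribing a response to it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AgentCapabilities:
    """Hypothetical description of what one AI deployment can touch."""
    private_data: bool         # e.g. customer PII, transactions, credentials
    untrusted_content: bool    # e.g. web, emails, user uploads, third-party feeds
    exfiltration_action: bool  # e.g. external APIs, email, payments, case writes


def conditions_present(caps: AgentCapabilities) -> list[str]:
    """Return the names of the conditions this deployment satisfies."""
    flags = {
        "private_data": caps.private_data,
        "untrusted_content": caps.untrusted_content,
        "exfiltration_action": caps.exfiltration_action,
    }
    return [name for name, present in flags.items() if present]


def all_three_present(caps: AgentCapabilities) -> bool:
    """True only when every one of the three conditions holds at once."""
    return len(conditions_present(caps)) == 3


# Illustrative deployments (assumed, not taken from the source).
knowledge_bot = AgentCapabilities(
    private_data=True, untrusted_content=True, exfiltration_action=False
)
email_agent = AgentCapabilities(
    private_data=True, untrusted_content=True, exfiltration_action=True
)

print(all_three_present(knowledge_bot))  # False
print(all_three_present(email_agent))    # True
```

Keeping the flags explicit per deployment makes it easy to see which single capability would have to be removed or gated to break the combination.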
If all three conditions are present: