The AI Security Landscape
OWASP Top 10 for LLM Applications, MITRE ATLAS, the AI Incident Database, and the CIA triad applied to AI systems.
Prompt Injection — The #1 Threat
Direct and indirect injection, the confused deputy problem, and real-world incidents.
Jailbreaking & Guardrail Bypass
Crescendo attacks, many-shot jailbreaking, DAN role-play, and encoded payloads.
Data Poisoning & Training-Time Attacks
Sleeper agents and PickleRAT supply chain attacks, with safetensors and model signing as defenses.
Adversarial Machine Learning
FGSM, PGD, and C&W attacks, evasion of safety classifiers, and transferability of adversarial examples across models.
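
As a concrete taste of the adversarial machine learning module, here is a minimal sketch of FGSM (the fast gradient sign method) against a toy logistic-regression classifier. The weights, input, label, and step size `eps` are made-up illustration values, and the helper names `predict` and `fgsm` are hypothetical; real attacks target deep networks through an autodiff framework rather than a hand-derived gradient.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict(w, b, x):
    """Probability of class 1 under a logistic-regression model."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


def fgsm(w, b, x, y, eps):
    """One FGSM step: move x by eps in the sign of the loss gradient.

    For binary cross-entropy on logistic regression, the gradient of the
    loss with respect to the input is (p - y) * w, so no autodiff is needed.
    """
    p = predict(w, b, x)
    grad = [(p - y) * wi for wi in w]
    sign = [(g > 0) - (g < 0) for g in grad]
    return [xi + eps * s for xi, s in zip(x, sign)]


# Toy model and input (assumed values, for illustration only).
w, b = [2.0, -1.0], 0.0
x, y = [0.5, 0.2], 1

x_adv = fgsm(w, b, x, y, eps=0.5)
print(predict(w, b, x) > 0.5)      # clean input is classified as class 1
print(predict(w, b, x_adv) > 0.5)  # perturbed input is no longer class 1
```

PGD can be seen as this same step applied repeatedly with a smaller step size, projecting back into an `eps`-ball around the original input after each iteration.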