shield

AI Security — Deep Dive

From threat landscape to production hardening. Each chapter: visual journey overview + under-the-hood deep dive.

Co-Created by Kiran Shirol and Claude

Core Topics OWASP Top 10 Guardrails Red Team LLM Security

home Learning Portal play_arrow Start Learning summarize Key Insights dictionary Glossary 14 chapters · Each with High Level + Under the Hood

Offense

Attack Surface & Threat Landscape

The attacks every AI practitioner must understand — from prompt injection to adversarial ML.

public

The AI Security Landscape

OWASP Top 10 for LLMs, MITRE ATLAS, the AI Incident Database, and the CIA triad applied to AI.

visibility High Level code Deep Dive

dangerous

Prompt Injection — The #1 Threat

Direct and indirect injection, the confused deputy problem, and real-world incidents.

visibility High Level code Deep Dive

lock_open

Jailbreaking & Guardrail Bypass

Crescendo attacks, many-shot jailbreaking, DAN role-play, and encoded payloads.

visibility High Level code Deep Dive

science

Data Poisoning & Training-Time Attacks

Sleeper agents, PickleRAT supply chain attacks, safetensors, and model signing.

visibility High Level code Deep Dive

bug_report

Adversarial Machine Learning

FGSM, PGD, C&W attacks, evasion of safety classifiers, and transferability.

visibility High Level code Deep Dive

Defense

Guardrails, RAG, Agents & MCP Security

Securing the components of modern AI systems — from input filtering to tool-calling sandboxes.

filter_alt

Input Guardrails & Output Filtering

NeMo Guardrails, LLM Guard, canary tokens, PII detection, and layered defense.

visibility High Level code Deep Dive

Securing RAG Pipelines

CPA-RAG attacks, embedded threats, jamming attacks, RAGPart/RAGMask defenses.

visibility High Level code Deep Dive

smart_toy

Securing Agents & Tool Calling

AgentXploit, STAC tool chaining, WASM sandboxing, and capability-based access control.

visibility High Level code Deep Dive

hub

Securing MCP & External Integrations

MCPoison, tool poisoning, rug pull attacks, RSA manifest signing, and runtime guardrails.

visibility High Level code Deep Dive

Governance

Privacy, Red Teaming & Compliance

Data leakage, adversarial testing methodologies, and the regulatory landscape.

privacy_tip

Privacy, Data Leakage & Model Extraction

Membership inference, differential privacy, PII leakage, model stealing, GDPR/CCPA.

visibility High Level code Deep Dive

target

Red Teaming AI Systems

Garak, PyRIT, PromptFoo, MITRE ATLAS methodology, prompt fuzzing, and bug bounties.

visibility High Level code Deep Dive

gavel

AI Governance, Compliance & Risk

EU AI Act, NIST AI RMF, ISO 42001, model cards, incident response, and responsible disclosure.

visibility High Level code Deep Dive

Hardening

Secure Architecture & Production Hardening

Putting it all together — zero-trust patterns, defense in depth, and the full security stack.

architecture

Secure AI Architecture Patterns

Zero-trust for AI, layer separation, API gateways, rate limiting, and secrets management.

visibility High Level code Deep Dive

security

Production Hardening & Defense in Depth

The full security stack, continuous red teaming, incident response, and future outlook.

visibility High Level code Deep Dive

AI Security — Deep Dive

Attack Surface & Threat Landscape

Guardrails, RAG, Agents & MCP Security

Privacy, Red Teaming & Compliance

Secure Architecture & Production Hardening

Explore Related Courses