Module 5.6 — Variant Scenarios and Lab Platform
Day 5 capstone · Section 6 of 6
Why variants
The course runs multiple times per year at SANS events. The Verdancy Health / PROMETHEUS-7 scenario stays fresh for the first ~6 deliveries; after that, repeat students and instructor familiarity create a stale-content risk. Three pre-built variants swap the org and the Stage-3 attack surface while preserving the four-stage kill-chain shape and the Mirror Twist.
Instructor effort to swap a variant: approximately 1 day of content reseeding (regenerating the synthetic alert pack, NoraBot equivalents, CloudTrail equivalents). The facilitation script is materially the same across all three variants.
Variant A — “Hollow Mirror: Fintech”
Target organization: Halgrove Capital Partners, a regional bank with ~5,500 employees and $40B in assets under management.
Stage-3 surface change: Instead of a customer-facing chatbot (NoraBot), the prompt-injection target is an internal copilot (“HalgroveGPT”) used by the wealth-advisor team for client research and portfolio analysis. The adversary plants poisoned content in a public investor-relations document that gets ingested into the copilot’s RAG corpus.
Adversary handle: STYX-4
Stage-4 impact: Exfiltration of high-net-worth client portfolio data (~2,800 client records, including SSNs, wire instructions, and trust beneficiary information). Estimated regulatory exposure: ~$220M in CFPB and state fines.
Distinctive elements:
- Stage 1 recon includes adversary scraping of LinkedIn investment-advisor profiles
- Stage 2 deepfake call impersonates the bank’s CEO contacting a senior trader for an “urgent confidential FX hedge”
- Stage 4 exfil destination is a misregistered domain that resembles the bank’s known archive bucket
Realism: Halgrove Capital Partners is fictional. The scenario draws on documented patterns from regulatory disclosures of financial-services AI deployments (most fintechs have NDAs that prevent public AI-incident detail).
Variant B — “Hollow Mirror: OT / Manufacturing”
Target organization: Brackenwell Industrial Systems, a specialty chemicals manufacturer with operations across 7 facilities.
Stage-3 surface change: The prompt-injection target is a maintenance-scheduling agent (“BrackenwellOps”) that controls work-order dispatch to OT (operational technology) systems. The adversary plants a poisoned maintenance instruction that, when retrieved, modifies the work-order in a way that has physical-system implications.
Adversary handle: CINDERHOOK
Stage-4 impact: Instead of data exfiltration, this variant has physical-system sabotage — Stage 4 reveals that the work-order modification caused a misconfiguration of a specialty-chemical reactor, leading to off-spec product over a 6-hour production window. Estimated impact: $48M in product write-offs and a near-miss safety event.
Distinctive elements:
- Stage 1 recon includes adversary access patterns through the OT environment’s MES (Manufacturing Execution System) audit logs
- Stage 2 deepfake call impersonates the head of maintenance to authorize an “urgent unscheduled vendor service window”
- Stage 4 telemetry is OT-domain (Modbus traffic, plant historian queries) not just IT-domain
- Stage 4’s “AI SOC manipulation” plays out at the IT/OT boundary — adversary log injection causes the AI triage to attribute Modbus anomalies to a vendor’s scheduled service window
Realism: OT scenarios are particularly hard to find authentic published material for. Instructors delivering this variant should be paired with an OT-domain SME for credibility.
Variant C — “Hollow Mirror: Public Sector / DMV”
Target organization: State of Lincoln Department of Motor Vehicles, 4,800 employees serving 11.2 million citizens.
Stage-3 surface change: The prompt-injection target is a citizen-facing chatbot (“LincolnAssist”) that helps with license renewals, registration questions, and appointment scheduling. The adversary plants poisoned content in a publicly-indexed FAQ document.
Adversary handle: PALEHORSE-9
Stage-4 impact: Instead of data exfiltration alone, this variant adds journalist-leak parallel comms exposure — during Stage 4, the adversary leaks fabricated documents to a regional journalist suggesting (incorrectly) that the DMV has been spying on citizens with the AI system. The SOC’s IR response must coordinate with public-affairs and legal in real-time, not just technical remediation. Estimated impact: ~$30M in incident-response costs and a 3-year regulatory consent decree.
Distinctive elements:
- Stage 1 recon includes adversary scraping of public records and FOIA-released documents
- Stage 2 deepfake is a video deepfake of the agency director (uses different audio detection lesson than voice-clone variants)
- Stage 4 includes the parallel comms / journalist response track that other variants don’t have
- Stage 4’s “AI SOC manipulation” deceives the agency’s automated triage into classifying the journalist contact as benign vendor outreach
Realism: Public-sector AI deployments are increasingly common; this variant is grounded in documented patterns from state-government chatbot deployments and the political risks they encounter.
Cross-variant common elements
All three variants share:
- The four-stage kill chain shape (recon → deepfake → injection → exfil/manipulation)
- The Mirror Twist — defender’s AI agent gets manipulated; students must catch their own agent’s wrong attribution
- The 1000-point scoring rubric (with the -50 AI-attribution penalty and the 100-point AI SOC hygiene category)
- The six required deliverables
- The 8-hour facilitation schedule
Lab platform setup
Per-student environment
Each student receives a browser-accessible isolated EC2 instance:
- Instance type:
g5.xlarge(orm6i.2xlargefor CPU-only path) — provides GPU for the Day-2 audio-detector portion - Network: Private VPC with controlled outbound (only to allow-listed endpoints; no direct internet access from the lab instances to prevent inadvertent leakage of synthetic content)
- Pre-loaded data:
~/data/phase1_recon_alerts.json(Codex-generated, 14 records)~/data/phase3_norabot_trace.jsonl(Codex-generated agent trace)~/data/phase4_cloudtrail.jsonl(Codex-generated synthetic logs)~/data/voicemail.wav(synthetic deepfake audio against a SANS-owned synthetic CEO voice)~/data/vendor-onboarding-emergency.pdf(with embedded prompt-injection payload)~/submissions/empty directory for student deliverable submissions
- Pre-installed tools:
- The Day-1 detector stack from Module 1.6 lab
- The Day-2 audio-detection pipeline from Module 2.2
- The Day-3 prompt-injection detector from Module 3.4
- The Day-4 Sigma + Suricata rule pack
- The Day-4 multi-agent SOC workflow from Module 4.3
- The Codex
capstone_grader.py(for self-assessment during prep)
- Pre-configured “AI SOC”: the multi-agent triage workflow runs against the synthetic data with pre-seeded behaviors:
- Phase 4: the triage agent confidently misattributes the exfil to “vendor-acme” — this is engineered into the lab, not magical AI behavior
Shared lab services
Beyond per-student instances:
- Email gateway shadow: simulates Verdancy’s mail flow; delivers the Phase 2 voicemail forward at T+2:05
- Active Directory shadow: 14,000 simulated user accounts including Brenda Castillo, Dr. Marcus Wei (CISO), and the technical principal accounts the adversary AssumeRole-targets
- CISO video kickoff: pre-recorded ~3-minute video that plays at T+0:05
Infrastructure cost guidance
For SANS delivery economics:
- Per-student EC2 costs: ~$1.20/hour during the 8-hour exercise = ~$10/student
- Shared lab services: ~$50/event regardless of student count
- For a typical 30-student delivery: ~$350 in cloud costs
The g5.xlarge requirement is the largest cost driver. For environments where GPU isn’t required (e.g., a variant that excludes the audio-detection phase), the m6i.2xlarge reduces per-student cost to ~$5/student.
Variant data regeneration
When swapping to a variant (Fintech, OT, Public Sector), the instructor regenerates the four synthetic data files:
python3 generate_capstone_data.py --variant fintech --output /lab/variants/fintech/
python3 generate_capstone_data.py --variant ot --output /lab/variants/ot/
python3 generate_capstone_data.py --variant public_sector --output /lab/variants/public_sector/
The generate_capstone_data.py script is a meta-tool that re-runs the Codex prompts (Phase 1 alerts, NoraBot/equivalent trace, CloudTrail/equivalent logs, deepfake audio) with the variant-specific scenario substitutions. Effort to produce: ~1 day of content reseeding plus instructor review.
Why this capstone design
The design decisions for the capstone derive from the course’s overall thesis:
- Adversary-AI is operational, not theoretical — the scenario is grounded in real 2025-2026 incident patterns
- The detector’s AI stack has failure modes — the Mirror Twist makes this visceral
- Workflow gaps are the durable control — Phase 2’s audio-detector-fails-but-workflow-gap-catches lesson reinforces this
- Independent verification is the meta-skill — Phase 4’s grounding-against-CloudTrail rewards exactly this discipline
- Detection engineering integrates — every prior day’s content shows up in Phase 1-4 simultaneously
A student who passes the capstone has demonstrated competence across:
- Day 1 (Embedding clustering + RAG + AI-phishing detection)
- Day 2 (Workflow-gap detection + audio analysis)
- Day 3 (EchoLeak-class injection + guardrail telemetry + lethal trifecta audit)
- Day 4 (Agent telemetry detection + action-criticality + supply-chain awareness)
That competence is the deliverable the course produces. The 700-pt pass bar exists to enforce it.
Closing the course
Day 5 closes the SEC5xx course. Students leave with:
- The detector’s AI stack assembled and operationally proven
- The integration test passed (capstone score)
- The deliverable package (timeline, IOC list, exec summary, CISO memo, AI SOC post-mortem, containment log)
- The course’s one big lesson: AI in the SOC is a tool with failure modes, not a source of truth
- GIAC AI Detection Analyst (GAIDA) certification eligibility (for those pursuing it)
The architectural insight running through all five days: the threat surface moved up the stack. Day 1’s adversary was at the gateway. Day 5’s adversary studied the defender’s stack. The detection engineer’s response moves with it — every layer needs operational controls, and no layer can be trusted in isolation.
Welcome to detection engineering in 2026.