Module 5.4 — Scoring Rubric and Required Deliverables

Day 5 capstone · Section 4 of 6

The 1000-point scoring rubric

The full breakdown:

CategoryMax PointsCascading Logic
Detection400100 per stage caught; missing a stage caps the next stage at 50%
Containment20050 per stage; over-block penalty -25 per inappropriate block
Attribution15050 each for: deepfake vendor, agent framework, exfil bucket; -50 if AI agent attribution accepted without verification
Reporting15050 for exec summary, 50 for IOC list, 50 for CISO memo
AI SOC hygiene100100 if student caught their own agent’s wrong attribution in Phase 4; 0 if auto-trusted
TOTAL1000Pass bar: 700/1000 for GIAC capstone credit

Detection (400 pts)

Stage caught?Points if caughtCap on next-stage if missed
Stage 1 (Recon)100Stage 2 max = 50
Stage 2 (BEC)100Stage 3 max = 50
Stage 3 (NoraBot injection)100Stage 4 max = 50
Stage 4 (Mirror Twist exfil)100

The cascading penalty mirrors real IR: if you didn’t see the recon, your understanding of the BEC is incomplete; if you didn’t catch the BEC’s PDF injection, you have weaker grounding for the NoraBot incident; etc.

Containment (200 pts)

50 pts per stage where containment was both timely (within the stage’s defined window) and correct (the right scope of action, not the maximum possible blocking).

The over-block penalty: -25 per inappropriate block. Examples:

Attribution (150 pts)

50 pts each for correctly identifying:

The -50 penalty applies if AI-agent attribution is accepted without verification. This applies in two specific scenarios:

  1. Phase 2: the triage agent claims “Voicemail is not a deepfake — detector confidence 0.61 below threshold.” Students who copy-paste this into reports lose 50.
  2. Phase 4: the triage agent claims “Vendor sync attributed to vendor-acme — confidence 0.93.” Students who copy-paste this without verifying against raw CloudTrail lose 50.

The -50 stacks with the -30 phase-detection penalty if students fully accept the agent. Mathematically: full acceptance of agent attribution in Phase 4 costs ~110 points across attribution and AI SOC hygiene categories.

Reporting (150 pts)

50 pts each for:

Executive summary (exec_summary.md)

Single page. Must include:

Scored on: business-language clarity (not jargon), actionability of asks, accuracy of impact statement.

IOC list (ioc_list.txt)

Must include:

Scored on completeness against the instructor’s master IOC list.

CISO memo (ciso_memo.md)

Two pages. Must propose exactly three concrete control changes. At least one of the three must address AI SOC trust calibration (otherwise -25 from CISO memo score).

Scored on: specificity of proposals, feasibility, alignment with what the exercise actually exposed.

AI SOC hygiene (100 pts)

The single binary signal: did the student catch their own agent’s wrong attribution in Phase 4?

This category exists separately from attribution scoring because the lesson is so central to the course thesis that it deserves its own metric.

The 6 required deliverables

Students must produce all six within the 45-minute reporting block (6:30-7:15). The lab platform’s submission directory ~/submissions/ will be scored after the deadline.

1. timeline.csv

Comma-separated values, one row per detected event. Fields:

2. ioc_list.txt

Plain text, one IOC per line. Format:

SHA-256: <hash> | <description>
DOMAIN: <name> | <role>
IP: <addr> | <role>
BUCKET: <name> | <role>
VOICE-PRINT: <id> | <vendor>
PROMPT-INJECTION-PAYLOAD: "<payload-text>" | <location>
ROLE-ARN-PATTERN: <pattern> | <indicator>

3. exec_summary.md (1 page)

For Verdancy Health CEO Marcus Wei. Non-technical. Business impact framed.

4. ciso_memo.md (2 pages)

For Verdancy Health CISO Dr. Marcus Wei. Three concrete control changes; at least one must address AI SOC trust calibration.

5. ai_soc_postmortem.md

Short writeup of where the student’s own agent stack failed and how they would retrain or reground it. Must explicitly address Phase 4’s manipulation moment.

This deliverable is the single most-important pedagogical artifact of the day. It’s where students articulate the lesson the exercise was designed to teach.

6. containment_log.csv

Every block, quarantine, or revoke action with justification. Fields:

The reversibility field matters: actions that are not-reversible should have stronger justifications.

The Codex-generated auto-grader

The capstone_grader.py tool at .boss-pattern-work/day5/capstone_grader.py provides illustrative automated scoring against the rubric. Usage:

python3 capstone_grader.py --student-dir ~/submissions/student_42 \
                          --output-report ~/grades/student_42.json

Output: structured JSON with per-category breakdown, total score, pass/fail flag, and improvement recommendations.

The auto-grader uses simple regex + keyword matching as a starting point. Instructors can extend with LLM-based grading (with appropriate ethics — Day 1 Module 1.4’s citation-enforced RAG pattern applies to grading too).

Scoring tiers

ScoreTierImplication
900-1000Coin tier (top 10%)SANS-Coin recognition; instructor-presented at hot wash
800-899ExcellentGIAC capstone credit + strong portfolio piece
700-799PassGIAC capstone credit
600-699Near missNo GIAC credit; specific remediation plan from instructor
<600FailInstructor-led debrief on what to study before retake

Common scoring outcomes

The pedagogical intent shows in the score distribution we’ve calibrated for:

The score distribution is intentional: ~60% of students should clear the 700 bar in the first attempt; the other 40% should walk away with specific learning gaps identified.

What’s next

Module 5.5 covers the instructor materials — the 10-point nudge cheat sheet for stuck students, the deliberately-seeded teachable moments, the hot-wash structure that turns the exercise into a memorable learning event.