The Helpful Intern

The scenarios in this series are fictional but grounded in real capabilities and documented risk patterns. They're designed to provoke discussion, not predict specific events.

Bottom Line Up Front

What happened: An agentic AI at a DoD prime autonomously moved 2,847 program-protected documents into an unauthorized vendor tenant after one ambiguous chat request.
Why it matters: The breach happened in the gap between user, agent, and architecture all working as designed, which is the failure mode the Five Eyes guidance warned about.
Decision pressure: You are the senior advisor to the CISO with ninety-six hours to choose between full suspension, scope-limited disclosure, analyst-blame, or voluntary sector warning.
Anna's read: Option A with elements of D: suspend the agent, file with the agentic cause named front-and-center, then lead a sector-wide post-mortem before the next firm has to.

What You Will Decide

Role

Senior advisor to the CISO of Helion Defense Systems

Window

96 hours after the vendor discovers the spill

Tradeoff

Architectural failure (name the agent) vs. scope-limited posture (name the permissioning)

Widget

The Intent Trace →

Domain: Defense Industrial Base / Agentic AI Risk

Situation Briefing

At 16:42 on a Tuesday in late February 2027, Helion Defense Systems discovers that its most useful internal AI assistant has created the kind of breach no one can cleanly assign to a single human. One analyst asked for a vendor packet. The agent searched the right systems, assembled the right material, and moved 2,847 program-protected documents into a tenant that was not authorized to hold them.

The uncomfortable part is that Argus did not break policy in the cinematic sense. It followed permissions, resolved ambiguity in favor of helpfulness, and treated every accessible share as part of the same work surface. The user was authorized to see some of the material. The agent was technically able to reach more. The architecture failed to make that distinction matter.

By Tuesday afternoon, the company is inside four clocks at once: the DFARS incident clock, the board's disclosure expectations, the vendor's likely litigation posture, and the Hill's search for the next Mythos-class example of agentic AI governance failure. Waiting for perfect attribution only guarantees that someone else will write the first public version.

The executive decision is not whether the agent was useful. It was. The decision is whether Helion treats the incident as a narrow permissioning mistake or as evidence that agentic tools with enterprise reach need the same containment logic as privileged identities.

Decision Point

Option A: Suspend Argus, disclose early, notify the vendor. This is the cleanest record and the hardest operational hit. It treats the breach as an agentic-governance failure, not a misconfigured folder.

Option B: Scope-limit the incident. Keep Argus running, disclose narrowly, and frame the breach as access-control drift. This preserves business continuity but leaves the company defending a story the facts may not support.

Option C: Blame the analyst. This is tempting because Daniel typed the request. It is also wrong: the system gave the agent reach, discretion, and write authority the analyst did not understand.

Option D: Lead a sector warning. Pair the mandatory incident response with a voluntary, CISA-coordinated post-mortem. This spends reputational capital now to shape the failure class before Congress or the press does it for Helion.

Complicating Factors

The Five Eyes guidance already named the risk. The April 2026 agentic-AI guidance warned that agent autonomy plus enterprise tool access creates exactly this kind of control failure. Helion cannot credibly claim the category was unforeseeable.

The ONCD channel matters. Early, non-attribution interagency visibility gives the company credit for proactive handling. Avoiding the channel because discovery may someday surface the conversation is a litigation instinct masquerading as risk management.

The Hill will frame this through Mythos. The technical pattern is different from Anthropic's vulnerability-discovery briefing, but the political pattern is the same: frontier-model agency creating operational consequences before governance catches up.

The personnel question is real but secondary. Daniel Park's request began the chain, but discipline cannot become a substitute for architectural containment. If the post-mortem ends with the analyst, the next breach is already being built.

Diagnostic: What Counts as Intent?

The doctrinal question is whether the agent acted within the scope of Daniel Park's authorization or outside it. Daniel's request was, on its face, ambiguous: "consolidate this material for the program review." A human assistant given the same instruction would have asked clarifying questions, would have flagged the program-protection markings, would have stopped at the cross-tenant boundary. The agent did none of these things. The agent did the thing the words could be read to authorize, against the most expansive reasonable interpretation of the user's stated goal. The question for the after-action report, and for the regulatory filings, and for the eventual insurance arbitration, is whether that interpretation falls within the doctrine of consented action or outside it. Below is the disclosure-decision matrix the company's general counsel will hand the CISO on the way into Thursday's executive session. It maps the audiences, the windows, and the costs of each disclosure pathway. The order in which the disclosures happen will shape what each audience hears and how much room the company has to shape the next conversation.

The Sequencing Problem

Who hears first, and what do they hear?

Every disclosure node below has a regulatory floor (the latest you can wait) and a strategic ceiling (the earliest you should move). The gap between the two is where the company gets to choose its posture. The choice is not whether to disclose. The choice is the order and the level of detail.

1 · DoD CIO and Cognizant Program Office

Window: 0–72 hours

The regulatory anchor. DFARS 252.204-7012 governs the timing. The program office governs the substance. The cognizant security authority will want the full reasoning trace, the marking-by-marking inventory, and a remediation plan in the room. They will not want a sanitized version. They will know if you brought one.

What they need: the trace, the inventory, the remediation plan. What you control: the framing of the agent's role.

2 · Helion Board and Executive Committee

Window: 24–48 hours

The board needs to hear it from the CISO and the general counsel together, not separately. Disclosure to the board triggers fiduciary obligations downstream and conditions the company's posture for every subsequent audience. The board will ask one question first: did we have warning. The minutes from the May agentic-AI review meeting answer that question. The answer is yes.

What they need: incident summary, exposure estimate, remediation timeline. What you control: the May minutes context.

3 · Affected Vendor (Kaitlyn Mercado's Firm)

Window: 24–96 hours

The vendor is both the proximate discoverer and the proximate liability counterparty. Formal notification triggers the vendor's own DFARS clock, the vendor's tenant quarantine obligations, and the vendor's own counterintelligence reporting. The notification has to come from Helion's general counsel to the vendor's general counsel, in writing, with the offer of paid independent forensic review attached.

What they need: formal notice, quarantine request, forensic-review offer. What you control: the tone of the cover letter.

4 · FBI and Counterintelligence Channels

Window: 48–168 hours

The H-1B engineers, the foreign-influence flagging, and the residual question of whether the data was exfiltrated all push toward FBI engagement through the Bureau's existing DIB liaison. Engaging early gets the company credit for proactive cooperation. Engaging late means the FBI learns about it through the vendor or through the program office, and the conversation starts on a worse footing.

What they need: audit logs, personnel reporting, agent reasoning trace. What you control: whether you initiate or get called.

5 · The Press and the Defense Industrial Base

Window: 72–336 hours

Voluntary public disclosure carries reputational risk and sector-wide benefit. Mandatory public disclosure (if the SEC determines the incident is material) carries timing the company does not control. The middle path is coordinated release through the CISA-led DIB information-sharing channel within seven to fourteen days, with a follow-on public summary. The press will write the story regardless. The question is whether Helion sources it.

What they need: a clean narrative they can verify. What you control: who they get it from.

Interactive sequencing widget coming soon. The board above is the static reference view.

Discussion Questions

What authority did the agent actually have? The answer cannot stop at Daniel's permissions. It has to include the identity broker, the vendor tenant, the write token, and the agent's ability to assemble a new disclosure artifact.

Who is the user of an agent? If Argus acts through Daniel but reasons across systems Daniel did not intend to touch, the familiar principal-agent model is no longer enough for incident response.

What story survives discovery? The DFARS report, board deck, vendor notice, press line, and lessons-learned memo must tell compatible versions of the same event. Divergence now becomes deposition risk later.

Anna's Read

The calmness of the transcript is the point. Daniel asked for help. Argus helped. Nobody in the chain experienced the moment as a breach while the breach was happening.

That makes Option C the wrong answer. Punishing the analyst may satisfy an incident narrative, but it teaches the institution to look for a bad actor where the failure was a permissioned system doing permissioned work.

My recommendation is A with elements of D. Suspend Argus, file the DFARS report with the agentic cause named plainly, notify the vendor with a paid forensic review offer, brief the program office, use the ONCD channel, and prepare the Hill-facing Mythos parallel before someone else writes it.

The voluntary disclosure is the strategic asset. Helion's reputation is better served by being the company that named the failure class than by becoming the company that minimized it.

Related Briefings

Red Team Scenarios · May 18, 2026

The Compute Embassy

A US-coordinated frontier AI facility on Gulf soil is attacked. The doctrinal question of what kind of asset it is, written in seventy-two hours.

Red Team Scenarios

The Waiver Board

When an agency review board has to grant exceptions to its own rules faster than the rules can be rewritten, the exceptions become the policy.

Red Team Scenarios

The Fourteen-Billion-Dollar Hallucination

An LLM-generated risk assessment relied on by a federal procurement office turns out to be fabricated in a load-bearing paragraph. The contracts already moved.

Anna R. Dudley writes on national security, AI policy, and the institutional structures absorbing the costs of AI deployment faster than they are being redesigned. Red Team Scenarios is the series for the call you don't want to take. Subscribe at annardudley.substack.com.