Session Instructions¶
Goal: Tell the AI how to drive Overwatch correctly. This is the content that lives in `AGENTS.md` (or `CLAUDE.md` for Claude Code) at the project root.
You probably don't need to read this
The repo already ships an `AGENTS.md` at the root. Claude Code reads it automatically. This page exists so you understand what the AI is being told to do, and so you can customize it if you need to.
What the AI does (the core loop)¶
```mermaid
flowchart TD
    START([Session start]) --> A[get_state]
    A --> B[next_task]
    B --> C{Score & pick<br/>highest leverage}
    C --> D[validate_action]
    D -->|valid| E[log_action_event<br/>action_started]
    D -->|invalid| B
    E --> F[Execute<br/>run_bash / run_tool / shell]
    F --> G[parse_output<br/>or report_finding]
    G --> H[log_action_event<br/>action_completed]
    H -->|new findings| I[Inference rules fire]
    I --> B
    H -.parallel work.-> J[register_agent]
    J -.results back.-> B

    classDef state fill:#22c55e,stroke:#15803d,color:#fff
    classDef decide fill:#f59e0b,stroke:#92400e,color:#000
    classDef act fill:#3b82f6,stroke:#1e40af,color:#fff
    class A,B state
    class C,D decide
    class E,F,G,H,I,J act
```
In plain words:

- **Start by reading state.** `get_state()` gives the full briefing — scope, discoveries, access, objectives, frontier. Every session starts here, including after compaction.
- **Look at the frontier.** `next_task()` returns candidates already filtered by the deterministic layer (out-of-scope, duplicate, and OPSEC-vetoed items are gone).
- **Pick the best one.** This is where the AI does real work — score by chain potential, sequencing, risk, and distance to objective.
- **Validate.** `validate_action()` returns an `action_id` and a verdict.
- **Log start, execute, parse/report, log finish.** Always carry `action_id` and `frontier_item_id` through.
- **Repeat.** New findings fire inference rules, which create new frontier items.
- **Parallelize.** Independent tasks go to sub-agents via `register_agent()`.
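The loop above can be sketched as straight-line Python. The tool functions here are local stubs standing in for the real MCP tools; the payload shapes (scores, verdicts, event names) are assumptions for illustration only:

```python
# Minimal sketch of the core loop. Every function below is a stub that mimics
# the MCP tool of the same name; real signatures and payloads may differ.

LOG = []

def get_state():
    # Full briefing: scope, discoveries, access, objectives, frontier.
    return {"scope": ["10.0.0.0/24"], "objectives": ["domain admin"]}

def next_task():
    # Deterministic layer has already dropped out-of-scope, duplicate,
    # and OPSEC-vetoed items; only viable candidates remain.
    return [
        {"frontier_item_id": "fi-1", "task": "enumerate SMB shares", "score": 0.8},
        {"frontier_item_id": "fi-2", "task": "probe stale web host", "score": 0.3},
    ]

def validate_action(frontier_item_id, plan):
    return {"action_id": f"act-{frontier_item_id}", "verdict": "valid"}

def log_action_event(action_id, event):
    LOG.append((action_id, event))

def run_session():
    state = get_state()                                   # 1. read state
    candidates = next_task()                              # 2. look at the frontier
    best = max(candidates, key=lambda c: c["score"])      # 3. pick highest leverage
    v = validate_action(best["frontier_item_id"], best["task"])  # 4. validate
    if v["verdict"] != "valid":
        return None
    log_action_event(v["action_id"], "action_started")    # 5. log start
    # ... execute via run_bash / run_tool, then parse_output / report_finding ...
    log_action_event(v["action_id"], "action_completed")  # 5. log finish
    return v["action_id"]

print(run_session())
```

The stub scoring is a plain `max()`; in practice step 3 is where the AI's judgment about chaining, sequencing, and risk actually lives.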
Key principles¶
- **The graph is memory.** After compaction, `get_state()` rebuilds everything. Don't try to hold engagement state in context.
- **Report early, report often.** Every `report_finding` triggers inference rules that may surface new attack paths.
- **Always thread `frontier_item_id`.** From `next_task` → `validate_action` → `log_action_event` → `parse_output`/`report_finding`. Without it, retrospectives lose causal attribution.
- **Validate before executing.** Catches scope, OPSEC, and impossible-target issues before you waste an action.
- **Use `query_graph` liberally.** If the frontier doesn't surface a pattern you're seeing, query for it directly.
- **Respect OPSEC.** Read the engagement's OPSEC profile and weight noise into your decisions.
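A minimal illustration of why threading the id matters, using invented event dicts rather than the real log schema: grouping by `frontier_item_id` is exactly what a retrospective does to attribute actions back to tasks.

```python
# Sketch: causal attribution in a retrospective. The event shapes below are
# assumptions for illustration, not Overwatch's actual log format.
from collections import defaultdict

events = [
    {"frontier_item_id": "fi-1", "event": "action_started"},
    {"frontier_item_id": "fi-1", "event": "action_completed"},
    {"frontier_item_id": "fi-2", "event": "action_started"},
]

by_task = defaultdict(list)
for e in events:
    by_task[e["frontier_item_id"]].append(e["event"])

print(dict(by_task))
# Without the id on each event, these three entries collapse into one
# unattributable stream and the retrospective can't tie outcomes to tasks.
```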
Sub-agent instructions¶
When dispatching agents with `register_agent`, give them this charter:
You are an Overwatch sub-agent working a specific task. Your tools:
- `get_agent_context` — scoped subgraph view
- `validate_action` — check before executing
- `log_action_event` — record action start/completion/failure
- `log_thought` — record reasoning, decisions, alternatives
- `run_bash`, `run_tool` — auto-instrumented one-shot execution
- `parse_output`, `report_finding` — get findings into the graph
- `query_graph`, `get_skill` — context lookup
- `open_session` / `write_session` / `read_session` / `send_to_session` / `list_sessions` / `close_session` — sessions
- `submit_agent_transcript` — wrap-up handoff before you're closed out

Validate first, log start, execute, parse/report, log completion. The primary will mark you done.
The full charter (with all 49 sub-agent tools and per-tool guidance) is in `AGENTS.md`.
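A hedged sketch of the dispatch shape: `register_agent`'s real signature is defined by the MCP server, so the stub below only illustrates handing each independent task its own copy of the charter.

```python
# Sketch of fanning independent tasks out to sub-agents. register_agent is a
# local stub; the real tool's parameters and return value may differ.
CHARTER = "You are an Overwatch sub-agent working a specific task. ..."

def register_agent(task, charter):
    # Stub: the real tool registers a sub-agent and returns a handle for it.
    return {"agent_id": f"agent-{task['frontier_item_id']}", "charter": charter}

# Only tasks with no dependency on each other should be dispatched in parallel.
independent_tasks = [
    {"frontier_item_id": "fi-3", "task": "screenshot web hosts"},
    {"frontier_item_id": "fi-4", "task": "enumerate DNS"},
]

agents = [register_agent(t, CHARTER) for t in independent_tasks]
print([a["agent_id"] for a in agents])
```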
Customizing the prompt¶
The AI bootstraps from one of these sources, in order of preference:
- `get_system_prompt(role="primary")` — generated dynamically from current state (preferred). Includes the live tool table, briefing, and OPSEC constraints.
- `AGENTS.md` at the project root — static fallback when MCP isn't available.
- `CLAUDE.md` — Claude Code reads this first if present; in our repo it just points at `AGENTS.md`.
If you want to change how the AI behaves (different scoring weights, additional principles, custom workflows), edit `AGENTS.md` and the AI will pick it up on next session start. Don't edit it during an active session — Claude Code only reads it at startup.
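The preference order above can be sketched as a fallback chain. `get_system_prompt` is stubbed here to simulate MCP being unavailable; only the file names come from the docs, and the return shape is an assumption.

```python
# Sketch of the bootstrap order: dynamic prompt first, static files second.
import os

def get_system_prompt(role="primary"):
    # Stub standing in for the MCP tool; returns None to simulate MCP
    # being unavailable, which forces the static-file fallback.
    return None

def bootstrap_prompt(root="."):
    prompt = get_system_prompt(role="primary")   # preferred: live, dynamic
    if prompt:
        return ("mcp", prompt)
    for name in ("AGENTS.md", "CLAUDE.md"):      # static fallbacks, in order
        path = os.path.join(root, name)
        if os.path.exists(path):
            with open(path) as f:
                return (name, f.read())
    return ("none", "")
```

Because the files are only read once at session start, editing `AGENTS.md` mid-session has no effect until the next bootstrap.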
See also¶
- Operator Playbook — what to actually do once the AI is running
- `parse_output` vs `report_finding` — which to use for what
- Concepts — Action Lifecycle — the deeper "why" behind the loop