Core Concepts¶

Findings¶

A Finding is the atomic unit of output. Every scan result, discovery match, fingerprint detection, and exploit result produces one or more findings. Findings share a common schema across all modules:

{
  "id": "f-a1b2c3d4e5f6",
  "timestamp": "2025-01-15T10:30:00Z",
  "source": "ollama",
  "target": "http://10.0.0.5:11434",
  "title": "Ollama Unauthenticated - 3 Models Exposed",
  "severity": "high",
  "description": "...",
  "remediation": "...",
  "evidence": "...",
  "tags": ["ollama", "auth"],
  "metadata": { ... }
}

Severity levels: critical, high, medium, low, info.

Source values identify which module generated the finding: vulncheck, file-discovery, fingerprint, ollama, vectordb, jupyter, mcp, openai-compat, ray, mlflow, gradio, bentoml, triton, torchserve, credential.

See Finding Schema for the complete field reference.

Templates¶

Templates are YAML files that define HTTP-based vulnerability checks. They follow a detect-then-check pattern:

Detect -- optional HTTP probe to confirm the target is the expected service
Checks -- one or more HTTP requests with matchers and extractors
Finding -- the output finding when matchers succeed, with variable interpolation from extractors

Templates are embedded in the binary. Operators can layer additional templates via --templates-dir.

See Template Format for the schema reference.

Discovery Rules¶

Discovery rules are YAML files that define filesystem scanning patterns for AI artifacts. Each rule specifies file patterns, path patterns, and content patterns (regex) to match:

API keys (OpenAI, Anthropic, Hugging Face, AWS, etc.)
MCP configurations (Claude Desktop, VS Code, Cursor)
Local LLM artifacts (GGUF models, Ollama configs, Docker AI)
Vector database data (ChromaDB, FAISS, Weaviate, Qdrant)
Fine-tuning datasets and RAG configurations

Rules are embedded in the binary. Operators can add custom rules via --rules-dir.

See Rule Format for the schema reference.

Workflow Plans¶

Workflow plans are structured follow-on guidance attached to findings. When a discovery or enumeration command finds something actionable, the finding includes concrete next-step commands.

Workflow plans use a staged progression model:

Stage	Purpose
discovery	Identify reachable AI surfaces
correlation	Map discovered assets to exploit paths
proof	Validate access with read-only operations
takeover	Demonstrate impact with state-changing operations

Each recommendation in a workflow plan includes:

command -- the exact aipostex command to run next
rationale -- why this step matters
gated -- whether the command requires --force-exploit
priority -- ordering hint (lower = run first)

Read-only recommendations always appear before gated ones.

Proof Stages and Strength¶

Findings from exploit modules carry proof metadata indicating the level of access demonstrated:

Proof Strength	Meaning
`reachable`	Service responds to probes
`influenced`	Behavior was observably changed
`read-confirmed`	Data was successfully read from the target
`execution-confirmed`	Code or commands executed on the target
`takeover-capable`	Full control of the service demonstrated

Scan Modes¶

The --mode flag controls which vulnerability templates execute during scanning:

Mode	Default	Detection Templates	Exploit Templates
`detect`	Yes	Runs (58 templates)	Skipped
`full`	No	Runs (58 templates)	Runs (44 templates)

Detection templates perform passive checks: authentication probes, version disclosure, config enumeration, tool listing. They never send exploitation payloads.

Exploit templates actively validate vulnerabilities: command injection via MCP tools, SSRF to internal services, path traversal file reads, unauthenticated inference, terminal creation, job submission, artifact exfiltration, and data extraction. They only run when explicitly opted in via --mode full.

This separation ensures operators can safely assess AI infrastructure without modifying target state, then escalate to active exploitation when authorized.

JSON and JSONL Inputs¶

Most post-processing commands accept either a full engagement JSON document or newline-delimited JSONL findings:

engagement merge
report summary
report generate
report graph
engagement bundle

This lets operators stream long-running scans to JSONL, then feed the results directly into reporting and analysis commands without an extra conversion step.

Credential Chain-Loading¶

When discover network or assess network produces findings that contain credentials (Jupyter tokens, OpenAI API keys, HF tokens, Anthropic keys, bearer tokens), those credentials are automatically extracted and injected into workflow recommendations. This closes the gap between discovery and exploitation: instead of manually copying tokens into follow-on commands, the workflow suggestions include the discovered credential values.

Supported credential types:

Jupyter tokens (from URL query params or evidence text)
OpenAI API keys (sk-... pattern)
Hugging Face tokens (hf_... pattern)
Anthropic API keys (sk-ant-... pattern)
Bearer tokens
Generic API keys

Force-Exploit Gating¶

aipostex separates read-only operations from state-changing or high-noise operations using the --force-exploit flag:

Ungated (no flag needed): enumeration, listing, reading, extraction, fingerprinting
Gated (requires --force-exploit): model creation/deletion/poisoning, code execution, file uploads, throughput testing, proxy validation

This ensures operators never accidentally modify target state during passive reconnaissance.

Output Contract¶

stdout -- findings output (console, JSON, or JSONL)
stderr -- progress, warnings, blocked-exploit messages, summaries, and Next Actions guidance
Ctrl+C -- cleanly cancels in-flight work and preserves findings already written