doc-coauthoring — agentic threat model

4.0AIVSS 4.0 · Medium

This agent presents a very low security risk profile as it is a pure guidance skill for document co-authoring with high human-in-the-loop integration. The primary risks are limited to prompt injection affecting document quality or leaking draft contents to the LLM provider.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 3.5AARS uplift 1.48Factor sum 2.4/10Threat ×0.95Mitigation ×0.8

Autonomy of Action		0.20
Goal-Driven Planning		0.40
Self-Modification		0.10
Dynamic Tool Use		0.10
Persistent Memory		0.10
Contextual Awareness		0.50
Dynamic Identity		0.00
Multi-Agent Interactions		0.30
Non-Determinism		0.50
Opacity & Reflexivity		0.20

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models✓ mapped

Relies on Anthropic's Claude models. Vulnerable to prompt injection that could hijack the co-authoring workflow, generate biased/malicious content, or leak the underlying system instructions.

L2 · Data Operations⚠ not certain from listing

Not certain from the listing — No vector database or RAG architecture is mentioned. The agent relies entirely on context gathered dynamically from the user during the session.

L3 · Agent Frameworks✓ mapped

The agent framework manages a structured three-stage workflow (Context Gathering, Refinement, Reader Testing). Vulnerabilities include workflow bypass or state-tracking failures where the agent skips stages or loses context.

L4 · Deployment & Infrastructure⚠ not certain from listing

Not certain from the listing — Described as 'pure guidance content with no bundled scripts', implying it runs within the host platform's existing LLM infrastructure without dedicated sandboxing needs.

L5 · Evaluation & Observability✓ mapped

Features an innovative built-in evaluation step ('Reader Testing' via a fresh Claude instance). However, there is no mention of external logging, guardrails, or policy enforcement to detect malicious inputs.

L6 · Security & Compliance (cross-cutting)⚠ not certain from listing

Not certain from the listing — No details are provided regarding data privacy, compliance standards (like GDPR/HIPAA), or access controls for the documents being co-authored.

L7 · Agent Ecosystem✓ mapped

Exhibits a simple multi-agent/multi-instance interaction by spinning up a 'fresh no-context Claude' for reader testing. A compromised draft document could theoretically contain prompt injections designed to exploit or break the reader-testing instance.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).