Agent S — agentic threat model

9.9AIVSS 9.9 · Critical

Agent S presents an extremely high-risk profile due to its 'computer use' capabilities, allowing autonomous GUI execution (clicking, typing) directly on the host OS. Without strict sandboxing, a prompt injection attack via web content or documents could lead to complete host compromise.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 9.8AARS uplift 0.13Factor sum 5.85/10Threat ×1.1Mitigation ×1.0

Autonomy of Action		0.90
Goal-Driven Planning		0.90
Self-Modification		0.20
Dynamic Tool Use		0.95
Persistent Memory		0.30
Contextual Awareness		0.80
Dynamic Identity		0.20
Multi-Agent Interactions		0.10
Non-Determinism		0.80
Opacity & Reflexivity		0.70

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models✓ mapped

Utilizes external foundation models (OpenAI, Anthropic, Gemini, etc.) paired with a specialized grounding model for UI understanding. Highly vulnerable to indirect prompt injection via on-screen text, malicious web pages, or documents that the agent processes during GUI navigation.

L2 · Data Operations⚠ not certain from listing

Not certain from the listing — No explicit RAG or vector database is mentioned in the description, though the agent processes real-time screen data and UI elements which could be poisoned by malicious on-screen content.

L3 · Agent Frameworks✓ mapped

The framework orchestrates multi-step planning and tool execution via an Agent-Computer Interface. Vulnerabilities include planning failures, logic loops, and tool misuse where the agent executes destructive OS commands or clicks malicious UI elements due to adversarial inputs.

L4 · Deployment & Infrastructure✓ mapped

Runs via CLI on Linux, macOS, and Windows. If executed directly on a host machine without robust containerization or VM sandboxing, any compromise of the agent translates directly to full host compromise and potential lateral network movement.

L5 · Evaluation & Observability✓ mapped

Includes evaluation assets and reports results on benchmarks like OSWorld, but the listing does not indicate active runtime guardrails, safety filters, or real-time anomaly detection to prevent harmful actions during live execution.

L6 · Security & Compliance (cross-cutting)⚠ not certain from listing

Not certain from the listing — No built-in security policies, access controls, or compliance frameworks are detailed in the open-source CLI description.

L7 · Agent Ecosystem⚠ not certain from listing

Not certain from the listing — The framework focuses on single-agent computer use; multi-agent coordination or marketplace interactions are not explicitly described.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).

These scores are auto-generated from public information (the agent's own listing, docs, and repository) using the canonical OWASP AIVSS formula and the MAESTRO framework — an estimate for guidance, not a penetration test, audit, or certification. See the scoring methodology. Are you the vendor? Factual corrections are free.