PiloTY — agentic threat model

10.0AIVSS 10.0 · Critical

PiloTY presents an exceptionally high-risk agentic profile by granting LLMs direct, stateful, and interactive terminal control (PTY) and SSH capabilities, allowing arbitrary command execution on remote hosts.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 9.8AARS uplift 0.16Factor sum 7.1/10Threat ×1.1Mitigation ×1.0

Autonomy of Action		0.90
Goal-Driven Planning		0.80
Self-Modification		0.30
Dynamic Tool Use		1.00
Persistent Memory		0.60
Contextual Awareness		0.70
Dynamic Identity		0.90
Multi-Agent Interactions		0.40
Non-Determinism		0.80
Opacity & Reflexivity		0.70

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models⚠ not certain from listing

Not certain from the listing — PiloTY is an MCP tool/framework rather than a model itself. However, the underlying LLM is highly vulnerable to indirect prompt injection via terminal outputs or SSH banners, which could hijack the active shell session.

L2 · Data Operations⚠ not certain from listing

Not certain from the listing — No explicit RAG or vector database is described. The primary data risk is the exfiltration of sensitive terminal outputs, configuration files, or environment variables read during SSH/PTY sessions.

L3 · Agent Frameworks✓ mapped

The agent framework layer is highly critical here; PiloTY provides stateful PTY and SSH tool integrations. Insecure tool integration or lack of strict input sanitization allows an LLM to execute arbitrary, destructive shell commands or run unauthorized background processes.

L4 · Deployment & Infrastructure✓ mapped

Extremely high risk. PiloTY manages SSH connections and interactive terminals. Without strict containerization, network segmentation, and non-root execution, a compromised session allows immediate lateral movement, host compromise, and access to production infrastructure.

L5 · Evaluation & Observability⚠ not certain from listing

Not certain from the listing — There is no mention of built-in logging, session recording, or command guardrails. The lack of real-time monitoring for executed terminal commands represents a major observability blind spot.

L6 · Security & Compliance (cross-cutting)✓ mapped

The tool handles high-privilege SSH credentials and session states. The listing does not mention any built-in authentication, authorization policies, or access controls to restrict which commands the agent can execute or which hosts it can connect to.

L7 · Agent Ecosystem✓ mapped

As an MCP tool, PiloTY can be exposed to other agents. If a upstream orchestrator agent is compromised, it can abuse PiloTY's terminal access to execute cascading attacks across all connected remote hosts.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).