Hermes Agent — agentic threat model

10.0AIVSS 10.0 · Critical

Hermes Agent presents an extremely high-risk profile due to its combination of persistent shell access, browser automation, and a self-improving learning loop. Without strict sandboxing and robust input sanitization, its extensive toolset and remote accessibility make it a prime target for remote code execution and host compromise.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 9.8AARS uplift 0.17Factor sum 7.7/10Threat ×1.1Mitigation ×1.0

Autonomy of Action		0.90
Goal-Driven Planning		0.80
Self-Modification		0.90
Dynamic Tool Use		0.90
Persistent Memory		1.00
Contextual Awareness		0.80
Dynamic Identity		0.40
Multi-Agent Interactions		0.50
Non-Determinism		0.80
Opacity & Reflexivity		0.70

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models⚠ not certain from listing

Not certain from the listing — likely utilizes Nous Research's Hermes series models. Vulnerable to prompt injection, adversarial examples, and model alignment drift as the agent self-improves and modifies its behavior over time.

L2 · Data Operations✓ mapped

Persistent memory, user modeling, and past conversation search require a local or cloud vector database/state store. Vulnerable to memory poisoning and data exfiltration of sensitive user history.

L3 · Agent Frameworks✓ mapped

Features a self-improving learning loop, MCP connectivity, and plugins. Vulnerable to insecure tool execution, prompt injection leading to shell command execution, and malicious plugin exploitation.

L4 · Deployment & Infrastructure✓ mapped

Supports local/cloud deployment with persistent shell access and browser automation. Highly vulnerable to host compromise, privilege escalation, and container escape if not strictly sandboxed.

L5 · Evaluation & Observability⚠ not certain from listing

Not certain from the listing — no built-in evaluation, guardrails, or monitoring tools are mentioned. Gaps in logging could allow malicious shell commands or memory drift to go undetected.

L6 · Security & Compliance (cross-cutting)⚠ not certain from listing

Not certain from the listing — being an open-source developer-focused tool, it lacks explicit mention of enterprise identity, access management (IAM), or compliance frameworks.

L7 · Agent Ecosystem✓ mapped

Supports MCP (Model Context Protocol) and multi-channel use cases (e.g., Telegram). Vulnerable to trust abuse from external MCP servers or malicious inputs received via Telegram.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).

These scores are auto-generated from public information (the agent's own listing, docs, and repository) using the canonical OWASP AIVSS formula and the MAESTRO framework — an estimate for guidance, not a penetration test, audit, or certification. See the scoring methodology. Are you the vendor? Factual corrections are free.