Hey Help — agentic threat model

9.4AIVSS 9.4 · Critical

Hey Help presents a high agentic risk profile due to its direct integration with Gmail APIs, exposing it to severe indirect prompt injection vulnerabilities from incoming untrusted emails that could lead to unauthorized data access, automated archiving, or malicious drafting.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 8.8AARS uplift 0.64Factor sum 5.1/10Threat ×1.05Mitigation ×1.0

Autonomy of Action		0.80
Goal-Driven Planning		0.50
Self-Modification		0.10
Dynamic Tool Use		0.70
Persistent Memory		0.60
Contextual Awareness		0.80
Dynamic Identity		0.20
Multi-Agent Interactions		0.10
Non-Determinism		0.60
Opacity & Reflexivity		0.70

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models⚠ not certain from listing

Not certain from the listing — The underlying LLM is not specified, but the agent is highly vulnerable to indirect prompt injection where malicious incoming emails manipulate the foundation model into executing unauthorized actions (e.g., exfiltrating data via drafts).

L2 · Data Operations⚠ not certain from listing

Not certain from the listing — The agent processes sensitive personal and business email data. There is no mention of vector databases or RAG, but the risk of data exfiltration of inbox contents via automated drafting or external API calls is extremely high.

L3 · Agent Frameworks✓ mapped

The agent orchestrates actions like 'AI tagging', 'Auto archive', and 'AI email drafts' using Gmail APIs. Insecure tool integration or logic flaws in the framework could allow an attacker to trigger unintended email archiving or draft generation.

L4 · Deployment & Infrastructure⚠ not certain from listing

Not certain from the listing — The deployment architecture (likely a Chrome extension or Google Workspace Add-on backed by a cloud service) is not detailed. The primary threat is the compromise of OAuth tokens granting access to the user's entire Gmail account.

L5 · Evaluation & Observability⚠ not certain from listing

Not certain from the listing — No evaluation, guardrails, or logging mechanisms are described. Without robust guardrails, the agent cannot reliably distinguish between legitimate user instructions and adversarial instructions embedded in incoming emails.

L6 · Security & Compliance (cross-cutting)⚠ not certain from listing

Not certain from the listing — No security certifications (such as SOC 2) or compliance alignments are mentioned. The agent requires broad, highly sensitive Gmail permissions (read/write/modify) without explicit security controls detailed in the directory.

L7 · Agent Ecosystem⚠ not certain from listing

Not certain from the listing — No multi-agent interactions are described, but the agent operates in an open ecosystem where it interacts with emails sent by other automated systems or agents, potentially leading to cascading automated actions.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).