GPT Researcher — agentic threat model

8.9AIVSS 8.9 · High

GPT Researcher presents a high-risk profile primarily due to its ability to read local files and query the live web simultaneously, making it highly susceptible to indirect prompt injection and subsequent data exfiltration. Its autonomous multi-agent orchestration and support for arbitrary LLMs further compound the difficulty of enforcing strict input sanitization and execution boundaries.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 7.5AARS uplift 1.4Factor sum 5.6/10Threat ×1.0Mitigation ×1.0

Autonomy of Action		0.80
Goal-Driven Planning		0.80
Self-Modification		0.20
Dynamic Tool Use		0.70
Persistent Memory		0.30
Contextual Awareness		0.80
Dynamic Identity		0.10
Multi-Agent Interactions		0.80
Non-Determinism		0.60
Opacity & Reflexivity		0.50

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models✓ mapped

Allows users to connect any LLM of choice. This introduces risks of model misalignment, varying susceptibility to prompt injection, and inconsistent safety guardrails depending on the selected foundation model.

L2 · Data Operations✓ mapped

Supports hybrid research across local documents/files and web sources. This creates a high risk of data poisoning via malicious web content, and potential data exfiltration if local files are processed and sent to external LLM APIs.

L3 · Agent Frameworks✓ mapped

Utilizes an autonomous orchestration framework to plan, outline, and execute parallelized research tasks. The integration of search engines and local file readers as tools exposes the framework to indirect prompt injection where untrusted web data hijacks the agent's execution flow.

L4 · Deployment & Infrastructure⚠ not certain from listing

Not certain from the listing — As an open-source tool, deployment is typically local or self-hosted. Without default sandboxing, a compromise of the agent process could lead to local file system exposure, privilege escalation, or theft of stored LLM API keys.

L5 · Evaluation & Observability⚠ not certain from listing

Not certain from the listing — While the description claims to address misinformation and reliability, it does not specify built-in real-time guardrails, logging, or observability features to detect adversarial manipulation or anomalous tool usage.

L6 · Security & Compliance (cross-cutting)⚠ not certain from listing

Not certain from the listing — Being a free, open-source research tool, it lacks centralized identity access management, enterprise compliance certifications (like SOC2), or built-in policy enforcement mechanisms, leaving security entirely to the deployer.

L7 · Agent Ecosystem✓ mapped

Supports multi-agent frameworks and parallelized agent operations. This introduces risks of cascading failures, trust abuse between sub-agents, and complex attack vectors where one compromised sub-agent manipulates the aggregation phase of the research report.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).