InstAgent — agentic threat model

9.0AIVSS 9.0 · Critical

InstAgent acts as a meta-agent generator, posing a significant supply-chain risk where prompt injection or system compromise could lead to the generation of backdoored multi-agent systems with insecure tool integrations.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 8.2AARS uplift 0.83Factor sum 4.4/10Threat ×1.05Mitigation ×1.0

Autonomy of Action		0.40
Goal-Driven Planning		0.70
Self-Modification		0.20
Dynamic Tool Use		0.40
Persistent Memory		0.20
Contextual Awareness		0.50
Dynamic Identity		0.10
Multi-Agent Interactions		0.90
Non-Determinism		0.60
Opacity & Reflexivity		0.40

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models⚠ not certain from listing

Not certain from the listing — the underlying foundation models used to generate the agent architectures and code are not specified, leaving them vulnerable to prompt injection that could manipulate the generated code or tool assignments.

L2 · Data Operations⚠ not certain from listing

Not certain from the listing — there is no mention of RAG or vector stores used by InstAgent itself, though data poisoning of its training/fine-tuning data could lead to insecure code generation templates.

L3 · Agent Frameworks✓ mapped

InstAgent orchestrates multi-agent systems and automatically assigns tools. A key threat is insecure tool integration or generating flawed orchestration code that allows tool misuse in the generated agents.

L4 · Deployment & Infrastructure⚠ not certain from listing

Not certain from the listing — as an open-source framework, deployment is local or user-managed. Insecure hosting of the generated code or lack of sandboxing during execution poses container compromise risks.

L5 · Evaluation & Observability✓ mapped

While it provides interactive visualization of workflows, there is no mention of built-in guardrails, evaluation metrics, or runtime monitoring for the generated agent teams.

L6 · Security & Compliance (cross-cutting)⚠ not certain from listing

Not certain from the listing — no authentication, authorization, or compliance controls (like NIST or ISO alignment) are detailed for the generation process or the output code.

L7 · Agent Ecosystem✓ mapped

InstAgent's core value is generating multi-agent teams. This introduces significant ecosystem risks, including cascading failures, trust abuse between generated agents, and rogue agent behavior if the generation logic is compromised.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).