DemoGPT — agentic threat model

8.8AIVSS 8.8 · High

DemoGPT is an open-source AI framework focused on generating interactive LLM applications from prompts. Its primary security risk lies in the generation of untrusted code, which could lead to local or remote code execution if deployed without strict sandboxing.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 8.1AARS uplift 0.72Factor sum 3.8/10Threat ×1.0Mitigation ×1.0

Autonomy of Action		0.40
Goal-Driven Planning		0.50
Self-Modification		0.20
Dynamic Tool Use		0.60
Persistent Memory		0.20
Contextual Awareness		0.40
Dynamic Identity		0.10
Multi-Agent Interactions		0.20
Non-Determinism		0.70
Opacity & Reflexivity		0.50

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models⚠ not certain from listing

Not certain from the listing — DemoGPT likely relies on external foundation models (such as OpenAI) to interpret prompts and generate code. It is highly susceptible to prompt injection attacks that could manipulate the generated application's logic.

L2 · Data Operations⚠ not certain from listing

Not certain from the listing — The data operations layer is not detailed, but the framework likely processes user prompts and potentially utilizes template repositories. Risks include template poisoning or insecure handling of user-provided data sources during demo generation.

L3 · Agent Frameworks✓ mapped

As an agent framework itself, DemoGPT orchestrates the translation of prompts into functional application code. A key threat is insecure tool integration or pipeline hijacking, where malicious prompts force the framework to generate and execute harmful code structures.

L4 · Deployment & Infrastructure⚠ not certain from listing

Not certain from the listing — The deployment risk depends heavily on where the generated demos are executed (e.g., local developer machines vs. hosted cloud environments). Without strict containerization or sandboxing, running generated code poses a high risk of host compromise.

L5 · Evaluation & Observability⚠ not certain from listing

Not certain from the listing — There is no mention of built-in evaluation, guardrails, or observability tools to inspect the safety of the generated code before execution, creating a significant blind spot.

L6 · Security & Compliance (cross-cutting)⚠ not certain from listing

Not certain from the listing — As an open-source framework, it likely lacks enterprise-grade access controls, compliance certifications, or built-in policy enforcement mechanisms, leaving security responsibility entirely to the deployer.

L7 · Agent Ecosystem⚠ not certain from listing

Not certain from the listing — While primarily a standalone generator, if DemoGPT is integrated into a larger multi-agent ecosystem, a compromised generation pipeline could be used to distribute malicious applications or agents horizontally.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).