SweetAI Chat — agentic threat model

7.7AIVSS 7.7 · High

SweetAI Chat is a consumer-facing NSFW roleplay and image generation agent with low systemic autonomy but high confidentiality risks due to the highly sensitive nature of user-generated NSFW chat histories and images.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 6.5AARS uplift 1.15Factor sum 3.3/10Threat ×1.0Mitigation ×1.0

Autonomy of Action		0.20
Goal-Driven Planning		0.10
Self-Modification		0.10
Dynamic Tool Use		0.30
Persistent Memory		0.40
Contextual Awareness		0.50
Dynamic Identity		0.20
Multi-Agent Interactions		0.00
Non-Determinism		0.80
Opacity & Reflexivity		0.70

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models⚠ not certain from listing

Not certain from the listing — likely utilizes fine-tuned LLMs and text-to-image diffusion models optimized for NSFW content. Primary threats include prompt injection to bypass system instructions, model reprogramming, and adversarial inputs designed to generate illicit or non-consensual imagery.

L2 · Data Operations⚠ not certain from listing

Not certain from the listing — relies on user-provided character descriptions and chat histories. Key risks include data exfiltration of highly sensitive personal chats and potential leakage of private user data if chat logs are ingested to fine-tune future model iterations.

L3 · Agent Frameworks⚠ not certain from listing

Not certain from the listing — orchestration is likely limited to a basic conversational loop with image generation tool-calling. Vulnerabilities include insecure tool integration where prompt injection could manipulate image generation parameters or API calls.

L4 · Deployment & Infrastructure⚠ not certain from listing

Not certain from the listing — hosted as a closed-source web application. Risks include standard web application vulnerabilities, insecure storage of generated media assets, and potential exposure of backend API endpoints.

L5 · Evaluation & Observability⚠ not certain from listing

Not certain from the listing — observability and guardrails are likely minimal or heavily modified to permit NSFW content, creating significant blind spots regarding the generation of extreme, illegal, or non-consensual content.

L6 · Security & Compliance (cross-cutting)⚠ not certain from listing

Not certain from the listing — faces severe compliance and regulatory risks, particularly regarding robust age verification, COPPA/GDPR compliance for sensitive personal data, and the lack of visible security certifications.

L7 · Agent Ecosystem⚠ not certain from listing

Not certain from the listing — operates as a vertical, isolated platform. The primary ecosystem risk is the sharing of user-created character templates, which could be weaponized to distribute malicious prompts or bypass platform restrictions.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).

These scores are auto-generated from public information (the agent's own listing, docs, and repository) using the canonical OWASP AIVSS formula and the MAESTRO framework — an estimate for guidance, not a penetration test, audit, or certification. See the scoring methodology. Are you the vendor? Factual corrections are free.