Site Rag — agentic threat model

7.1AIVSS 7.1 · High

Site Rag is a developer-focused, open-source CLI tool for RAG pipeline automation. Its primary security risks stem from indirect prompt injection via scraped web content and potential SSRF/local file access if the scraping tool is pointed at internal resources.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 6.5AARS uplift 0.63Factor sum 1.8/10Threat ×1.0Mitigation ×1.0

Autonomy of Action		0.20
Goal-Driven Planning		0.10
Self-Modification		0.00
Dynamic Tool Use		0.20
Persistent Memory		0.40
Contextual Awareness		0.30
Dynamic Identity		0.00
Multi-Agent Interactions		0.00
Non-Determinism		0.40
Opacity & Reflexivity		0.20

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models⚠ not certain from listing

Not certain from the listing — integrates with 'popular language models' but does not specify which. The primary threat is indirect prompt injection, where malicious instructions embedded in scraped web pages manipulate the model's behavior during querying.

L2 · Data Operations✓ mapped

Site Rag automates scraping, text extraction, and embedding generation. It is highly susceptible to data poisoning if it scrapes untrusted or attacker-controlled websites, leading to corrupted vector stores and manipulated query results.

L3 · Agent Frameworks✓ mapped

The tool orchestrates scraping and RAG querying via a CLI. Vulnerabilities include insecure tool integration, such as Server-Side Request Forgery (SSRF) if the scraper is coerced into targeting internal network endpoints or local files.

L4 · Deployment & Infrastructure⚠ not certain from listing

Not certain from the listing — as an open-source CLI tool, deployment is entirely local to the developer's machine or self-hosted. Security relies on the host environment's isolation, with risks of local privilege escalation if run with high privileges.

L5 · Evaluation & Observability⚠ not certain from listing

Not certain from the listing — no built-in evaluation, logging, or guardrail mechanisms are mentioned. This creates observability blind spots regarding malicious inputs, data drift, or hallucinated RAG outputs.

L6 · Security & Compliance (cross-cutting)⚠ not certain from listing

Not certain from the listing — as a lightweight utility, it lacks native identity, authorization, or compliance frameworks, shifting all access control and regulatory compliance burdens to the deploying developer.

L7 · Agent Ecosystem✓ mapped

No multi-agent or marketplace interactions are described. It operates as a standalone horizontal utility, meaning ecosystem-level threats like cascading agent-to-agent trust abuse are not applicable.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).

These scores are auto-generated from public information (the agent's own listing, docs, and repository) using the canonical OWASP AIVSS formula and the MAESTRO framework — an estimate for guidance, not a penetration test, audit, or certification. See the scoring methodology. Are you the vendor? Factual corrections are free.