Weave
Lightweight toolkit for tracking and evaluating LLM applications
🛡️ AgentReady threat assessment
MAESTRO 7-layer threat model + OWASP AIVSS risk score for Weave, derived from its capabilities.
These scores are auto-generated from public information (the agent's own listing, docs, and repository) using the canonical OWASP AIVSS formula and the MAESTRO framework — an estimate for guidance, not a penetration test, audit, or certification. See the scoring methodology. Are you the vendor? Factual corrections are free.
Overview
Building demos of Generative AI applications is deceptively easy; getting them into production (and maintaining their high quality) is not. W&B Weave is here to help developers build and iterate on their AI applications with confidence. Create rigorous apples-to-apples evaluations to score the behavior of any aspect of your app. Examine and debug failures by easily inspecting inputs and outputs. Deliver high performing AI applications to production.