Voicebox

Voice AI Agents · Free · Open Source · Technology, Media, Education, Entertainment, Productivity

Open-source local AI voice studio for cloning voices, generating speech, dictation, and giving MCP-aware agents custom voices.

🛡️ AgentReady threat assessment

MAESTRO 7-layer threat model + OWASP AIVSS risk score for Voicebox, derived from its capabilities.

AIVSS 6.7 · Medium

These scores are auto-generated from public information (the agent's own listing, docs, and repository) using the canonical OWASP AIVSS formula and the MAESTRO framework. They are an estimate for guidance, not a penetration test, audit, or certification.

Overview

Voicebox is a free, open-source, local-first AI voice studio from the jamiepine/voicebox GitHub project. It is described as an alternative to ElevenLabs and WisprFlow in one app, combining voice output and voice input workflows. Voicebox can clone voices from a few seconds of audio, generate speech in 23 languages across seven TTS engines, provide global-hotkey dictation into text fields, and let MCP-aware AI agents speak using voices the user owns. The project emphasizes local execution and privacy: models run on the user's machine, and voice data and captures stay there.

Key features

Use cases