Vocode
Vocode is an open source developer framework for building voice agents on any AI stack, offering Python and Node libraries, a hosted calling service, and full model flexibility.
Vocode is an open source developer framework for building voice agents, not a finished product you deploy off the shelf. Backed by Y Combinator, it gives engineers the building blocks to create real time, streaming conversations with large language models and connect them to phone calls, Zoom meetings, web apps, and more. Its value is as the orchestration layer that handles the hard parts of voice, including streaming audio, knowing when a caller has stopped speaking, and gracefully handling interruptions, so a developer does not have to wire all of that together from scratch.
There are two ways to use it. Vocode Core is the open source library, available in Python with Node and React software development kits, which teams host themselves for complete control over every component. On top of that sits a hosted application programming interface, an enterprise grade service for managing agents on phone calls, so a team can embed an AI call center directly into its own product or operations. Most of what Vocode builds is open source and free, with paid developer and enterprise tiers for the hosted service.
The framework's strengths are flexibility rather than turnkey features. It integrates with every leading speech to text, text to speech, and language model provider, so teams can assemble the exact stack they want and swap components freely, and it supports many languages. Because it is open source, teams can self host to keep full control of data and cost, paying only their own provider bills with no platform fee on top. Tool calling lets agents fetch data and take actions through a developer's own systems. Higher level features like knowledge grounding, human oversight, memory, and formal testing are left for the builder to add.
The honest picture in 2026 is that Vocode is an early and influential library whose active development appears to have slowed, with the project seeking community maintainers. That makes it a reasonable choice for teams that want maximum control and are comfortable owning the stack, but a riskier foundation for a new production system than more actively maintained frameworks. Because Vocode is developer infrastructure rather than an end to end agent, it scores low on the agent capability axes here: its real strengths are model flexibility, self hosted deployment, and open source extensibility.
Vendor details
Canonical URL
https://vocode.dev
Category
Agent infrastructure
Company status
independent
Use cases & customers
In practice
A developer uses Vocode Core to build a custom phone agent, wiring in their chosen speech to text, language model, and text to speech providers, then self hosts it for full control of data.
A product team embeds Vocode's hosted calling service into its own software to place and receive automated calls, programming the agents to run tasks through the team's existing systems.
A startup prototypes a voice based assistant on Vocode's open source library, using its streaming and interruption handling so early demos feel natural without building the audio pipeline itself.
Sources & related URLs
Research notes
Score 5.5 (3F/5P/6N). Category: AGENT INFRASTRUCTURE (categorized here, not Voice agent - it's an OPEN-SOURCE DEVELOPER FRAMEWORK/LIBRARY, same logic as Rime; the 14-axis AGENT score understates it as infrastructure). Researched in Voice lane. Vocode = open-source Python library/framework for building voice-based LLM agents; YC-backed. Vocode Core (OSS library, Python + Node/React SDKs, self-hosted, build on ANY AI stack) + Vocode API (hosted enterprise-grade call-management API on top of Core). Value = ORCHESTRATION LAYER handling streaming audio, endpointing, interruptions, streaming+turn-based conversations. Fulls: Dep (SELF-HOSTED open-source + hosted options = full data/deployment control), Model (integrates ALL leading STT/TTS/LLM providers, build on any stack, swap freely, multilingual; MAX model flexibility - headline for a framework), Ext (OPEN-SOURCE library + Python/Node/React SDKs + enterprise API + docs; extensibility IS the product - headline). Partials (5): Int (tool calling plug-in-your-APIs + provider integrations STT/TTS/LLM/telephony/Zoom; business/CRM integrations build-your-own), Orch (conversational orchestration abstractions: streaming/turn-based + endpointing + interruptions; code-level, no visual builder/higher-level engine), Obs (hosted service performance analytics; basic), Pack (OSS examples/quickstarts + Next.js template + React SDK; not prebuilt production agents), Trig (inbound + outbound call automation via API + telephony/Zoom/web; campaigns/dialer/event-triggers build-your-own). N (6): Know (no first-class RAG/knowledge - build-your-own), HITL (no first-class oversight/handoff/guardrails), Sec (no named certs; self-hosted = implementer's responsibility), Mem (no first-class memory beyond in-call streaming state), Eval (no first-class Vocode-specific testing/eval framework documented, unlike LiveKit's judges), Comp (no browser/computer use). **VENDOR-HEALTH FLAG [CONFIRMED Jul 2026 via primary source]: original team moved on - founder GitHub bio now 'Working on something new, formerly @Vocode'; official vocodedev/vocode-core README explicitly 'We're actively looking for community maintainers'; still Seed-stage only (Liquid 2 Ventures), no growth funding. Corroborates earlier Dograh (Apr 2026) read that 'active development has largely stopped' and building new production = 'inheriting technical debt without active maintainer.' NET: real, still-installable OSS library but low-momentum community-maintenance mode. KEEP canonical (genuine usable OSS infra) but do NOT feature (isFeatured stays false); position as stable-but-low-activity, not a scaling commercial platform.** Pricing free (OSS Core free/self-hosted) + hosted tiers: Free / Developer ~$25/mo / Enterprise custom. No per-minute platform fee (OSS) but YOU bear all STT/TTS/LLM/telephony provider bills = HIGH variable cost exposure. entry under_20 (free OSS). public_partial. confidenceLevel medium (product well-documented GitHub/docs/YC BUT maintenance/viability uncertain). Peer OSS voice frameworks: Dograh (visual builder), Pipecat, LiveKit Agents (more actively maintained per critics).
Capability coverage
5.5 / 14 capabilities · 39%
| Integrations & Tool CallingVocode supports tool calling so agents can fetch data and take actions through a developer's own systems, and integrates telephony, Zoom, and web, but business and customer relationship management integrations are build your own, so partial. | Partial |
|---|---|
| Workflow OrchestrationVocode acts as the orchestration layer for voice, managing streaming and turn based conversations with endpointing and interruption handling, but as a code framework with no visual workflow builder, so partial. | Partial |
| Knowledge Grounding & RAGVocode is a framework that does not provide first class knowledge grounding or retrieval, which a developer must add themselves, so not documented. | Unable to verify |
| Human Oversight & GuardrailsVocode does not provide first class human oversight, handoff, or guardrail features, leaving these for the builder to implement, so not documented. | Unable to verify |
| Security, Identity & GovernanceVocode is a self hosted open source framework with no named security certifications, so governance is the implementer's responsibility and not documented as first class, so not documented. | Unable to verify |
| Observability & AuditabilityVocode's hosted service provides analytics to monitor agent performance, but deep conversation intelligence and auditability are not documented, so partial. | Partial |
| Memory & State PersistenceVocode manages conversation state within a streaming call but does not provide first class persistent or long term memory, so not documented. | Unable to verify |
| Deployment & Data ResidencyVocode is open source and can be fully self hosted for complete control of every component, with a hosted service option as well, so full. | Full |
| Prebuilt Agents, Templates & PacksVocode ships open source examples, quickstart templates, and starter kits like a Next.js template and React software development kit, but not a library of prebuilt production agents, so partial. | Partial |
| Triggers & Channel CoverageVocode automates inbound and outbound calls through its hosted application programming interface and supports telephony, Zoom, and web, but campaign, dialer, and event triggers are build your own, so partial. | Partial |
| Model Flexibility & RoutingVocode integrates with every leading speech to text, text to speech, and language model provider so teams can assemble and swap any stack, with support for many languages, so full. | Full |
| APIs, SDKs & MCP ExtensibilityVocode is an open source library with Python, Node, and React software development kits plus an enterprise application programming interface, so extensibility is the core of the product, so full. | Full |
| Testing, Debugging & OptimizationVocode does not document a first class testing or evaluation framework for its agents, unlike some peer frameworks, so not documented. | Unable to verify |
| Browser & Computer UseVocode is a voice framework and has no browser or computer use capability, so not documented. | Unable to verify |
Pricing
Open source (free); hosted Developer ~$25/mo
Related vendors
- AgentOps — Agent observability and reliability platform with broad model and…
- Agno — High-performance agent runtime and framework (formerly Phidata) with…
- Apify — Cloud platform for web scraping and automation with 45,000+ prebuilt…
- Arcade — Authenticated tool calling platform and MCP runtime that handles…
- Arize AI — AI observability and evaluation platform that traces, evaluates, and…
- Braintrust — AI evaluation and observability platform with self-serve pricing,…