Back to vendors
V

Vocode

Visit site
Agent infrastructureindependentVerified 2026-07-01

Vocode is an open source developer framework for building voice agents on any AI stack, offering Python and Node libraries, a hosted calling service, and full model flexibility.

Vocode is an open source developer framework for building voice agents, not a finished product you deploy off the shelf. Backed by Y Combinator, it gives engineers the building blocks to create real time, streaming conversations with large language models and connect them to phone calls, Zoom meetings, web apps, and more. Its value is as the orchestration layer that handles the hard parts of voice, including streaming audio, knowing when a caller has stopped speaking, and gracefully handling interruptions, so a developer does not have to wire all of that together from scratch.

There are two ways to use it. Vocode Core is the open source library, available in Python with Node and React software development kits, which teams host themselves for complete control over every component. On top of that sits a hosted application programming interface, an enterprise grade service for managing agents on phone calls, so a team can embed an AI call center directly into its own product or operations. Most of what Vocode builds is open source and free, with paid developer and enterprise tiers for the hosted service.

The framework's strengths are flexibility rather than turnkey features. It integrates with every leading speech to text, text to speech, and language model provider, so teams can assemble the exact stack they want and swap components freely, and it supports many languages. Because it is open source, teams can self host to keep full control of data and cost, paying only their own provider bills with no platform fee on top. Tool calling lets agents fetch data and take actions through a developer's own systems. Higher level features like knowledge grounding, human oversight, memory, and formal testing are left for the builder to add.

The honest picture in 2026 is that Vocode is an early and influential library whose active development appears to have slowed, with the project seeking community maintainers. That makes it a reasonable choice for teams that want maximum control and are comfortable owning the stack, but a riskier foundation for a new production system than more actively maintained frameworks. Because Vocode is developer infrastructure rather than an end to end agent, it scores low on the agent capability axes here: its real strengths are model flexibility, self hosted deployment, and open source extensibility.

Vendor details

Canonical URL

https://vocode.dev

Category

Agent infrastructure

Company status

independent

Use cases & customers

In practice

A developer uses Vocode Core to build a custom phone agent, wiring in their chosen speech to text, language model, and text to speech providers, then self hosts it for full control of data.

A product team embeds Vocode's hosted calling service into its own software to place and receive automated calls, programming the agents to run tasks through the team's existing systems.

A startup prototypes a voice based assistant on Vocode's open source library, using its streaming and interruption handling so early demos feel natural without building the audio pipeline itself.

Sources & related URLs

Research notes

Score 5.5 (3F/5P/6N). Category: AGENT INFRASTRUCTURE (categorized here, not Voice agent - it's an OPEN-SOURCE DEVELOPER FRAMEWORK/LIBRARY, same logic as Rime; the 14-axis AGENT score understates it as infrastructure). Researched in Voice lane. Vocode = open-source Python library/framework for building voice-based LLM agents; YC-backed. Vocode Core (OSS library, Python + Node/React SDKs, self-hosted, build on ANY AI stack) + Vocode API (hosted enterprise-grade call-management API on top of Core). Value = ORCHESTRATION LAYER handling streaming audio, endpointing, interruptions, streaming+turn-based conversations. Fulls: Dep (SELF-HOSTED open-source + hosted options = full data/deployment control), Model (integrates ALL leading STT/TTS/LLM providers, build on any stack, swap freely, multilingual; MAX model flexibility - headline for a framework), Ext (OPEN-SOURCE library + Python/Node/React SDKs + enterprise API + docs; extensibility IS the product - headline). Partials (5): Int (tool calling plug-in-your-APIs + provider integrations STT/TTS/LLM/telephony/Zoom; business/CRM integrations build-your-own), Orch (conversational orchestration abstractions: streaming/turn-based + endpointing + interruptions; code-level, no visual builder/higher-level engine), Obs (hosted service performance analytics; basic), Pack (OSS examples/quickstarts + Next.js template + React SDK; not prebuilt production agents), Trig (inbound + outbound call automation via API + telephony/Zoom/web; campaigns/dialer/event-triggers build-your-own). N (6): Know (no first-class RAG/knowledge - build-your-own), HITL (no first-class oversight/handoff/guardrails), Sec (no named certs; self-hosted = implementer's responsibility), Mem (no first-class memory beyond in-call streaming state), Eval (no first-class Vocode-specific testing/eval framework documented, unlike LiveKit's judges), Comp (no browser/computer use). **VENDOR-HEALTH FLAG [CONFIRMED Jul 2026 via primary source]: original team moved on - founder GitHub bio now 'Working on something new, formerly @Vocode'; official vocodedev/vocode-core README explicitly 'We're actively looking for community maintainers'; still Seed-stage only (Liquid 2 Ventures), no growth funding. Corroborates earlier Dograh (Apr 2026) read that 'active development has largely stopped' and building new production = 'inheriting technical debt without active maintainer.' NET: real, still-installable OSS library but low-momentum community-maintenance mode. KEEP canonical (genuine usable OSS infra) but do NOT feature (isFeatured stays false); position as stable-but-low-activity, not a scaling commercial platform.** Pricing free (OSS Core free/self-hosted) + hosted tiers: Free / Developer ~$25/mo / Enterprise custom. No per-minute platform fee (OSS) but YOU bear all STT/TTS/LLM/telephony provider bills = HIGH variable cost exposure. entry under_20 (free OSS). public_partial. confidenceLevel medium (product well-documented GitHub/docs/YC BUT maintenance/viability uncertain). Peer OSS voice frameworks: Dograh (visual builder), Pipecat, LiveKit Agents (more actively maintained per critics).

Capability coverage

5.5 / 14 capabilities · 39%

Integrations & Tool CallingVocode supports tool calling so agents can fetch data and take actions through a developer's own systems, and integrates telephony, Zoom, and web, but business and customer relationship management integrations are build your own, so partial. Partial
Workflow OrchestrationVocode acts as the orchestration layer for voice, managing streaming and turn based conversations with endpointing and interruption handling, but as a code framework with no visual workflow builder, so partial. Partial
Knowledge Grounding & RAGVocode is a framework that does not provide first class knowledge grounding or retrieval, which a developer must add themselves, so not documented. Unable to verify
Human Oversight & GuardrailsVocode does not provide first class human oversight, handoff, or guardrail features, leaving these for the builder to implement, so not documented. Unable to verify
Security, Identity & GovernanceVocode is a self hosted open source framework with no named security certifications, so governance is the implementer's responsibility and not documented as first class, so not documented. Unable to verify
Observability & AuditabilityVocode's hosted service provides analytics to monitor agent performance, but deep conversation intelligence and auditability are not documented, so partial. Partial
Memory & State PersistenceVocode manages conversation state within a streaming call but does not provide first class persistent or long term memory, so not documented. Unable to verify
Deployment & Data ResidencyVocode is open source and can be fully self hosted for complete control of every component, with a hosted service option as well, so full. Full
Prebuilt Agents, Templates & PacksVocode ships open source examples, quickstart templates, and starter kits like a Next.js template and React software development kit, but not a library of prebuilt production agents, so partial. Partial
Triggers & Channel CoverageVocode automates inbound and outbound calls through its hosted application programming interface and supports telephony, Zoom, and web, but campaign, dialer, and event triggers are build your own, so partial. Partial
Model Flexibility & RoutingVocode integrates with every leading speech to text, text to speech, and language model provider so teams can assemble and swap any stack, with support for many languages, so full. Full
APIs, SDKs & MCP ExtensibilityVocode is an open source library with Python, Node, and React software development kits plus an enterprise application programming interface, so extensibility is the core of the product, so full. Full
Testing, Debugging & OptimizationVocode does not document a first class testing or evaluation framework for its agents, unlike some peer frameworks, so not documented. Unable to verify
Browser & Computer UseVocode is a voice framework and has no browser or computer use capability, so not documented. Unable to verify

Recent platform changes

No recent material changes tracked yet.

Pricing

Open source (free); hosted Developer ~$25/mo

Public — partialHigh variable cost
Verified 2026-07-01

Contact us

Found a vendor we missed? Have feedback on the index? We'd love to hear from you.

Agentic AI Index

A directory and comparison resource for AI agent platforms, autonomous workflow tools, and enterprise agentic automation products.

© 2026 Agentic AI Index

3801 N Capital of Texas Hwy, Ste E240 · Austin, TX 78746

Researched from public vendor sources. See Methodology.