Tavily
Also known as: Tavily Search, Tavily API
Real time web search, extraction, crawling, and research API built for AI agents, returning clean LLM ready content that keeps agents grounded.
Tavily is a web access layer for AI agents, a single API that gives language models real time search, content extraction, web crawling, and deep research. Unlike a traditional search engine built for humans, Tavily returns clean, structured, LLM ready content, ranked snippets, summaries, and citations rather than raw HTML, which keeps agents grounded in fresh information and reduces hallucinations. Founded in 2024 and headquartered in New York with offices in Tel Aviv and Abu Dhabi, it serves more than a million developers, handles over 100 million requests a month at 99.99 percent uptime, and returns search results with a median latency around 180 milliseconds.
The API exposes a few focused endpoints. Search returns ranked, relevance filtered results, with a basic depth for speed and an advanced depth that scrapes more sources, plus an optional synthesized answer. Extract pulls clean markdown text from up to twenty five URLs per call, stripping ads and navigation. Crawl and Map traverse whole sites, and a Research endpoint runs multi step deep research. Tavily reports leading results on retrieval benchmarks including SimpleQA at 93.3 percent. The company grew up inside the LangChain ecosystem, maintains the official langchain-tavily package, and is the default web search tool in many LangChain and LangGraph templates.
Developers integrate through Python and JavaScript SDKs and an official MCP server, with drop in integrations for OpenAI, Anthropic, Groq, LangChain, and LlamaIndex, and availability on the Databricks MCP Marketplace, JetBrains, and IBM watsonx. In February 2026, Nebius Group, an AI infrastructure company, announced an agreement to acquire Tavily for $275M. The founding team joined Nebius and the product continues under the Tavily brand with the same API, so existing integrations keep working, though it is now part of a larger AI cloud company rather than an independent startup.
Pricing runs on a credit system that reflects operation type and depth: a basic search is one credit, an advanced search two, extract is one credit per five URLs, and map is one credit per ten pages. A free tier provides 1,000 credits a month with no credit card, pay as you go costs about $0.008 per credit, and subscription plans lower the unit cost to roughly $0.005 to $0.0075 per credit, with Project and Enterprise tiers for higher limits. Tavily also offers its service free to students. Because cost scales with call volume and depth, agents that hit search on every loop should price their real mix of calls.
Vendor details
Canonical URL
https://www.tavily.com
Category
Agent infrastructure
Subcategory
Web search and retrieval
Funding status
Founded in 2024, headquartered in New York with offices in Tel Aviv and Abu Dhabi, with around $25M raised. In February 2026, Nebius Group, an AI infrastructure company, announced an agreement to acquire Tavily for $275M; the founding team joined Nebius and the product continues under the Tavily brand with the same API. Serves over a million developers and handles 100M+ monthly requests.
Company status
acquired
Use cases & customers
Primary use cases
Target customers
Deployment options
Integrations
Integrated through Python and JavaScript SDKs and an official MCP server, with drop in integrations for OpenAI, Anthropic, Groq, LangChain, and LlamaIndex. The default web search tool in many LangChain and LangGraph templates, available on the Databricks MCP Marketplace and integrated into JetBrains and IBM watsonx.
In practice
Your agent answers questions from stale model knowledge and sometimes invents facts. You add a Tavily search call to the loop, so it pulls fresh, ranked web context and citations before the model responds.
You need clean article text from a list of URLs, not messy HTML. Tavily Extract returns markdown for up to twenty five pages per call, stripping ads and navigation so the model ingests only the content.
You are on LangGraph and want web search without wiring a provider yourself. Tavily is the default search tool in many templates and ships an MCP server, so you drop it in and start grounding answers.
Sources & related URLs
Related / legacy domains
Research sources
Capability coverage
4.0 / 14 capabilities · 29%
| Integrations & Tool CallingDesigned as the web access tool agents call, integrated first class across the ecosystem: the default web search tool in many LangChain and LangGraph templates, an official MCP server, drop in integrations for OpenAI, Anthropic, Groq, and LlamaIndex, and listings on the Databricks MCP Marketplace, JetBrains, and IBM watsonx. | Full |
|---|---|
| Workflow OrchestrationThe Research endpoint orchestrates a multi step deep research pipeline that plans, searches, extracts, and synthesizes, though it does not orchestrate user defined agent workflows. | Partial |
| Knowledge Grounding & RAGCore product. Real time web retrieval that grounds agents in fresh information, returning ranked, relevance filtered, LLM ready content with citations purpose built for RAG, reducing hallucinations. | Full |
| Human Oversight & GuardrailsA retrieval API with no human review, approval, or content guardrail features. | Unable to verify |
| Security, Identity & GovernanceOffers secure APIs with API key authentication and an Enterprise tier, but no compliance certifications, SSO, or access governance controls are documented publicly. | Unable to verify |
| Observability & AuditabilityProvides a usage and credits dashboard with rate limit controls and visibility into API consumption, though it is usage metering rather than full agent tracing. | Partial |
| Memory & State PersistenceA stateless retrieval API; it provides no agent memory or state persistence layer. | Unable to verify |
| Deployment & Data ResidencyDelivered only as a cloud SaaS API with no open source self host, on premises, or in VPC deployment option documented. | Unable to verify |
| Prebuilt Agents, Templates & PacksProvides code examples in its docs but no prebuilt agents, installable packs, or its own template marketplace. | Unable to verify |
| Triggers & Channel CoverageA request and response API with no triggers, scheduling, webhooks, or conversational channel coverage. | Unable to verify |
| Model Flexibility & RoutingA model agnostic data source whose output any LLM can consume, but it provides no model routing or model selection layer of its own. | Unable to verify |
| APIs, SDKs & MCP ExtensibilityA clean REST API with focused endpoints, official Python and JavaScript SDKs, an official MCP server, cURL access, and an llms.txt index for agent consumption give comprehensive extensibility. | Full |
| Testing, Debugging & OptimizationPublishes retrieval benchmarks but offers no user facing testing, debugging, or optimization product. | Unable to verify |
| Browser & Computer UseCrawl and Map traverse and read websites, but this is web retrieval rather than interactive browser automation or computer use. | Unable to verify |
Pricing
Free 1,000 credits/mo · pay as you go $0.008/credit · subscription $0.005–$0.0075/credit
Credit system metered by operation type and depth (basic search 1 credit, advanced 2, extract 1 credit/5 URLs, map 1 credit/10 pages); pay as you go or subscription, plus Project and Enterprise tiers
Included quota
Free tier 1,000 credits/month (no card). Pay as you go $0.008/credit. Subscription plans reduce unit cost to ~$0.005–$0.0075/credit. Project and Enterprise tiers add higher rate limits. Free for students. Credit costs: basic search 1, advanced search 2, extract 1/5 URLs, map 1/10 pages.
What is public
The credit model, free tier, per credit pay as you go rate, subscription unit cost range, and per operation credit costs are public; named subscription tier totals and Enterprise pricing are not fully itemized.
Billing mechanics
Metered credits by operation type and depth, drawn from a monthly subscription allowance or paid per credit on pay as you go. Project and Enterprise tiers raise limits.
Cost watchouts
Advanced depth doubles search cost, the optional answer synthesis and extract calls add credits, and per loop search in busy agents multiplies spend; a heavy research agent can run far above the headline per credit rate.
Variable cost rationale
Cost scales directly with the number and depth of calls. Agents that hit search on every reasoning loop, use advanced depth, or extract many pages accumulate credits quickly. At 100,000 basic searches a month, spend is roughly $800 on pay as you go before subscription discounts.
Additional watchouts
Advanced search depth doubles the search credit cost, and high frequency search loops accumulate credits fast; price your real mix of search, extract, and research calls rather than the cheapest line item.
Overage / add-ons
Usage is metered in credits by operation type and depth; you draw down monthly subscription credits or pay per credit on pay as you go. Exceeding a plan means buying more credits or moving to a higher tier.
Sales call required
No — self-serve available
Free / trial
Free tier: 1,000 API credits/month, no credit card. Free for students.
Lowest paid plan
Pay as you go at $0.008/credit; subscription plans reduce to $0.005–$0.0075/credit
Commercial notes
Self serve, developer first with a generous free tier and agent native onboarding, popular as the default LangChain and LangGraph web search tool. Now part of Nebius following the February 2026 acquisition.
Key ambiguities
Named subscription tier prices and Enterprise pricing beyond the per credit unit range are not all public.
Cancellation / refund
Pay as you go and self serve subscriptions have standard cancellation. Enterprise terms are contractual.
Support SLA / resale
Community and self serve support on lower tiers; higher limits and support on Project and Enterprise.
Missing data
Named subscription tier dollar amounts and Enterprise pricing are not fully public.
Related vendors
- AgentOps — Agent observability and reliability platform with broad model and…
- Agno — High-performance agent runtime and framework (formerly Phidata) with…
- Apify — Cloud platform for web scraping and automation with 45,000+ prebuilt…
- Arcade — Authenticated tool calling platform and MCP runtime that handles…
- Arize AI — AI observability and evaluation platform that traces, evaluates, and…
- Braintrust — AI evaluation and observability platform with self-serve pricing,…