Toolhouse

Also known as: Toolhouse AI, Toolhouse.ai

Agent infrastructure · independent · Verified 2026-06-30

Platform to build, deploy, and run AI agents from a prompt, with built-in RAG, tools, and one-link MCP connectivity to thousands of services.

Toolhouse is a platform for building, deploying, and running AI agents without the usual boilerplate. In its Agent Studio, you describe what you want an agent to do in plain language (for example, scraping a URL and summarizing it), and Toolhouse assembles the agent, wires up the tools it needs, and turns it into a scalable, production-ready API that you can deploy in one click. It is pitched at both developers, who can drop its SDK into an existing stack, and less technical builders, who can ship working agents without writing code. The platform is used by companies including Cloudflare, NVIDIA, Groq, and Snowflake.

The core value is that Toolhouse handles the backend an agent needs out of the box: tool integrations, retrieval-augmented generation (RAG) with automatic setup, memory, caching, logging, debugging, and deployment. It ships a library of prebuilt tools: web search through Google, Tavily, and Exa; scraping through Firecrawl; databases such as MongoDB, Pinecone, and Supabase; and email through SendGrid. Its standout feature is MCP support: agents connect to thousands of remote Model Context Protocol servers with a single link, and MCP Discovery reads your prompt, detects which MCP servers the agent needs, and configures them automatically, removing the usual setup friction. Bundles let you control exactly which tools an agent can use.

For developers, the Toolhouse SDK provides tool definitions directly to LLM calls and works with OpenAI, Vercel AI, and LlamaIndex, with Python and TypeScript libraries and a CLI. Each agent gets a sandbox for testing and debugging, then deploys to a production-grade cloud with a unique API endpoint that can be called from applications or from services such as Zapier. On security, data is encrypted in transit and at rest, is never used to train models, and stays scoped to your workspace; every agent action is logged for audit. Enterprise plans add on-premises deployment, private data hosting, and compliance assistance.
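The pattern of supplying tool definitions to an LLM call and then executing the tool calls that come back can be sketched in plain Python. This is a minimal illustration and not the Toolhouse SDK: the `ToolRegistry` class, its methods, and the `word_count` tool are hypothetical names. The only outside convention assumed is the OpenAI-style function tool shape (`type`, `function.name`, `function.description`, `function.parameters`). The `allowed` filter loosely mirrors the idea of a Bundle restricting which tools an agent can see.

```python
import json
from typing import Any, Callable, Dict, List, Optional, Set


class ToolRegistry:
    """Hypothetical local registry; not part of any Toolhouse library."""

    def __init__(self) -> None:
        self._tools: Dict[str, Dict[str, Any]] = {}
        self._handlers: Dict[str, Callable[..., Any]] = {}

    def register(
        self,
        name: str,
        description: str,
        parameters: Dict[str, Any],
        handler: Callable[..., Any],
    ) -> None:
        # Store the definition in the OpenAI-style function tool shape.
        self._tools[name] = {
            "type": "function",
            "function": {
                "name": name,
                "description": description,
                "parameters": parameters,
            },
        }
        self._handlers[name] = handler

    def definitions(self, allowed: Optional[Set[str]] = None) -> List[Dict[str, Any]]:
        # Definitions to hand to the model, optionally limited to an allow-list.
        return [
            spec
            for name, spec in self._tools.items()
            if allowed is None or name in allowed
        ]

    def run(self, name: str, arguments_json: str) -> str:
        # Execute one tool call; the arguments arrive as a JSON string.
        if name not in self._handlers:
            raise KeyError(f"unknown tool: {name}")
        result = self._handlers[name](**json.loads(arguments_json))
        return json.dumps(result)


registry = ToolRegistry()
registry.register(
    name="word_count",
    description="Count the words in a piece of text.",
    parameters={
        "type": "object",
        "properties": {"text": {"type": "string"}},
        "required": ["text"],
    },
    handler=lambda text: {"words": len(text.split())},
)
```

In a real integration, the list from `registry.definitions()` would be passed along with the model call, and each tool call the model returns would be routed through `registry.run`, with the result appended to the conversation.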

Pricing is deliberately bootstrapper-friendly. A free Sandbox tier costs nothing and includes 50 agent runs a month with unlimited MCP calls, suited to getting started and to public projects. The Pro plan is $10 a month and adds 1,000 agent runs a month, private agents, built-in RAG, and the ability to switch between models. Enterprise is custom and adds dedicated support, on-premises deployment, private data hosting, and compliance help. An agent run is counted each time you call an agent to perform an operation and costs one credit; each run includes default-model LLM usage and unlimited MCP executions.
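The effective price per run follows directly from the published tiers. The sketch below is illustrative arithmetic only: the `Plan` class and the `cheapest_plan` helper are hypothetical, and because overage handling is not public, it assumes a plan simply cannot exceed its monthly run allowance.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Plan:
    name: str
    monthly_price_usd: float
    runs_per_month: int


# Figures taken from the published tiers; Enterprise is custom and omitted.
PLANS: List[Plan] = [
    Plan("Sandbox", 0.0, 50),
    Plan("Pro", 10.0, 1000),
]


def price_per_run(plan: Plan) -> float:
    # Effective cost of one run if the whole monthly allowance is used.
    return plan.monthly_price_usd / plan.runs_per_month


def cheapest_plan(expected_runs: int) -> Optional[Plan]:
    # Assumption: no overage billing, so a plan must cover every expected run.
    candidates = [p for p in PLANS if p.runs_per_month >= expected_runs]
    if not candidates:
        return None  # beyond the self-serve tiers
    return min(candidates, key=lambda p: p.monthly_price_usd)
```

At full utilization the Pro plan works out to $0.01 per run, and a workload of 5,000 runs a month fits neither self-serve tier under this assumption.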

Vendor details

Canonical URL

https://toolhouse.ai

Category

Agent infrastructure

Subcategory

Agent tooling and MCP

Funding status

Independent startup. Its agent platform is used by companies including Cloudflare, NVIDIA, Groq, and Snowflake.

Company status

independent

Use cases & customers

Primary use cases

agent building · tool integration · MCP connectivity · workflow automation · agent deployment

Target customers

developers · startups

Deployment options

SaaS · on-prem

Integrations

Ships a prebuilt tool library (web search via Google, Tavily, and Exa; scraping via Firecrawl; databases such as MongoDB, Pinecone, and Supabase; email via SendGrid) and connects to thousands of remote MCP servers with a single link. MCP Discovery automatically wires up the servers an agent needs. The SDK works with OpenAI, Vercel AI, and LlamaIndex, and agents can be called via API or Zapier.

In practice

You want an agent that reads a Notion page and creates Linear tickets, but wiring up both MCP servers is a pain. You describe the agent in Toolhouse and MCP Discovery detects and connects both servers for you.

You need an agent in production fast without building RAG, logging, and a deploy pipeline. Toolhouse gives you those out of the box and ships your agent as a one-click, scalable API endpoint.

Your non-technical ops team is buried in repetitive lookups. You describe the task in plain language, and Toolhouse builds an AI worker that runs it and returns the finished result, with no code required.

Capability coverage

7.5 / 14 capabilities · 54%

Integrations & Tool Calling (Full): Core product. A tool integration and tool calling platform: the SDK supplies tool definitions to LLM calls, a prebuilt tool library covers search, scraping, databases, and email, and an MCP client connects agents to thousands of remote MCP servers with one link, spanning 1,000-plus services.

Workflow Orchestration (Partial): Builds and runs agentic workflows in which an agent plans a task and chains multiple tools to complete it, with scheduling for recurring runs. It is prompt-defined agent execution rather than a full visual or code orchestration framework with branching and multi-agent control.

Knowledge Grounding & RAG (Full): Built-in retrieval-augmented generation with automatic setup lets agents ground on private or proprietary data without building a vector database pipeline. A first-class, out-of-the-box capability.

Human Oversight & Guardrails (Unable to verify): Public sources do not describe runtime content moderation, human approval workflows, or output guardrails; agents are presented as completing and delivering work without human review steps.

Security, Identity & Governance (Partial): Data is encrypted in transit and at rest, never used to train models, and scoped to the workspace. There is a full audit log of every agent action, tool access control via Bundles, and enterprise on-premises deployment and compliance assistance. Specific certifications, SSO, and RBAC are not detailed.

Observability & Auditability (Partial): Built-in logging and debugging tools, a sandbox for inspecting agent behavior, and a full audit log of every agent action. It is operational logging rather than a full tracing and analytics suite.

Memory & State Persistence (Partial): Provides built-in memory and caching as part of its out-of-the-box agent infrastructure, though it is not detailed as a full persistent, cross-session memory layer.

Deployment & Data Residency (Partial): One-click deployment to a production cloud with a scalable API endpoint and a CLI, plus enterprise on-premises deployment and private data hosting. The on-premises path is enterprise-gated rather than an open self-host option.

Prebuilt Agents, Templates & Packs (Partial): Offers agent templates, a library of prebuilt tools and MCP integrations, and example agents for common tasks as reusable starting points, though not a deep marketplace of installable production agents.

Triggers & Channel Coverage (Partial): Agents can be triggered by email or message, scheduled to run on a cadence, called via their API endpoint, or wired through Zapier, and they deliver results to the inbox. Channel coverage is task-oriented rather than full conversational omnichannel.

Model Flexibility & Routing (Partial): Supports switching between multiple models and using custom models, with a default model included. It is model selection rather than an automatic routing gateway.

APIs, SDKs & MCP Extensibility (Full): Python and TypeScript SDKs, a CLI, a REST API in which each agent is its own endpoint, and deep MCP support, including an MCP client to thousands of servers and MCP Discovery for automatic configuration, give comprehensive extensibility.

Testing, Debugging & Optimization (Partial): Includes built-in evals, a sandbox for testing, and debugging tools to refine agent behavior before going live. Evaluation is a built-in capability rather than a dedicated evaluation product.

Browser & Computer Use (Unable to verify): Provides web scraping through integrations; no interactive browser automation or computer use capability is documented.
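The headline score of 7.5 out of 14 is consistent with a simple weighting of the ratings listed above. The weights in this sketch are an assumption inferred from the totals, not a stated methodology: Full counts as 1, Partial as 0.5, and Unable to verify as 0.

```python
from typing import Dict, Tuple

# Assumed weights; this entry does not state how the score is computed.
WEIGHTS: Dict[str, float] = {"Full": 1.0, "Partial": 0.5, "Unable to verify": 0.0}

# Ratings as listed in the capability coverage section.
RATINGS: Dict[str, str] = {
    "Integrations & Tool Calling": "Full",
    "Workflow Orchestration": "Partial",
    "Knowledge Grounding & RAG": "Full",
    "Human Oversight & Guardrails": "Unable to verify",
    "Security, Identity & Governance": "Partial",
    "Observability & Auditability": "Partial",
    "Memory & State Persistence": "Partial",
    "Deployment & Data Residency": "Partial",
    "Prebuilt Agents, Templates & Packs": "Partial",
    "Triggers & Channel Coverage": "Partial",
    "Model Flexibility & Routing": "Partial",
    "APIs, SDKs & MCP Extensibility": "Full",
    "Testing, Debugging & Optimization": "Partial",
    "Browser & Computer Use": "Unable to verify",
}


def coverage(ratings: Dict[str, str]) -> Tuple[float, int, int]:
    # Sum the weighted ratings and express them as a rounded percentage.
    score = sum(WEIGHTS[rating] for rating in ratings.values())
    total = len(ratings)
    return score, total, round(100 * score / total)


score, total, percent = coverage(RATINGS)
print(f"{score} / {total} capabilities · {percent}%")
```

Under these weights, three Full and nine Partial ratings give 3 + 4.5 = 7.5, which matches the published figure.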

Recent platform changes

No recent material changes tracked yet.

Pricing

Free (50 runs/mo) · Pro $10/mo (1,000 runs) · Enterprise custom

Subscription tiers metered by agent runs per month (each run = 1 credit, includes default-model LLM usage and unlimited MCP executions)

Public, exact · Low variable cost · Free tier

Included quota

Sandbox free (50 agent runs/mo, unlimited MCP calls, public projects). Pro $10/mo (1,000 agent runs/mo, private agents, built-in RAG, model switching). Enterprise custom (dedicated support, on-prem deployment, private data hosting, compliance assistance).

What is public

Sandbox free and Pro $10/mo tiers with run limits and features are public; Enterprise is custom.

Billing mechanics

Flat monthly subscription with a bundled monthly agent run allowance (1 credit per run, default model LLM and unlimited MCP executions included). Enterprise is contracted.

Cost watchouts

Bundled default model usage keeps cost predictable, but exceeding the monthly run allowance forces a tier upgrade, and premium model selection or heavy private data hosting on Enterprise may add cost.

Variable cost rationale

Plans bundle a fixed number of agent runs per month at a flat price, with default-model LLM usage and unlimited MCP executions included in each run, so cost is largely predictable. Usage that exceeds the monthly run allowance requires a higher tier, and using premium, non-default models may add cost.

Additional watchouts

The monthly agent run allowance is the main limit; high-volume agents can exhaust the Pro plan's 1,000 runs and need an upgrade. Default-model LLM usage is included, but switching to premium models may carry a separate cost.

Overage / add-ons

Each agent run draws one credit from the monthly allowance and includes default model LLM usage plus unlimited MCP executions; exceeding the monthly run allowance means upgrading the plan.

Sales call required

No — self-serve available

Free / trial

Free Sandbox tier: 50 agent runs/month, unlimited MCP calls, for public projects.

Lowest paid plan

Pro $10/mo (1,000 agent runs, private agents, built-in RAG, model switching)

Commercial notes

Bootstrapper-friendly and self-serve, aimed at both developers and non-coders, with a free tier and a low $10 Pro plan, scaling to Enterprise for on-prem deployment and compliance. Used by Cloudflare, NVIDIA, Groq, and Snowflake.

Key ambiguities

Run overage handling, premium model cost, and Enterprise pricing are not public.

Cancellation / refund

Sandbox and Pro are self-serve subscriptions with standard cancellation. Enterprise terms are contractual.

Support SLA / resale

Community and self-serve support on Sandbox and Pro; dedicated support on Enterprise.

Missing data

Enterprise pricing, run overage rates, and whether premium model usage costs extra are not public.

Agentic AI Index

A directory and comparison resource for AI agent platforms, autonomous workflow tools, and enterprise agentic automation products.

© 2026 Agentic AI Index

Researched from public vendor sources. See Methodology.