Defog.ai
Defog.ai is an enterprise text to SQL analyst, powered by its open source SQLCoder model, that embeds in apps, deploys on premises, and answers data questions in plain language.
Defog.ai is an enterprise data analysis tool that lets business teams ask questions in plain language and get back SQL, tables, charts, and answers. Founded in 2023 by Medha Basu and Rishabh Srivastava and backed by Y Combinator and Script Capital, the company is best known for SQLCoder, its own family of open source large language models fine tuned for turning natural language into SQL. SQLCoder has been downloaded more than one hundred thousand times and, at points, outperformed larger general models on the company's own SQL evaluation benchmark.
Defog is built around a privacy first architecture. It never accesses or moves the underlying data. Instead it uses only metadata, such as table and column descriptions and example queries added through a cookbook, to fine tune models to a specific business. Well defined schemas can improve query accuracy substantially, and the models are designed to be interpretable and explainable so a user can trust and inspect the SQL behind an answer. Results appear first as tables and can be switched to charts or raw SQL, with an advanced mode for deeper work.
Teams can deploy Defog in the cloud or on premises to meet privacy requirements, and the open source SQLCoder model and the open source SQL evaluation framework can be run and inspected directly. Defog embeds inside an application as a conversational widget so a company's own users can ask data questions without leaving the product. It also offers agents that handle multi step workflows across SQL, Python, and R with a human in the loop, and a deep research capability for internal data. It connects to the major databases and warehouses including Postgres, Snowflake, Redshift, and Databricks.
Defog is SOC 2 Type II compliant, and its metadata only design gives it a strong privacy posture for regulated buyers. The system learns from user feedback and preferences and adapts its responses over time. A free plan includes application programming interface access, with paid tiers reported to start higher for teams and enterprises. Company records indicate a merger or acquisition event in April 2025, and while the product and its open source models remain active, the standalone commercial roadmap is less certain, so buyers should confirm current status directly.
Vendor details
Canonical URL
https://defog.ai
Category
Data analyst agent
Company status
unclear
Use cases & customers
In practice
A software company embeds the Defog widget in its product so end users can ask questions of their own data in plain language, getting answers without the vendor ever accessing the underlying database.
A regulated enterprise deploys Defog on premises, fine tunes the SQLCoder model on its own schema through the cookbook, and lets analysts ask questions in plain language while data never leaves their environment.
An analyst hands Defog a complex multi step task across SQL and Python, reviews the interpretable output at each human in the loop checkpoint, and validates accuracy against the open source evaluation framework.
Sources & related URLs
Research notes
Score 7.0 (3F/8P/3N). companyStatus UNCLEAR [CORRECTED Jul 2026, was 'acquired']: only getlatka reports an April 2025 'M&A Offer' (an offer, NOT a confirmed close); Tracxn shows NO completed acquisition; defog.ai is still live and selling and the SQLCoder/SQLEval open source models are actively maintained; acquirer never confirmed. Downgraded acquired->unclear pending confirmation; kept canonical (product plus open source models active and commercially real). Founders Medha Basu plus Rishabh Srivastava (YC W23); ~$2.7M seed; Singapore/SF. Enterprise text to SQL analyst on open source SQLCoder; embeds in apps; deploys cloud or on premises. Fulls: Dep (deploy cloud or on premises plus open source self hostable SQLCoder model), Ext (open source SQLCoder plus open source SQLEval plus API plus Python client; open source plus SDK counts), Eval (SQLEval open source accuracy benchmark framework). Partials: Int (major DB and warehouse connectors plus a few app integrations plus API), Orch (multi step SQL/Python/R workflows and deep research, human in the loop, not fully autonomous), Know (metadata fine tuning plus cookbook schema definitions; not a governed semantic layer), HITL (human in the loop agents), Sec (SOC 2 Type II from own blog plus metadata only privacy architecture; single named cert, no documented SSO/RBAC), Obs (interpretable/explainable, SQL view, switchable results), Mem (learns from feedback plus metadata fine tuning persistence), Trig (embeddable widget plus API; no proactive push documented). N: Pack (predefined datasets/cookbook, no agent marketplace), Model (own SQLCoder single model, no multi provider routing), Comp. FLAG: the April 2025 event is an unconfirmed OFFER, not a verified acquisition; revisit if a close is announced.
Capability coverage
7.0 / 14 capabilities · 50%
| Integrations & Tool CallingDefog connects to the major SQL databases and warehouses including Postgres, Snowflake, Redshift, and Databricks plus a few application integrations and an application programming interface, solid coverage short of a broad marketplace, so partial. | Partial |
|---|---|
| Workflow OrchestrationDefog runs agents that handle multi step workflows across SQL, Python, and R and a deep research capability, agentic and multi step but human in the loop rather than fully autonomous, so partial. | Partial |
| Knowledge Grounding & RAGDefog fine tunes models on customer metadata and cookbook schema and column descriptions to improve accuracy, a schema and definition grounding rather than a governed semantic layer, so partial. | Partial |
| Human Oversight & GuardrailsDefog offers human in the loop agents that let analysts collaborate on and refine multi step workflows and outputs, a review checkpoint rather than enforced runtime guardrails, so partial. | Partial |
| Security, Identity & GovernanceDefog is SOC 2 Type II compliant and uses a metadata only architecture that never accesses the underlying data, a strong single named certification without documented single sign on or role based controls, so partial. | Partial |
| Observability & AuditabilityDefog produces interpretable and explainable output and lets users switch between tables, charts, and the underlying SQL, giving query transparency rather than full execution tracing, so partial. | Partial |
| Memory & State PersistenceDefog learns from user feedback and preferences and persists metadata fine tuning, a learned and definition level persistence rather than full cross session memory, so partial. | Partial |
| Deployment & Data ResidencyDefog can be deployed in the cloud or on premises to meet privacy requirements and its SQLCoder model is open source and self hostable, so full. | Full |
| Prebuilt Agents, Templates & PacksDefog offers predefined datasets and a cookbook but no browsable marketplace of cloneable prebuilt agents or a template pack library, so not documented. | Unable to verify |
| Triggers & Channel CoverageDefog embeds as a conversational widget inside an application and exposes an application programming interface, covering embeddable and programmatic channels short of proactive omnichannel push, so partial. | Partial |
| Model Flexibility & RoutingDefog is powered by its own SQLCoder family of models and no multi provider model selection or routing gateway could be verified, so not documented. | Unable to verify |
| APIs, SDKs & MCP ExtensibilityDefog open sources its SQLCoder model and its SQL evaluation framework and provides an application programming interface and a Python client, meeting the open source plus toolkit bar, so full. | Full |
| Testing, Debugging & OptimizationDefog builds SQLEval, an extensible open source evaluation framework that measures the accuracy of its text to SQL models, so full. | Full |
| Browser & Computer UseDefog is a text to SQL data analysis tool with no browser or computer use capability, as expected for this category, so not documented. | Unable to verify |
Pricing
Free plan (API access); paid from ~$599/mo (third party)
Related vendors
- AskYourDatabase — AskYourDatabase is a conversational data analyst that turns plain…
- Athenic AI — Athenic AI is an agentic data analyst that connects business apps…
- BlazeSQL — BlazeSQL is an AI data analyst that learns your SQL database, turns…
- Brewit — Brewit is a conversational business intelligence agent that turns…
- Buster — Buster is an open source, artificial intelligence native data…
- Datapad — Datapad is an autonomous data analyst agent that connects fifty plus…