Datavolo
Datavolo is a technology company.
Financial History
Datavolo has raised $21.0M across 1 funding round.
Frequently Asked Questions
How much funding has Datavolo raised?
Datavolo has raised $21.0M in total across 1 funding round.
Datavolo is a technology company.
Datavolo has raised $21.0M across 1 funding round.
Datavolo has raised $21.0M in total across 1 funding round.
Datavolo has raised $21.0M in total across 1 funding round.
Datavolo's investors include Ballistic Ventures, Citi Ventures, General Catalyst, Greylock, Mayfield, Owl Rock Capital Partners, Frederic Kerrest, Michael Fey, Sohaib Abbasi.
Datavolo is a technology company building multimodal data pipelines for AI, specializing in fast, flexible, reusable infrastructure that processes unstructured and structured data without custom coding.[1][2][4] It serves enterprises in tech, finance, research, and other data-heavy sectors, solving the problem of inefficient, single-use point-to-point data extraction tools that fail on unstructured data like documents, logs, and multimedia—enabling secure, scalable pipelines for generative AI, RAG applications, and real-time insights.[1][2][3] Powered by Apache NiFi, Datavolo offers a no-code, containerized service with APIs for vector databases, centralized governance, and observability, generating revenue via tiered subscriptions based on data volume.[1][2][4] Snowflake announced its acquisition of Datavolo on an unspecified date in 2025 (noted as "1. Snowflake" in announcements), accelerating multimodal integration for AI, analytics, and apps within Snowflake's ecosystem.[3]
The platform's growth momentum includes investments from Citi Ventures, highlighting its enterprise-grade compliance and scale for GenAI, plus testimonials of 10x faster feature delivery and million-dollar cost savings for regulated customers.[2][4]
Datavolo was founded in 2023 by Joe Witt (CEO) and Luke Roquet (COO), both veterans in data infrastructure.[2][4] Witt, through his prior startup Onyara and NSA collaboration, spearheaded Apache NiFi—an open-source tool for scalable unstructured data ingestion, released in 2006 and now adopted globally by thousands of agencies and companies.[2][4] The idea emerged from enterprise pain points: existing tools handle only structured data, limited formats, and lack compliance at scale, forcing costly in-house builds for unstructured data critical to LLMs and GenAI.[2] Early traction built on NiFi's proven foundation, with Citi Ventures investing for its no-code pipelines supporting diverse files, transformations, and AI models.[2] Pivotal moments include rapid roadmap expansion for LLM app tools, natural language dataflow creation, and the Snowflake acquisition to embed Datavolo's capabilities into a leading data platform.[2][3]
Datavolo rides the explosive growth of generative AI and multimodal data demands, where 80-90% of enterprise data is unstructured yet underserved by legacy ETL tools, timing perfectly with RAG, real-time analytics, and agentic AI trends.[1][2][3] Market forces like rising compliance needs, exploding data volumes from LLMs, and cloud-native shifts favor its NiFi-powered, open extensibility over rigid vendors.[2][3] By simplifying streaming (Flink/Kinesis) and bi-directional integrations with Snowflake/data lakes, it influences the ecosystem toward unified platforms, empowering data teams to focus on AI innovation rather than plumbing—now amplified via Snowflake's reach for broader adoption in AI apps and analytics.[3][4]
Post-Snowflake acquisition, Datavolo will embed its pipelines deeply into Snowflake's platform, unlocking seamless multimodal ingestion for customers' AI workloads, with expansions in LLM tools and AI-assisted engineering.[3] Trends like real-time unstructured processing and natural language data ops will propel it, evolving its influence from niche innovator to core enabler in enterprise GenAI stacks. As data remains the lifeblood of AI breakthroughs, Datavolo positions Snowflake—and adopters—to harness unstructured "multimodal magic" at scale, freeing focus for incredible work.[4]
Datavolo has raised $21.0M across 1 funding round. Most recently, it raised $21.0M Series A in April 2024.
| Date | Round | Lead Investors | Other Investors |
|---|---|---|---|
| Apr 1, 2024 | $21.0M Series A | Ballistic Ventures, Citi Ventures, General Catalyst, Greylock, Mayfield, Owl Rock Capital Partners, Frederic Kerrest, Michael Fey, Sohaib Abbasi |