
DataPelago
DataPelago is a technology company.
Financial History
DataPelago has raised $47.0M across 1 funding round.
Frequently Asked Questions
How much funding has DataPelago raised?
DataPelago has raised $47.0M in total across 1 funding round.

DataPelago is a technology company.
DataPelago has raised $47.0M across 1 funding round.
DataPelago has raised $47.0M in total across 1 funding round.
DataPelago is a Mountain View, California-based technology company founded in 2021 that develops the Universal Data Processing Engine (UDPE), a cloud-based software platform accelerating data processing for GenAI, analytics, AI model training, and cybersecurity workloads.[1][2][3] It serves enterprises handling massive datasets—structured, semi-structured, or unstructured—such as those in AI/GenAI pipelines, lakehouse analytics, and security systems, solving the core problem of slow, expensive processing that leaves 90% of data untapped by enabling lightning-fast performance on heterogeneous hardware (GPUs, FPGAs, CPUs) with zero application changes or vendor lock-in.[1][2][3] The platform integrates seamlessly with open-source tools like Spark, Trino, Gluten, Airflow, and clients like Tableau, delivering orders-of-magnitude better price/performance while supporting extract-filter-chunk-tokenize-embed workflows for RAG, fine-tuning, and inference.[1][3]
Early traction includes partnerships like Samsung SDS America, which tested promising performance and cost gains on AWS GPUs for unified GenAI/analytics pipelines, and security practitioners praising its modular plug-and-play for exponential data growth in AI-driven cybersecurity.[1][3] DataPelago recently fast-tracked SOC 2 and ISO 27001 compliance, signaling enterprise readiness amid rapid growth.[7]
DataPelago emerged from founders' deep expertise in hardware acceleration and data infrastructure, led by Rajan Goyal, Founder and CEO, who drew from cycles of compute demand outpacing processing power—similar to past networking and cloud bottlenecks.[4][5][6] Goyal's career includes pioneering deep packet inspection, domain-specific processors for internet-scale security, compression, packet movement, and Data Processing Units that revolutionized cloud infrastructure, amassing 500+ patents and building multi-billion-dollar businesses.[4][6] Co-founder and CPO Anand Iyer contributes to the pluggable DataApp layer integrating with Spark and Trino.[2]
The idea crystallized around "nonlinear thinking" to shatter data processing limits in the accelerated computing era, targeting the "first mile" of AI where data bottlenecks crush innovation; launched in 2021 from Mountain View, it quickly revealed UDPE to overcome x86 CPU scalability issues for GenAI and analytics.[2][4][5] Pivotal early validation came via customer pilots like Samsung SDS, proving value in days without re-engineering.[3]
DataPelago stands out in data processing through these key strengths:
DataPelago rides the GenAI data explosion trend, where exponential data volumes from multi-modal sources overwhelm traditional engines, amplified by AI/cybersecurity adoption demanding real-time, fresh insights at scale.[1][2][5] Timing is ideal post-2021 GPU/FPGA boom, addressing x86 limits as firms separate compute/storage for flexibility—aligning with lakehouse shifts and RAG/fine-tuning needs.[1][2][3] Market tailwinds include hyperscaler accelerated infra (e.g., AWS GPUs) and open frameworks like Substrait/Gluten, favoring plug-and-play innovators over rigid vendors.[1][3]
It influences the ecosystem by setting a new standard higher in the stack—between data lakes and query engines—enabling competitors like DataRobot, Cloudera, Domino, Dataiku to accelerate via integration, while democratizing GenAI for non-hyperscalers via cost-effective scale.[2]
DataPelago is primed to capture share in the $100B+ data/AI infrastructure market as GenAI matures beyond LLMs into agentic, multi-modal systems craving always-fresh, cheap processing.[1][2] Next: deeper enterprise wins (e.g., expanding Samsung-like pilots to production), FPGA/GPU hybrid optimizations, and potential acquisitions by cloud giants eyeing open acceleration layers. Trends like edge AI security and sovereign data sovereignty will amplify demand for its lock-in-free model. Its influence could evolve from accelerator to de facto standard, much like how Kubernetes unified containers—unlocking the "endless sea of data" for breakthrough intelligence, echoing Goyal's vision of turning impossible bottlenecks inevitable.[2][4][5]
DataPelago has raised $47.0M in total across 1 funding round.
DataPelago's investors include Anorak Ventures, CoinFund, Eclipse Ventures, Future Shape, Pathbreaker Ventures, Pillar VC, SOSV, Colin Carrier, Kevin Colas.
DataPelago has raised $47.0M across 1 funding round. Most recently, it raised $47.0M Series A in September 2024.
| Date | Round | Lead Investors | Other Investors |
|---|---|---|---|
| Sep 1, 2024 | $47.0M Series A | Anorak Ventures, CoinFund, Eclipse Ventures, Future Shape, Pathbreaker Ventures, Pillar VC, SOSV, Colin Carrier, Kevin Colas |