Stemma has raised $5.0M in total across 1 funding round.
Stemma's investors include A Capital, Addition, AIX Ventures, Betaworks Ventures, E1 Ventures, Factorial, Lux Capital, Otherwise Fund, Sequoia Capital, Summit Partners, Thirty Five Ventures, Augusto Marietti.
Stemma was an early-stage technology company that built a fully managed data catalog, enhancing the open-source Amundsen project to serve as an organization's source of truth for data discovery and trust.[1][2] It targeted enterprises struggling with data volume and validity, offering automated metadata generation, personalized search experiences, and enterprise-grade management to solve problems like manual data validation via Slack or wikis.[1][2] Stemma served data-heavy companies such as iRobot, Convoy, SoFi, Brex, Snap, and Asana, achieving early traction with a $4.8M seed round in 2021 led by Sequoia and adoption by 35+ firms using its underlying open-source tech.[1][3] In 2023, Stemma was acquired by Teradata, integrating its technology into Teradata's platform to reach larger enterprises with richer metadata and AI-driven data cataloging.[3]
Stemma originated from challenges at Lyft, where explosive data growth overwhelmed manual processes for validation and discovery, prompting co-founder and CEO Mark Grover—alongside Lyft engineers—to create the open-source Amundsen data catalog in 2018.[1][3] Amundsen quickly gained traction, with 750 weekly users at Lyft (80% of data users) and adoption by 35 companies including Brex, Snap, and Asana.[1][3] In fall 2020, Grover teamed up with co-founder and CTO Dorian Johnson to commercialize a managed version, launching Stemma amid COVID as a fully remote startup with $4.8M seed funding from Sequoia and others.[1] By 2023, after serving customers like iRobot and Convoy, the founders joined Teradata via acquisition to scale impact in enterprise data trust.[3]
Stemma stood out in data cataloging through these key strengths:
Stemma rode the data democratization trend, where exploding data volumes in enterprises demanded trustworthy, searchable catalogs to fuel analytics and AI—problems Amundsen solved openly at Lyft and beyond.[1][3] Timing was ideal post-2020, as remote work and cloud data growth amplified discovery challenges, positioning Stemma to commercialize proven open-source tech amid rising data governance needs.[1] Market forces like enterprise AI adoption favored it, with larger orgs facing acute "data trust" issues that Stemma mitigated, influencing the ecosystem by accelerating Amundsen's reach and paving the way for platform integrations like Teradata's.[3] Its acquisition underscores how startups enhance incumbents, broadening access to advanced cataloging in a \(10B+\) data management market.
Now part of Teradata, Stemma's tech will evolve through deeper AI integrations and metadata richness, targeting massive enterprises with 2-year sales cycles that startups struggle to crack alone.[3] Trends like multimodal AI and real-time data governance will shape its path, amplifying Teradata's innovation in cataloging. Its influence grows from niche disruptor to enterprise staple, proving open-source roots can scale via strategic exits—echoing how Stemma turned Lyft's headache into industry-wide truth for data chaos.[1][3]
Stemma has raised $5.0M across 1 funding round. Most recently, it raised $5.0M Seed in June 2021.
| Date | Round | Lead Investors | Other Investors |
|---|---|---|---|
| Jun 1, 2021 | $5.0M Seed | A Capital, Addition, AIX Ventures, Betaworks Ventures, E1 Ventures, Factorial, Lux Capital, Otherwise Fund, Sequoia Capital, Summit Partners, Thirty Five Ventures, Augusto Marietti, Brad Bitler, Clement Benoit, Florian Douetteau, Greg Brockman, Ronny Conway, Thibaud Elziere |