Gremlin Inc.
Gremlin Inc. is a company.
Financial History
Leadership Team
Key people at Gremlin Inc..
Gremlin Inc. is a company.
Key people at Gremlin Inc..
Key people at Gremlin Inc..
Gremlin Inc. is a SaaS platform specializing in chaos engineering and reliability management, enabling engineering teams to proactively test and strengthen complex systems by simulating failures. It serves large enterprises in sectors like retail, finance, technology, and e-commerce—including customers such as GrubHub, HEB, JPMorgan, Mailchimp, Target, Twilio, Under Armour, and Walmart—solving the problem of unexpected outages that cause revenue loss and poor customer experiences by identifying weaknesses through safe, controlled experiments.[1][2][3][4][8] With $15.4 million in revenue and 75 employees based in San Jose, California, Gremlin has shown growth through product expansions like Custom Reliability Scores (2023), Failure Flags for serverless (2023), and the Reliability Management Platform (2022), backed by investors including Amplify Partners, Index Ventures, and Redpoint VC.[1][4]
Gremlin was founded by industry veterans who served as "Call Leaders" at Amazon and Netflix, roles focused on resolving global outages and engineering resilient systems. Drawing from this expertise, the company launched as the world's first hosted chaos engineering service, evolving from open-source inspirations like Netflix's Chaos Monkey into a comprehensive platform.[4][5] Key milestones include raising an $18 million Series B in February 2018 to introduce Application Level Fault Injection (ALFI), launching free Chaos Monkey as-a-Service in 2021, and establishing the Gremlin Community for reliability resources that same year, building early traction with enterprise adopters.[4]
Gremlin rides the wave of chaos engineering and reliability engineering, a discipline born from Netflix and Amazon to combat increasing system complexity in cloud-native, microservices, and AI infrastructures where outages cost millions. Timing is ideal amid rising demands for 99.99% uptime, as market forces like Kubernetes proliferation, serverless adoption, and AI reliability needs amplify failure risks—Gremlin's tools proactively validate disaster recovery, influencing the ecosystem through reports like the State of Chaos Engineering and community education that standardize resilience practices across finance, retail, and tech.[3][4][6][8]
Gremlin is poised to expand as AI and edge computing heighten reliability stakes, with trends like automated reliability scoring and serverless testing driving deeper enterprise penetration via subscriptions and certifications. Expect further integrations (e.g., more AWS/Dynatrace expansions) and platform evolution toward full-stack observability, solidifying its role in preventing costly disruptions. As the pioneer turning failure into resilience, Gremlin's mission to build a more reliable internet positions it to lead in an era where downtime is unacceptable.[4][6][8]