CrowdFlower Inc.
CrowdFlower Inc. is a company.
Financial History
Leadership Team
Key people at CrowdFlower Inc..
CrowdFlower Inc. is a company.
Key people at CrowdFlower Inc..
Key people at CrowdFlower Inc..
CrowdFlower Inc., founded in 2008 and later rebranded as Figure Eight, developed a human-in-the-loop machine learning platform that combined crowdsourced human labor with AI to create high-quality training data from text, images, audio, and video.[1][3] It served data science teams at major companies like Google, Facebook, Autodesk, and Microsoft, solving the critical problem of labeling and curating massive datasets needed to train accurate ML models for applications in autonomous vehicles, natural language processing, computer vision, sentiment analysis, and search relevance.[2][4][3] The platform scaled to over 10 billion training data labels across industries including automotive, healthcare, finance, and retail, raising $58 million in funding before achieving strong growth with reported revenue nearing $41 million and nearly tripling in one year pre-acquisition.[1][2][4]
CrowdFlower emerged from Dolores Labs, started in 2007-2008 in San Francisco's Mission District by founder Lukas Biewald at age 25 during the financial crisis, initially operating from a loft before quickly outgrowing it.[1][4][2] Biewald, inspired by early interactions with Travis Kalanick (who provided seed funding and rides to pitches), pivoted from the research-sounding "Dolores Labs" to CrowdFlower, officially launching at TechCrunch50 in 2009 with a $1.15 million seed round from angels like Dave McClure, Jeff Hammerbacher, and Peter Thiel's Founders Fund.[1][4][7] Early traction came from breaking large data projects into microtasks distributed to a global crowd of over 1.5 million contributors, enabling rapid data labeling for ML training; pivotal moments included 2015 launches of CrowdFlower AI and "Data For Everyone," plus expansions like multi-language support and a 2016 Israel office.[1][6][5]
CrowdFlower rode the early AI and ML data explosion in the 2010s, when raw compute power outpaced training data availability, making human-labeled datasets essential for breakthroughs in deep learning, autonomous systems, and NLP.[5][3] Its timing was ideal post-2009 launch amid big data growth and VC interest in "human computation," influencing the ecosystem by popularizing crowdsourced data platforms that competitors like Scale and CloudFactory later emulated.[3][1] Market forces like exploding unstructured data volumes and enterprise AI adoption (e.g., via Microsoft, Salesforce investments) favored it, positioning CrowdFlower as a key enabler for Silicon Valley's ML teams and global AI scaling.[4][5]
Acquired by Appen for $300 million in March 2019 after raising $58M, CrowdFlower's assets were fully transitioned by 2020, dissolving the standalone entity but embedding its tech into Appen's AI data services.[1][3] Looking ahead, its legacy fuels the ongoing data labeling boom for generative AI and LLMs, with trends like multimodal data demands and ethical labeling shaping successors; Appen may evolve the platform amid competition, potentially amplifying CrowdFlower's influence in enterprise AI pipelines.[3] This pioneering crowd-AI hybrid underscores how early data infrastructure bets propelled the modern ML ecosystem.