Loading organizations...

§ Private Profile · San Francisco, CA, USA
Automated voice acting and emotive speech synthesis with deep neural…
Key people at Voicery.
Voicery was founded in 2017 by Andrew Gibiansky (Founder).
Voicery synthesizes the most realistic human voices using deep neural networks. Prior to starting Voicery, Andrew, one of the founders, led the speech synthesis research team at Baidu Research. By synthesizing speech nearly indistinguishable from human, Voicery enable new media applications, such as auto-generated audiobooks, podcasts, television dubs, and voice overs.
Voicery was founded in 2017 by Andrew Gibiansky (Founder).
Key people at Voicery.
Voicery is a technology company specializing in automated voice acting and emotive speech synthesis using deep neural networks. It builds customizable, natural-sounding text-to-speech (TTS) engines that enable brands and enterprises to create unique, emotionally expressive AI voices tailored to their identity. These voices serve applications such as voice assistants, call centers, audiobooks, podcasts, video game characters, and automated customer interactions. Voicery’s technology addresses the problem of robotic, unnatural synthetic speech by delivering human-like, flexible, and fast speech synthesis that enhances user engagement and brand consistency. The company demonstrated strong growth momentum by quickly developing its engine and securing pilot customers, setting a benchmark in neural TTS before its platform was discontinued in 2020, with its innovations influencing subsequent AI voice solutions[1][2][3][5].
Voicery was founded by Andrew Gibiansky and his team, who previously worked at Baidu Research leading deep learning speech synthesis efforts. Their experience with state-of-the-art machine learning techniques and commercial production systems at Baidu inspired them to create a more advanced, human-like speech synthesis engine. The idea emerged from the need to improve upon existing research, which was insufficient for their goals, prompting them to extend deep neural network approaches. Remarkably, they built the initial Voicery engine in just two and a half months. Early traction came from pilot partnerships with customers exploring new applications for synthetic voices, such as automated audiobooks and voice assistants[1].
Voicery rides the wave of advancements in AI, deep learning, and natural language processing that have transformed speech recognition and synthesis. The timing was critical as demand for more human-like AI voices grew across industries—customer service, media, gaming, and accessibility. Market forces such as the rise of voice assistants, automated content generation, and personalized user experiences favored Voicery’s approach. By pushing the boundaries of neural TTS, Voicery contributed to the broader ecosystem by enabling new voice-driven applications and setting a benchmark for quality and expressiveness in synthetic speech[1][2].
Although Voicery’s original platform was discontinued in 2020, its pioneering work in custom, emotive AI voices has left a lasting impact on the field. The future of voice synthesis will likely build on Voicery’s innovations, emphasizing even more natural, context-aware, and emotionally rich AI voices integrated across diverse applications. Trends such as multilingual voice agents, real-time conversational AI, and privacy-focused deployments (e.g., GDPR compliance) will shape the evolution of this technology. Companies inspired by Voicery’s model will continue to expand the role of synthetic voices in customer engagement, entertainment, and accessibility, making AI voice interaction a seamless part of everyday digital experiences[2][4].