About the job
Deepgram stands at the forefront of the burgeoning trillion-dollar Voice AI industry, offering real-time APIs for speech-to-text (STT), text-to-speech (TTS), and the development of scalable production-grade voice agents. Our platform empowers over 200,000 developers and more than 1,300 organizations, including Twilio, Cloudflare, Sierra, Decagon, Vapi, Daily, Cresta, Granola, and Jack in the Box, to create exceptional voice solutions that are 'Powered by Deepgram'. With our unparalleled accuracy, low latency, and cost-effectiveness, Deepgram’s voice-native foundational models are accessible via cloud APIs or as self-hosted and on-premises software. Having processed over 50,000 years of audio and transcribed over 1 trillion words, we are recognized as the leading authority in voice technology.
Company Operating Rhythm
At Deepgram, we foster an AI-first culture—where proficiency and innovation in AI are integral to our operations, creativity, and performance metrics. Every team member is encouraged to actively engage with advanced AI tools and integrate them into their daily work. We evaluate success through the effective application of AI to yield tangible results, emphasizing the importance of innovative and creative use of emerging technologies. Candidates should be ready to quickly adopt new models, seamlessly incorporate AI into their workflows, and continuously explore the potential of these technologies.
Our pace is set by advances in AI, so expect your daily tasks to evolve rapidly. If you thrive in a dynamic environment that encourages experimentation, adaptability, and continuous learning, you will find this role rewarding.
Opportunity
We are seeking a talented Audio Engineer to lead and enhance audio quality across our voice AI products. This pivotal role merges professional audio engineering expertise with machine learning infrastructure. You will ensure that our voices not only sound accurate but also resonate genuinely with human listeners across diverse voices, recording conditions, and applications.
As a foundational member of our team, you will shape how audio engineering integrates into our end-to-end pipeline, from on-site voice actor recordings to speaker-specific cleanup for fine-tuning, synthetic data generation, and large-scale TTS training. Your goal will be to transform traditionally manual, GUI-based audio workflows into scalable, programmatic systems that operate at Deepgram's scale.

