About the job
Speechify builds text-to-speech tools that turn written content into audio, helping over 50 million users access information in new ways. From PDFs and books to Google Docs and news articles, our products work across iOS, Android, Mac, Chrome, and the web. Google named us Chrome Extension of the Year, and Apple recognized our work with the 2025 Design Award for Inclusivity.
Our fully distributed team of nearly 200 people collaborates from locations around the world. Team members bring experience from Amazon, Microsoft, Google, top universities, and startups.
Role Overview
Speechify is hiring a Software Engineer focused on Data Infrastructure & Acquisition to join our AI team in Da Nang, Vietnam. This engineer will help collect and manage the data critical for training our models. The team builds petabyte-scale datasets while keeping costs low by combining infrastructure, engineering, and research expertise.
What You Will Do
- Find and source new audio data for integration into our ingestion pipeline.
- Manage and improve cloud infrastructure for the ingestion pipeline, currently built on Google Cloud Platform and provisioned with Terraform.
- Collaborate with scientists to optimize the cost, throughput, and quality of the data used to develop future models.
- Work with AI team members and leadership to shape the dataset roadmap supporting new consumer and enterprise products.
