About the job
Speechify aims to make reading accessible for everyone. Over 50 million people use our text-to-speech products to convert PDFs, books, Google Docs, news articles, and websites into audio, helping them read faster and remember more.
Our tools run on iOS, Android, Mac, Chrome, and the web. We’ve earned recognition from Google as Chrome Extension of the Year and received Apple’s 2025 Design Award for Inclusivity.
Our team of nearly 200 works fully remotely. We bring together frontend and backend engineers, AI researchers, and specialists from companies like Amazon, Microsoft, and Google, alongside accomplished startup founders and top PhD graduates.
Role Overview
Speechify’s AI division is hiring a Software Engineer for the Data team in Yerevan, Armenia. This engineer will support every stage of data collection for model training. Our data operations handle petabyte-scale datasets, blending infrastructure, engineering, and research to keep costs low and quality high.
What You Will Do
- Find and source new audio data and bring it into our data ingestion pipeline.
- Maintain and expand our cloud infrastructure, mainly using Google Cloud Platform (GCP) and Terraform.
- Work with scientists to balance cost, throughput, and quality, delivering large-scale, enriched datasets for model development.
- Partner with the AI team and company leadership to shape a strategic dataset roadmap for both consumer and enterprise products.
