About the job
Speechify aims to remove reading as an obstacle to learning. More than 50 million people use Speechify’s text-to-speech tools to turn PDFs, books, Google Docs, news articles, and websites into audio. Our product suite includes iOS, Android, Mac, Chrome extension, and web apps. Google named Speechify the Chrome Extension of the Year, and Apple awarded us the 2025 Design Award for Inclusivity.
Our fully distributed team of nearly 200 works from locations around the globe. The group includes frontend and backend engineers, AI research scientists, and professionals from Amazon, Microsoft, and Google, as well as alumni of top PhD programs and founders from successful startups like Stripe, Vercel, and Bolt.
Role Overview
The Data Infrastructure & Acquisition Software Engineer joins our Data team within the AI division. This position centers on data collection for model training. The team builds and manages large, high-quality datasets at petabyte scale, blending infrastructure, engineering, and research to do so efficiently. The work directly supports Speechify’s next generation of products.
What You Will Do
- Identify and acquire new audio data to improve our ingestion pipeline.
- Maintain and scale the cloud infrastructure for the pipeline, which runs on Google Cloud Platform and uses Terraform.
- Work with scientists to optimize cost, throughput, and data quality, enabling richer datasets at larger scale and lower cost.
- Collaborate with the AI team and company leadership to shape the dataset roadmap for future consumer and enterprise products.
What We Look For
- BS, MS, or PhD in Computer Science or a related field.
- At least 5 years of professional software development experience.
- Strong skills in Bash and Python scripting on Linux systems.
- Hands-on experience with Docker, Infrastructure-as-Code (such as Terraform), and at least one major cloud provider (GCP preferred).
- Knowledge of web crawlers and large-scale data processing is a plus.
- Ability to handle multiple projects and shift priorities as needed.
- Clear written and verbal communication skills.
Location: Miami, FL, USA (fully distributed team)
