Software Engineer, Data Infrastructure & Acquisition

Speechify, Miami, FL, USA
Remote Full-time $140K/yr - $200K/yr


Experience Level

Mid to Senior

Qualifications

Ideal Candidate Qualifications

  • BS/MS/PhD in Computer Science or a relevant field.
  • 5+ years of professional experience in software development.
  • Strong proficiency in bash/Python scripting in Linux environments.
  • Experience with Docker, Infrastructure-as-Code principles, and at least one major cloud provider (we use GCP).
  • Familiarity with web crawlers and large-scale data processing workflows is advantageous.
  • Capacity to manage multiple tasks and adapt to changing priorities.
  • Excellent written and verbal communication skills.

About the job

Speechify aims to remove reading as an obstacle to learning. More than 50 million people use Speechify’s text-to-speech tools to turn PDFs, books, Google Docs, news articles, and websites into audio. Our product suite includes iOS, Android, Mac, Chrome extension, and web apps. Google named Speechify the Chrome Extension of the Year, and Apple awarded us the 2025 Design Award for Inclusivity.

Our fully distributed team of nearly 200 works from locations around the globe. The group includes frontend and backend engineers, AI research scientists, and professionals from Amazon, Microsoft, and Google, as well as alumni of top PhD programs and founders from successful startups like Stripe, Vercel, and Bolt.

Role Overview

The Data Infrastructure & Acquisition Software Engineer joins our Data team within the AI division. This position centers on data collection for model training. The team builds and manages large, high-quality datasets at petabyte scale, blending infrastructure, engineering, and research to do so efficiently. The work directly supports Speechify’s next generation of products.

What You Will Do

  • Find and source new audio data to improve our ingestion pipeline.
  • Maintain and scale the cloud infrastructure for the pipeline, which runs on Google Cloud Platform and uses Terraform.
  • Work with scientists to optimize cost, throughput, and data quality, enabling richer datasets at larger scale and lower cost.
  • Collaborate with the AI team and company leadership to shape the dataset roadmap for future consumer and enterprise products.

What We Look For

  • BS, MS, or PhD in Computer Science or a related field.
  • At least 5 years of professional software development experience.
  • Strong skills in bash and Python scripting on Linux systems.
  • Hands-on experience with Docker, Infrastructure-as-Code (such as Terraform), and at least one major cloud provider (GCP preferred).
  • Knowledge of web crawlers and large-scale data processing is a plus.
  • Ability to handle multiple projects and shift priorities as needed.
  • Clear written and verbal communication skills.

Location: Miami, FL, USA (fully distributed team)

About Speechify

Speechify is dedicated to making reading accessible to all. Our innovative text-to-speech technology empowers millions to overcome reading barriers, enhancing learning experiences across various formats and platforms.
