Speechify logoSpeechify logo

Software Engineer for Data Infrastructure & Acquisition

SpeechifyEugene, OR, USA
Remote Full-time $140K/yr - $200K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Experience Level

Mid to Senior

Qualifications

The ideal candidate will possess a strong educational background, including a BS, MS, or PhD in Computer Science or a related field, coupled with a minimum of 5 years of relevant industry experience. Proficiency in bash and Python scripting within Linux environments is essential, along with professional experience in Docker and Infrastructure-as-Code principles. Familiarity with GCP, web crawlers, and large-scale data processing is a plus. Candidates should demonstrate the ability to manage multiple tasks and adapt quickly to evolving priorities, paired with strong written and verbal communication skills.

About the job

Speechify builds tools that remove barriers to reading and learning. More than 50 million people use our text-to-speech products to turn PDFs, books, Google Docs, news articles, and websites into audio. Our suite includes iOS, Android, Mac, Chrome extension, and web apps. Google named us Chrome Extension of the Year, and Apple awarded us the 2025 Design Award for Inclusivity.

Our fully remote team includes nearly 200 professionals from a range of backgrounds. Engineers and AI researchers at Speechify have experience at Amazon, Microsoft, and Google, and many hold degrees from Stanford and other top universities. We value innovation and inclusivity in everything we do.

Role Overview

Speechify is hiring a Software Engineer focused on data infrastructure and acquisition for our AI team. This position centers on building and scaling high-quality datasets, integrating infrastructure, engineering, and research to support model training at the petabyte level.

What You Will Do

  • Identify and source new audio data to strengthen our ingestion pipeline.
  • Manage and expand cloud infrastructure on Google Cloud Platform (GCP) using Terraform.
  • Work with scientists to improve data processing cost, throughput, and quality, supporting next-generation model development.
  • Partner with the AI team and leadership to shape a strategic dataset roadmap for future consumer and enterprise products.

Qualifications

  • BS, MS, or PhD in Computer Science or a related field.
  • At least 5 years of professional software development experience.
  • Strong skills in bash and Python scripting within Linux environments.
  • Hands-on experience with Docker, Infrastructure-as-Code, and at least one major cloud provider (GCP preferred).
  • Familiarity with web crawlers and large-scale data processing workflows is a plus.
  • Comfortable multitasking and adjusting to changing priorities.
  • Excellent written and verbal communication skills.

Location

This position is based in Eugene, OR, USA.

About Speechify

Speechify is dedicated to ensuring that reading is never a barrier to learning, empowering over 50 million users worldwide with our innovative text-to-speech solutions. Our commitment to inclusivity and quality has earned us notable awards and a diverse, fully remote team of experts from leading tech companies and prestigious academic institutions.

Similar jobs

Browse all companies, explore by city & role, or SEO search pages. View directory listings: all jobs, search results, location & role pages.

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.