Software Engineer - Data Infrastructure & Acquisition

Speechify, Thessaloniki, Greece
Remote, Full-time


Qualifications

BS/MS/PhD in Computer Science or related field. 5+ years of software development experience. Proficient in bash and Python scripting in Linux environments. Experience with Docker and Infrastructure-as-Code, and familiarity with major Cloud Providers (GCP preferred). Experience with web crawlers and large-scale data processing is a plus. Strong multitasking abilities and adaptability to changing priorities. Effective written and verbal communication skills.

About the job

Speechify’s mission is to make reading accessible for everyone, removing barriers to learning for millions worldwide. Over 50 million people use Speechify to convert PDFs, books, Google Docs, news articles, and websites into audio, helping them read faster and retain more. Our products span iOS, Android, Mac, Chrome, and web. Google named us Chrome Extension of the Year, and Apple recognized us with the 2025 Design Award for Inclusivity.

Our fully remote team includes nearly 200 engineers, researchers, and professionals from companies like Amazon, Microsoft, and Google, as well as graduates of leading universities such as Stanford. Many team members have also founded their own startups, bringing a wide range of experience and creativity to our work.

Role Overview

Speechify is hiring a Software Engineer for the Data Infrastructure & Acquisition team, based in Thessaloniki, Greece. This position focuses on data collection and infrastructure to support AI model training. The team builds and maintains petabyte-scale datasets efficiently, combining engineering, research, and infrastructure expertise.

What You Will Do

  • Identify and integrate new audio data sources into the data ingestion pipeline.
  • Manage and improve cloud infrastructure for data ingestion, currently using Google Cloud Platform (GCP) and Terraform.
  • Collaborate with scientists to optimize cost, throughput, and data quality, supporting next-generation AI models.
  • Work with the AI team and leadership to plan the dataset roadmap for future consumer and enterprise products.

Requirements

  • BS, MS, or PhD in Computer Science or a related field.
  • At least 5 years of professional software development experience.
  • Proficiency in bash and Python scripting in Linux environments.
  • Experience with Docker and Infrastructure-as-Code tools, and professional experience with at least one major cloud provider (GCP preferred).
  • Background with web crawlers and large-scale data processing is a plus.
  • Strong organizational skills and ability to manage shifting priorities.
  • Excellent written and verbal communication skills.

About Speechify

Speechify is dedicated to eliminating reading barriers for learners across the globe. Our innovative text-to-speech solutions empower users to engage with content more effectively, enhancing learning and comprehension.
