Responsibilities
Identify and acquire new audio data sources to enhance our ingestion pipeline.
Manage and expand our cloud infrastructure for the ingestion pipeline, currently running on Google Cloud Platform (GCP) and managed via Terraform.
Work closely with our Scientists to improve the cost efficiency, throughput, and quality of data, ensuring it meets the requirements for our next-generation AI models.
Collaborate with colleagues across the AI Team and Speechify Leadership to develop the dataset roadmap that will drive future consumer and enterprise products.
Ideal Candidate Qualifications
BS/MS/PhD in Computer Science or a related field.
5+ years of professional experience in software development.
Strong proficiency in bash/Python scripting within Linux environments.
Experience with Docker and Infrastructure-as-Code principles, with practical experience in at least one major cloud provider (GCP preferred).
Familiarity with web crawlers and large-scale data processing workflows is an advantage.
Excellent multitasking abilities and adaptability to shifting priorities.
Exceptional communication skills, both written and verbal.
About the job
Speechify helps over 50 million people turn reading material into audio, making learning more accessible. Our text-to-speech tools work across PDFs, books, Google Docs, news articles, and websites, supporting users on iOS, Android, Mac, Chrome, and the web. Google named Speechify the Chrome Extension of the Year, and Apple awarded us the 2025 Design Award for Inclusivity.
Our team includes nearly 200 people working remotely from around the world, with backgrounds at companies like Amazon, Microsoft, and Google. We focus on pushing text-to-speech technology forward.
Role overview
Speechify is hiring a Software Engineer - Data Infrastructure & Acquisition to join the Data team within our AI division. This position centers on building and maintaining systems for large-scale data collection, supporting model training efforts. The work combines infrastructure, engineering, and research to efficiently assemble and manage petabyte-scale datasets.
Location
This role is based in Dhaka, Bangladesh.
About Speechify
Speechify is a leading tech company dedicated to making reading accessible to everyone. Our award-winning text-to-speech technology is used by millions worldwide, and our diverse, fully remote team is committed to driving innovation in the field.