About the job
About Formation Bio
Formation Bio is a pioneering pharmaceutical company harnessing technology and AI to revolutionize drug development.
As advancements in artificial intelligence and drug discovery yield more candidate drugs than the industry can feasibly progress through traditional clinical trials, Formation Bio recognizes that this bottleneck may hinder the availability of new medicines for patients. Established in 2016 as TrialSpark Inc., we have developed innovative technology platforms and processes designed to expedite every stage of drug development and clinical trials. By collaborating with pharmaceutical companies, research organizations, and biotechs, we in-license and develop drugs beyond clinical proof of concept, effectively bridging the gap between innovation and patient accessibility. Our endeavors are supported by prominent investors from both the pharma and tech industries, including a16z, Sequoia, Sanofi, Thrive Capital, Sam Altman, John Doerr, Spark Capital, SV Angel Growth, and others.
At Formation Bio, our core values drive our mission to transform the pharmaceutical landscape. Each team member contributes to our collective goal of accelerating the delivery of new treatments to patients.
About the Position
We are seeking a Senior Data Engineer to join our Scientific Data Intelligence (SDI) team at Formation Bio. In this role, you will play a critical part in converting Real World Data (RWD), which includes electronic health records, claims, and other longitudinal patient data, into structured, analytics-ready datasets. You will collaborate closely with our Data Science team to model and transform data, conducting analyses that answer research questions, generate evidence, and support scientific decision-making across our drug portfolio.
This position combines healthcare data engineering, real-world evidence analysis, and generative AI. It requires a solid foundation in building reliable, scalable data pipelines, along with a hands-on approach to working directly with data: constructing cohorts and performing analyses to drive insights.
