About the job
ABOUT POOLSIDE
At Poolside, we are on a mission to develop Artificial General Intelligence within this decade. We believe that only a select few companies will lead this transformation, setting benchmarks in speed, talent acquisition, and groundbreaking research. Our focus is on harnessing advanced engineering and infrastructure to scale training for more sophisticated models, creating robust economic engines while prioritizing user and customer success.
Poolside's vision is to create a future where AI drives valuable economic work and scientific advancement. We aim to accelerate software development by transforming the developer experience through innovative agentic systems and coding assistants, integrating these directly into secure enterprise development environments.
ABOUT OUR TEAM
Founded in the US, our team spans Europe and North America, with monthly collaborative sessions in Paris, fostering in-person connections. We are a multidisciplinary group of experts in research, engineering, and business, united by a shared commitment to our mission. Our culture is built on low egos and kindness, fueling our intense pursuit of excellence as we work collaboratively towards achieving AGI through intelligent systems designed for software development.
ABOUT THE ROLE
As a pivotal member of our Pretraining Data team, you will be instrumental in constructing and scaling our Model Factory, the backbone of our foundation models. Your primary focus will be to design and maintain high-performance pipelines, converting vast amounts of raw tokens into the quality datasets essential for our models. This hands-on role is vital for enabling our research and development capabilities.
