CoreWeave logoCoreWeave logo

Senior Data & MLOps Engineer

CoreWeaveLondon, UK
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Experience Level

Senior

Qualifications

Qualifications:Proven experience in data engineering and MLOps practices. Strong proficiency in designing and managing data pipelines. Expertise in machine learning model deployment and monitoring. Familiarity with orchestration technologies and distributed systems. Excellent problem-solving skills and the ability to work in a collaborative environment.

About the job

CoreWeave is The Essential Cloud for AI™. Designed for visionaries by visionaries, CoreWeave offers an innovative platform of technology, tools, and expert teams, empowering creators to develop and scale AI solutions with assurance. Our infrastructure is trusted by top AI labs, startups, and multinational enterprises, merging outstanding performance with profound technical knowledge to facilitate advancements and convert computational power into actionable capability. Established in 2017, CoreWeave became publicly traded (Nasdaq: CRWV) in March 2025. Discover more at www.coreweave.com.
 
We proudly hold the title of a Living Wage accredited Employer.

 

Your Responsibilities:

The Data Science team is dedicated to building a state-of-the-art reliability platform. This system encompasses various elements of data processing and analysis, including data intake, deriving significant metrics, detecting anomalies, forecasting potential challenges, identifying sluggish processes in distributed environments, and employing automated analysis to ascertain root causes. We work collaboratively with internal teams such as Fleet, Infrastructure, and AI Platform to bolster system stability, optimize resource utilization, minimize resolution times, and sustain service availability and financial performance.

Role Overview:

As a Senior Data & MLOps Engineer, you will architect and scale the infrastructure that underpins the GPU Intelligence Platform. Your role will entail developing pipelines for data handling, feature engineering, model training, and delivering insights and predictions regarding system health and optimization. You will lead the transition of the system from initial prototypes to a production-ready environment operational across the fleet, with a focus on scalability while differentiating between real-time services and periodic processing, as well as managing resources dynamically based on system load and data frequency. You will design and deploy scalable distributed services utilizing orchestration technologies.

Key Responsibilities:

  • Create and implement scalable data ingestion pipelines.
  • Develop feature processing and baseline computation systems.
  • Productionize models for predictive analysis and anomaly detection.
  • Establish and manage low-latency services and robust offline workflows.
  • Architect horizontally scalable services with a distinct separation between components, leveraging orchestration for distribution.

About CoreWeave

CoreWeave is at the forefront of providing innovative cloud solutions tailored for AI. With a commitment to empowering creators and organizations, we leverage cutting-edge technology and expert teams to ensure the successful development and scaling of AI initiatives. Our focus on superior infrastructure, combined with deep technical insights, positions us as a leader in the AI cloud space.

Similar jobs

Browse all companies, explore by city & role, or SEO search pages. View directory listings: all jobs, search results, location & role pages.

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.