Andromeda logoAndromeda logo

Senior Site Reliability Engineer for AI Infrastructure

AndromedaGlobal Remote / San Francisco, CA
Remote Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Experience Level

Senior

Qualifications

Proven experience in site reliability engineering or a similar role. Strong knowledge of cloud infrastructure, containerization (Docker, Kubernetes), and CI/CD practices. Experience with monitoring tools and incident response processes. Expertise in scripting and automation (Python, Bash, etc.). Excellent problem-solving skills and a proactive approach to system reliability.

About the job

Join Andromeda as a Senior Site Reliability Engineer specializing in AI Infrastructure. In this pivotal role, you will be responsible for ensuring the reliability, scalability, and performance of our cutting-edge AI systems. Collaborate with cross-functional teams to design and implement robust infrastructure solutions that support our innovative AI initiatives. Your expertise will play a crucial role in maintaining optimal service availability and improving system performance.

About Andromeda

Andromeda is at the forefront of AI technology, committed to pushing the boundaries of innovation. With a global presence and a dynamic team, we are dedicated to creating scalable solutions that revolutionize industries. Join us to be part of a transformative journey in AI!

Similar jobs

Browse all companies, explore by city & role, or SEO search pages. View directory listings: all jobs, search results, location & role pages.

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.