Apply

Site Reliability Engineer (SRE) - AI Infrastructure (Entry Level)

Nebius

Internship|On-site|Amsterdam, Netherlands

Why Join Nebius?Nebius is at the forefront of a transformative wave in cloud computing, dedicated to empowering the global AI economy. We provide essential tools and resources that enable our customers to tackle real-world challenges and revolutionize industries—all while avoiding exorbitant infrastructure expenses and the necessity of large in-house AI/ML t…

Apr 23, 2026

Apply

Senior Site Reliability Engineer (SRE) - Compute Node Team

Nebius

Full-time|Remote|Amsterdam, Netherlands; Remote - Europe

Why Choose Nebius?Nebius is at the forefront of revolutionizing cloud computing, catering specifically to the global AI economy. Our mission is to provide our clients with the essential tools and resources needed to tackle real-world challenges and innovate industries, all without incurring hefty infrastructure expenses or the necessity of assembling large in-house AI/ML teams. Join us and collaborate with some of the brightest minds in AI cloud infrastructure, alongside seasoned leaders and engineers.Where We OperateFounded in Amsterdam and publicly traded on Nasdaq, Nebius boasts a worldwide presence with R&D centers located throughout Europe, North America, and Israel. Our workforce comprises over 1,400 dedicated professionals, including more than 400 highly skilled engineers proficient in both hardware and software engineering, complemented by a dedicated in-house AI R&D team.Your RoleAs a Senior Site Reliability Engineer (SRE) within the Compute Node team at Nebius AI Cloud, you will play a pivotal role in constructing and managing the cluster scheduler and node-level services that oversee and maintain virtual machines across our cloud regions. The focus of this role is on Linux systems engineering, virtualization, and operational reliability. You will work closely with the operating system and hypervisor, influencing the integration of reliability and observability within the Compute platform.Your Key Responsibilities:Guarantee the reliability, availability, and performance of compute nodes hosting virtual machines.Analyze and troubleshoot Linux systems at both user and kernel space, recognizing their capabilities, limitations, and trade-offs.Resolve intricate production issues involving CPU, memory, NUMA, cgroups, and scheduling.Engage hands-on with virtualization and containerization using QEMU/KVM and Linux-based technologies.Develop and enhance observability as a core capability of the node layer, including metrics, logs, traces, alerts, SLIs, and SLOs.Lead incident response efforts, conduct root-cause analyses, and perform postmortems, driving long-term enhancements in reliability.Work in close partnership with platform, kernel/hypervisor, GPU, and infrastructure teams to refine system design and operability.

Apr 23, 2026

Apply

Senior Site Reliability Engineer (SRE)

Nebius

Full-time|Remote|Amsterdam, Netherlands; Israel; Remote - Europe

Why choose Nebius?Nebius is at the forefront of revolutionizing cloud computing to empower the global AI economy. We develop essential tools and resources that enable our clients to tackle real-world problems and innovate across industries—all without incurring substantial infrastructure costs or the necessity of assembling large in-house AI/ML teams. Our team operates at the cutting-edge of AI cloud infrastructure, collaborating with some of the most experienced and innovative leaders and engineers in the industry.Our Work EnvironmentWith our headquarters in Amsterdam and a presence on Nasdaq, Nebius boasts a global footprint with R&D hubs across Europe, North America, and Israel. Our workforce of over 1400 includes more than 400 expert engineers with extensive experience in hardware and software engineering, alongside a dedicated in-house AI R&D team.The RoleYour responsibilities will include:Ensuring fault tolerance, scalability, and uninterrupted operations for our services.Utilizing cutting-edge cloud technology to address various infrastructure challenges.Implementing and enhancing CI/CD processes.We expect you to have:Strong experience with programming languages such as Go, Python, or C++.A solid understanding of classic algorithms and data structures.Commercial experience with and a deep understanding of Unix systems and networking technologies.Experience with containerization and configuration management tools like Ansible, Salt, Terraform, Docker, Kubernetes, and Helm.Bonus points for:A keen interest in backend development.Experience in designing, developing, and managing high-load distributed systems.Commercial experience across various cloud platforms.Coding interviews are part of our hiring process.What we offer:A competitive salary and a comprehensive benefits package.Opportunities for professional advancement within Nebius.Flexible working arrangements.A dynamic, collaborative work environment that fosters initiative and innovation.

Apr 23, 2026

Apply

Site Reliability Engineer at airapps | Amsterdam

airapps

Full-time|On-site|Amsterdam

airapps is seeking a Site Reliability Engineer (SRE) based in Amsterdam. This position centers on maintaining the reliability, scalability, and performance of core systems. Role overview The SRE works alongside both development and operations teams. The main focus is to keep infrastructure running smoothly and to improve service quality for users. What you will do Monitor and support system reliability and uptime Collaborate with developers and operations staff to optimize infrastructure Contribute to enhancing the overall user experience by ensuring stable services Location This role is based in Amsterdam.

Apr 28, 2026

Apply

Senior Site Reliability Engineer - Token Factory (Inference Platform)

Nebius

Full-time|Remote|Amsterdam, Netherlands; Berlin, Germany; London, United Kingdom; Prague, Czech Republic; Remote - Europe; Remote - United States; United States

Why join Nebius?Nebius is at the forefront of a revolutionary shift in cloud computing, dedicated to empowering the global AI economy. We provide innovative tools and resources that enable our clients to tackle real-world challenges and revolutionize their industries without incurring substantial infrastructure costs or the necessity of assembling extensive in-house AI/ML teams. Our workforce operates on the cutting edge of AI cloud infrastructure, collaborating with some of the most seasoned and creative leaders and engineers in the industry.Our Work EnvironmentHeadquartered in Amsterdam and publicly traded on Nasdaq, Nebius boasts a worldwide presence with R&D centers across Europe, North America, and Israel. Our team consists of over 1,400 professionals, including more than 400 highly skilled engineers with profound expertise in both hardware and software engineering, complemented by an in-house AI R&D team.As part of Nebius Cloud, one of the largest GPU clouds globally, the Token Factory team operates tens of thousands of GPUs. We are developing an inference platform designed to deploy a variety of foundation models — including text, vision, audio, and cutting-edge multimodal architectures — quickly, dependably, and effortlessly at scale. To achieve this goal, we are seeking an engineer capable of ensuring the platform operates flawlessly under heavy loads and can recover seamlessly from unexpected issues.In this position, you will take ownership of the reliability, performance, and observability of the complete inference stack. Your day may start with designing and refining telemetry pipelines — turning hundreds of terabytes of signals into actionable insights through metrics, logs, and traces. You might also optimize Kubernetes autoscalers for enhanced GPU efficiency, create Terraform modules that incorporate resilience into every new cluster, or strengthen our request-routing and retry logic to ensure that transient failures remain unnoticed by users. When incidents occur, you will utilize the automation and runbooks you’ve developed to swiftly detect, isolate, and address issues, while fostering a post-mortem culture to prevent future occurrences. All these efforts are directed towards a singular objective: achieving smooth platform scaling while meeting rigorous cost and reliability targets.Success in this role requires a deep understanding of Kubernetes, Prometheus, Grafana, Terraform, and the principles of infrastructure-as-code. You should be comfortable scripting in Python or Bash, grasp the intricacies of alert design and SLOs for high-throughput APIs, and have enough production experience to recognize how distributed back-ends can fail in real-world scenarios. Experience managing GPU-intensive workloads — whether with vLLM, Triton, Ray, or a similar accelerator stack — will be advantageous, as will a background in MLOps or model-hosting platforms.

Apr 23, 2026

Apply

Site Reliability Engineer at pinely | Amsterdam

pinely

Full-time|On-site|Amsterdam, North Holland, Netherlands

Join pinely as we expand our innovative team! We are seeking a dedicated Site Reliability Engineer who thrives in a dynamic environment.Key Responsibilities:Deploy, configure, and manage Linux-based servers efficiently.Diagnose and resolve hardware and network availability issues while monitoring for failures.Oversee numerous nodes across various remote sites and cloud infrastructures.Contribute to infrastructure automation initiatives using Python and/or Go.Engage with cloud platforms including AWS, Google Cloud, and Alibaba Cloud.Enhance monitoring systems for production trading environments utilizing Grafana.Required Qualifications:A minimum of 3 years of experience in managing and troubleshooting high-load systems.Strong grasp of the Linux TCP/IP stack.Familiarity with essential network components such as DHCP, DNS, and BGP.Proficiency in at least one configuration management tool (e.g., Salt, Ansible).Extensive knowledge of infrastructure monitoring tools, including Prometheus and Grafana.Fluent in English (B2/Upper-Intermediate or above).Basic skills in Python/Bash/Go.Willingness to travel for work-related tasks.Preferred Qualifications:Familiarity with leading server hardware brands.Experience optimizing hardware and OS configurations for peak performance.What We Offer:Competitive salary and comprehensive social benefits.Attractive bonus structure with flexibility in salary negotiations.Opportunity to work with unique networks such as radio relay, shortwave, FPGA cards, and atomic clocks, including server optimization on overclocked systems.Access to cutting-edge technologies and a supportive environment for implementing innovative solutions.Flexible working conditions, minimizing bureaucracy and promoting autonomy.Tuition reimbursement and sponsorship for conferences and training.

Feb 25, 2026

Apply

Senior Site Reliability Engineer at Nebius | Amsterdam

Nebius

Full-time|On-site|Amsterdam, Netherlands

Why Join Nebius?Nebius is at the forefront of a transformative era in cloud computing, designed to empower the global AI economy. We provide innovative tools and resources that enable our clients to tackle real-world challenges and revolutionize industries, all while minimizing infrastructure costs and eliminating the necessity for extensive in-house AI/ML teams. Our workforce operates at the cutting edge of AI cloud infrastructure, collaborating with some of the industry’s most experienced and pioneering leaders and engineers.Where We OperateBased in Amsterdam and publicly listed on Nasdaq, Nebius boasts a worldwide presence with research and development hubs in Europe, North America, and Israel. Our team of over 1,400 professionals includes more than 400 highly skilled engineers, proficient in both hardware and software engineering, alongside a dedicated in-house AI research and development team.The RoleNebius is seeking a talented Senior Site Reliability Engineer to join our Hardware Infrastructure team. You will have the opportunity to work from our vibrant office in Amsterdam.The Hardware Infrastructure team is responsible for designing, developing, and maintaining systems integral to the data center lifecycle:Functional and load testing systems.Monitoring engineering equipment in our data centers (power supply, air and water cooling, etc.).Monitoring IT assets: racks, servers, JBODs, JBOGs, power shelves, network devices, etc.Asset management and tracking.Tracking hardware repair tasks.Server production oversight.Your Responsibilities Will Include:Ensuring fault tolerance, scalability, and uninterrupted service operation.Utilizing state-of-the-art technologies to address various infrastructure challenges.Implementing and refining CI/CD processes.We Expect You to Have:Expertise in Linux systems, alongside proficiency in Python and Bash scripting for automation.A proven track record of troubleshooting complex system issues, encompassing hardware, software, and networking.Strong analytical skills and adept problem-solving capabilities, aimed at optimizing system performance.Proficiency in English.Bonus Skills:An interest in backend development.Experience in designing, developing, and managing high-load distributed systems.

Apr 30, 2026

Apply

Senior Network Site Reliability Engineer at Nebius | Amsterdam, Netherlands

Nebius

Full-time|Remote|Amsterdam, Netherlands; Remote - Europe

Why Join NebiusNebius is pioneering a transformative era in cloud computing, tailored to meet the demands of the global AI economy. We provide the essential tools and resources that empower our clients to address real-world challenges and revolutionize their industries without incurring substantial infrastructure costs or assembling large in-house AI/ML teams. Our workforce is engaged at the forefront of AI cloud infrastructure, collaborating with some of the most talented and innovative leaders and engineers in the industry.Our Work EnvironmentHeadquartered in Amsterdam and publicly traded on Nasdaq, Nebius boasts a worldwide presence with R&D centers across Europe, North America, and Israel. Our diverse team of over 1400 professionals includes more than 400 highly skilled engineers, well-versed in both hardware and software engineering, complemented by an in-house AI R&D team.The RoleWe are seeking a Network Site Reliability Engineer (NetSRE) to play a critical role in developing and maintaining the foundational infrastructure of Nebius—the Network, which is essential for all other services. This engineering-centric SRE position will involve defining clear reliability objectives, implementing the necessary tooling and automation to achieve them, while enhancing the operational safety of the network as we scale rapidly.Your Responsibilities Will Include:Establish and oversee reliability benchmarks for network services and critical pathways (including SLIs/SLOs, availability targets, and error budgets as applicable).Enhance reliability across the entire network, focusing not just on services, but also on site readiness, inter-site connectivity (DCI), and operational protocols.Lead incident response efforts in your areas, directing investigations/postmortems and transforming failures into sustainable solutions rather than recurring issues.Develop and refine observability tools including actionable metrics, logs, traces, alerting systems, and expedited debugging processes.

Apr 30, 2026

Apply

Site Reliability Engineer | Trading Operations

Jump Trading

Full-time|On-site|Amsterdam

Join Jump Trading as a Site Reliability Engineer in our Trading Operations team. In this pivotal role, you will ensure the reliability and performance of our trading systems, utilizing your expertise to implement best practices in system design and operations.Your responsibilities will include monitoring system performance, troubleshooting issues, and collaborating with software engineers to improve system architecture. Your contributions will play a critical role in maintaining our competitive edge in the trading industry.

Mar 30, 2026

Apply

Site Reliability Engineer for Cutting-Edge Machine Learning Platform

dev2

Full-time|On-site|Amsterdam

Join our innovative team at dev2 as a Site Reliability Engineer, where you'll play a pivotal role in enhancing our cutting-edge Machine Learning Platform. You will be responsible for ensuring the reliability, availability, and performance of our systems while collaborating with cross-functional teams to implement best practices in software engineering and operations.

Nov 7, 2021

Apply

Early Career Network Engineer (AI Infrastructure)

Nebius

Internship|On-site|Amsterdam, Netherlands

Why Join Nebius?Nebius is at the forefront of cloud computing, dedicated to empowering the global AI economy. We provide the essential tools and resources to help our clients tackle real-world challenges and revolutionize industries, all while minimizing infrastructure expenses and the need for extensive in-house AI/ML teams. Our team works on the cutting edge of AI cloud infrastructure, collaborating with some of the most knowledgeable and innovative leaders and engineers in the industry.Work EnvironmentLocated in the vibrant city of Amsterdam and publicly traded on Nasdaq, Nebius boasts a global presence with R&D centers across Europe, North America, and Israel. Our team consists of over 1,400 professionals, including more than 400 exceptionally skilled engineers with in-depth expertise in both hardware and software engineering, complemented by a dedicated in-house AI R&D team.Role Overview:Primary Focus: Engage in learning network operations while assisting the team with routine tasks related to data center and backbone networks under close supervision.Operational Support (Supervised):Provide assistance in daily network operations for data center and backbone environments.Execute straightforward, well-defined tasks as directed.

Apr 23, 2026

Apply

Software Engineering Manager - Accommodations - SRE

dev2

Full-time|On-site|Amsterdam

We are seeking an experienced Software Engineering Manager to lead our Accommodations team within the Site Reliability Engineering (SRE) department. In this role, you will be responsible for driving innovative solutions that enhance the accommodation experience for our users. You will lead a team of talented engineers, fostering a collaborative environment while ensuring the reliability and performance of our systems.

Nov 7, 2021

Apply

Staff Software Engineer - Hardware Infrastructure

Nebius

Full-time|On-site|Amsterdam, Netherlands

Why Join Nebius?Nebius is at the forefront of a transformative wave in cloud computing, dedicated to empowering the global AI economy. We provide innovative tools and resources that enable our clients to tackle real-world challenges and revolutionize industries, all while minimizing infrastructure costs and eliminating the need for extensive in-house AI/ML teams. By joining our team, you'll collaborate with some of the most skilled and visionary leaders in AI cloud infrastructure.Our Work EnvironmentWith headquarters in Amsterdam and a presence on Nasdaq, Nebius boasts a global reach with R&D hubs in Europe, North America, and Israel. Our workforce of over 1,400 includes more than 400 highly skilled engineers specializing in both hardware and software, alongside an in-house AI R&D team.The OpportunityNebius is seeking a Senior Software Engineer to join our Hardware Infrastructure team, based in our Amsterdam office.The Hardware Infrastructure team is responsible for designing, developing, and supporting systems integral to the data center lifecycle.Your Responsibilities:Develop and design services that automate operations across a large server fleet.Qualifications:5+ years of software engineering experience.Strong proficiency in Python or Golang, or a willingness to quickly adapt to these languages.Proven ability to write reliable code and troubleshoot complex issues.Genuine interest in DevOps processes.Fluency in English.Preferred Qualifications:Experience with cloud infrastructure.Familiarity with container orchestration systems.

Apr 30, 2026

Apply

Observability Infrastructure Engineer

Adyen

Full-time|On-site|Amsterdam

Join our dynamic team at Adyen as an Observability Infrastructure Engineer. In this pivotal role, you will be responsible for enhancing the observability of our systems, ensuring seamless operation and reliability. Collaborate with cross-functional teams to design and implement monitoring solutions that provide insights into system performance and health.

Mar 12, 2026

Apply

Cloud and Infrastructure Engineer

studytube

Full-time|On-site|Amsterdam, Noord-Holland, Netherlands

Are you ready to advance your career? Join us at studytube as a Cloud and Infrastructure Engineer, where you'll enhance and innovate our cloud systems. Your hands-on experience with DevOps tools will be crucial in managing, optimizing, and scaling our learning platform's infrastructure.Key ResponsibilitiesAs a pivotal member of our team, you will help shape and advance our Cloud Platform by introducing innovative solutions, driving progress, and ensuring that our infrastructure is secure, scalable, and prepared for future demands.In an environment that champions excellence, teamwork, and personal development, you’ll have the autonomy to experiment, refine, and create a significant impact.Your Role Includes:Designing, constructing, and managing our cloud infrastructure to ensure it remains robust, efficient, and dependable.Automating processes by developing and maintaining Infrastructure as Code (IaC) utilizing tools such as Terraform.Enhancing our security posture by applying cloud security best practices and ensuring compliance with industry standards.Refining our CI/CD pipelines to accelerate and streamline deployments.Collaborating with cross-functional teams to translate requirements into scalable infrastructure solutions.Proactively monitoring system health to address potential issues before they affect users.

Jan 26, 2026

Apply

Senior Staff Software Engineer - Hardware Infrastructure Observability

Nebius

Full-time|On-site|Amsterdam, Netherlands

Why Join Nebius?Nebius is at the forefront of revolutionizing cloud computing to empower the global AI economy. We provide innovative tools and resources that enable our clients to tackle real-world problems and revolutionize industries without incurring exorbitant infrastructure costs or the necessity of establishing large in-house AI/ML teams. Our workforce is comprised of leading experts in AI cloud infrastructure, working closely with some of the most seasoned and innovative leaders and engineers in the sector.Workplace CultureBased in Amsterdam and publicly traded on Nasdaq, Nebius boasts a global presence with R&D centers across Europe, North America, and Israel. Our diverse team of over 1400 professionals includes more than 400 highly skilled engineers with extensive expertise in hardware and software engineering, along with a dedicated AI R&D team.The OpportunityNebius is seeking a talented Senior Software Engineer to become an integral part of our Hardware Infrastructure Observability team, with the option to work from our Amsterdam office. We specialize in developing and maintaining low-level monitoring systems for servers and data center engineering to ensure reliability at scale. Our responsibilities include designing and managing maintenance systems that facilitate safe, predictable fleet-wide changes while sustaining infrastructure health.

Apr 30, 2026

Apply

Senior Linux Infrastructure Engineer

Adyen

Full-time|On-site|Amsterdam

Join Adyen as a Senior Linux Infrastructure Engineer and be part of a dynamic team dedicated to providing cutting-edge solutions. In this role, you will leverage your expertise in Linux systems to enhance our infrastructure and ensure optimal performance. You will work collaboratively with cross-functional teams to architect and implement scalable solutions that drive our global payment platform.

Mar 3, 2026

Apply

Early Career Full Stack Developer (AI Infrastructure)

Nebius

Internship|On-site|Amsterdam, Netherlands

Why Join NebiusNebius is at the forefront of a transformative era in cloud computing, empowering the global AI economy. We are dedicated to providing innovative tools and resources that enable our clients to tackle real-world challenges and revolutionize industries, all while minimizing infrastructure costs and eliminating the need for extensive in-house AI/ML teams. Our team members are pioneering advancements in AI cloud infrastructure, collaborating with some of the most skilled and visionary leaders and engineers in the industry.Our Work EnvironmentBased in Amsterdam and publicly traded on Nasdaq, Nebius boasts a global presence with R&D hubs across Europe, North America, and Israel. Our workforce consists of over 1,400 professionals, including more than 400 highly knowledgeable engineers who specialize in both hardware and software engineering, along with a dedicated in-house AI R&D team.Position Overview:Location: AmsterdamDuration: 3 monthsStart Date: June 2026Compensation: PaidEligibility: Current University student (Computer Science or related field), Recent Graduate or Early Career specialistWork Authorization: Authorized to work in the job’s locationThe RoleWe are seeking an Early Career Full Stack Developer to create cloud-based integrations on Microsoft Azure. This position offers the opportunity to collaborate with seasoned engineers and acquire hands-on experience in developing API-driven and cloud-native integration solutions.

Apr 23, 2026

Apply

Machine Learning Engineer - Life Sciences (Entry Level)

Nebius

Internship|On-site|Amsterdam, Netherlands

Why Join NebiusNebius is at the forefront of the cloud computing revolution, dedicated to empowering the global AI economy. We provide our clients with innovative tools and resources to tackle real-world challenges and revolutionize industries, all while minimizing infrastructure costs and the need for extensive in-house AI/ML teams. Our employees are immersed in cutting-edge AI cloud infrastructure, collaborating with some of the most skilled and visionary leaders and engineers in the industry.Our Work EnvironmentBased in Amsterdam and publicly listed on Nasdaq, Nebius boasts a worldwide presence with research and development hubs in Europe, North America, and Israel. Our team comprises over 1400 employees, including more than 400 proficient engineers with extensive knowledge in both hardware and software engineering, along with a dedicated in-house AI R&D team.Role Overview:Location: AmsterdamContract Duration: 3-6 monthsStart Date: June 2026Compensation: Paid internshipEligibility: Current university student (Computer Science or related field), recent graduate, or early-career professionalWork Authorization: Must be legally permitted to work in the Netherlands.About the Position:This role involves enhancing the efficiency of biological AI models (including protein folding, protein design, and large foundational models) to ensure they run faster and more effectively during inference without compromising biological accuracy.

Apr 23, 2026

Apply

Mid-Level Platform Engineer

Hot ITem Conclusion

Full-time|On-site|Amsterdam, Noord-Holland, Nederland

As a Platform Engineer, you play a vital role in enhancing the experience of our customers and organization. Your technical expertise contributes significantly to simplifying the daily lives of thousands. Whether you are refining our package delivery windows, enhancing travel plans with real-time train information, or developing our internal chatbot, your contributions have a meaningful impact!Your responsibility includes designing and constructing robust, secure, and scalable cloud infrastructures that underpin the work of the Data Engineering team. You embrace Agile and DevOps philosophies and enjoy staying ahead of the curve with the latest technological advancements. Furthermore, you assist colleagues in developing their knowledge and experience by sharing your insights.Curious to learn more about this role? Read the interview with our Platform Engineer Mark to discover his experiences.About UsThe Platform Engineering cluster comprises approximately 15 Engineers, Tech Leads, and Architects. Our core values? Technology, innovation, and exciting initiatives. We host hackathons, enjoy laser tag at the office, or gather for a friendly game of pétanque. A healthy dose of humor is always welcome!Hot ITem Conclusion believes in agile, data-driven organizations that stay ahead of the competition by adapting swiftly to new developments. Everything we do aims to significantly enhance the performance of individuals, departments, and organizations, collaborating closely with our clients based on high-quality data and insights into crucial matters.

Mar 12, 2026

Senior Linux Infrastructure Engineer

Experience Level

Qualifications

About the job

About Adyen

Similar jobs