About the job
Join bet365 as a Site Reliability Engineer and play a crucial role in enhancing system reliability, observability, and performance through a robust engineering approach. You will be instrumental in incident resolution and in the implementation of best practices.
Bringing strong software engineering skills to the table, you will focus on monitoring the health, performance, and availability of our critical systems, significantly impacting our operational efficiency.
Your engineering expertise will be key in implementing solutions that boost reliability, which includes service instrumentation using tools like OpenTelemetry, improving logging practices, and developing features that enhance maintainability. Additionally, you will help create tools and automation for effective service management.
Collaboration is essential in this role, as you will work across various functions to integrate reliability and observability best practices into the software development lifecycle. By supporting governance standards established by central teams, you will cultivate a culture where these principles are fundamental to development. Your contributions will ensure our systems meet user needs and enhance overall service performance.
This position is eligible for our hybrid working from home policy.
