About the job
Director of Site Reliability Engineering – Coupang Pay
We seek an experienced leader to join our Site Reliability Engineering (SRE) team at Coupang Pay. In this pivotal role, you will exemplify exceptional operational and engineering excellence, ensuring the scalability and reliability of our FinTech capabilities across our extensive array of applications, platforms, and infrastructure. You will work closely with key stakeholders to define the technical strategy for reliability and lead a talented team of engineers and DBAs to execute this vision. Your ability to recruit and nurture top engineering and operational talent is essential, as is your skill in managing and delivering complex projects with multiple dependencies. As an effective leader and communicator, you will be instrumental in driving our success.
- Lead and direct a team of skilled engineers and DBAs in the development and management of our production and development environments.
- Formulate a comprehensive technology strategy focused on availability, resilience, and incident response across our FinTech portfolio.
- Oversee the Observability Platform and establish best practices for instrumentation, enabling teams to monitor and respond to the health of their applications.
- Drive the achievement of resilience objectives through rigorous scale and chaos testing.
- Continuously enhance procedures based on insights gained from real experiences and simulated drills.
- Establish baseline resilience requirements, instrumentation standards, and operational readiness checkpoints while tracking adherence.
- Collaborate closely with DevOps and Data teams to provide seamless developer experiences.
- Recruit, develop, and mentor exceptional individuals in SRE and related domains.

