About the job
As a Senior DevOps Engineer, you will be at the forefront of innovation, collaborating closely with architecture and development teams to design, operate, and scale AWS-based, cloud-native systems that are highly available and fault-tolerant. Your expertise will ensure that both internal and external applications consistently meet or surpass ambitious Service Level Agreements (SLAs), while you champion continuous improvement in reliability, performance, and operational efficiency.
This role involves managing the daily operations of AWS infrastructure, focusing on automated deployments, observability, incident response, and the ongoing optimization of systems and services. By leveraging both established and cutting-edge AWS tools, you will help create a secure, resilient, and cost-optimized cloud environment.
You will achieve all these objectives while keeping a keen eye on cost-efficiency.
Key Responsibilities:
- Infrastructure Design & Management – Architect, provision, and maintain scalable, secure, and highly available cloud infrastructure.
- CI/CD Pipeline Development – Build, optimize, and maintain automated build, test, and deployment pipelines to expedite delivery.
- Monitoring & Observability – Implement and manage robust monitoring, logging, and alerting systems to maintain system health and performance.
- Configuration Management & Automation – Utilize tools (e.g., CloudFormation, Terraform) to automate infrastructure and application management.
- Security & Compliance – Uphold security best practices (IAM, secrets management, patching, vulnerability scanning) while ensuring compliance with industry standards.
- Incident Response & Troubleshooting – Lead root cause analysis and resolution of production issues to minimize downtime.
- Cost Optimization – Monitor and optimize cloud resource usage to achieve a balance between performance and cost efficiency.
- Collaboration with Development Teams – Partner with developers to integrate DevOps practices, enhance workflows, and provide infrastructure guidance.
- Scalability & Reliability Engineering – Design and implement systems capable of handling growth while maintaining resilience and performance.
- Mentorship & Best Practices – Mentor fellow engineers, advocate for a DevOps culture, and establish standards for automation, deployment, and operations.

