Tailoring 0 resumes…

We'll move completed jobs to Ready to Apply automatically.

Senior Site Reliability Engineer at Salla | Makkah | RoboApply Jobs

Senior Site Reliability Engineer at Salla | Makkah

SallaMakkah, Makkah Province, Saudi Arabia

On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.

Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

To excel in this role, candidates should possess:A robust background in site reliability engineering or a related field.Proven experience in incident management and system performance optimization.Excellent problem-solving skills and the ability to work under pressure.Strong communication skills to collaborate effectively with engineering teams.

About the job

Join Salla as a Senior Site Reliability Engineer (SRE) where you will spearhead initiatives aimed at enhancing system reliability, manage complex incident responses, optimize platform performance, and mentor engineering teams in the development of resilient systems. Your role will also include participating in our on-call rotation to uphold our commitment to platform reliability.

Key Responsibilities

Lead the response to high-severity incidents and facilitate post-incident analyses.
Troubleshoot intricate issues spanning applications, infrastructure, and networks.
Enhance Mean Time to Recovery (MTTR) through improved monitoring, alerting, and diagnostic tools.
Engage in the on-call rotation to support our production systems.

Performance & Scalability

Identify and address performance bottlenecks and scaling obstacles.
Conduct load testing and strategic capacity planning for high-traffic scenarios.

Infrastructure & Operations

Advance cloud-native infrastructure, deployment methodologies, and automation processes.
Boost resilience, fault tolerance, and recovery mechanisms across systems.

Observability

Create and enhance dashboards, alerts, metrics, logs, and traces.
Establish Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to improve system visibility.

Tooling & Automation

Craft tools that diminish operational toil and bolster reliability.
Contribute to infrastructure-as-code practices, CI/CD pipelines, and GitOps workflows.

Collaboration

Collaborate closely with engineering teams to ensure services are robust and production-ready.
Mentor engineers in reliability, troubleshooting, and operational best practices.

Bonus Skills

Experience with large-scale, high-traffic systems.
Familiarity with fault-tolerant design, disaster recovery (DR), and high availability (HA) patterns.
Knowledge of SLIs, SLOs, and error budgets.

Location Preference

We prefer candidates located within GMT 0 to +6 time zones to facilitate team collaboration and on-call coverage.

Requirements

Extensive experience with Kubernetes, service mesh technologies, and cloud platforms (AWS, GCP, or Azure).
In-depth knowledge of Linux, networking, distributed systems, and load balancing.
Practical experience with Terraform or similar Infrastructure-as-Code tools.
Proficiency with observability platforms such as Prometheus, Grafana, Loki, Mimir, or equivalent.
Strong skills in scripting or programming languages such as Python, Go, or Java.

About Salla

Salla is a pioneering technology company dedicated to providing innovative solutions that empower businesses in the digital landscape. With a focus on reliability and performance, we strive to create systems that not only meet the demands of today but are also prepared for the challenges of tomorrow.

Senior Site Reliability Engineer at Salla | Makkah

SallaMakkah, Makkah Province, Saudi Arabia

On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.

Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

About the job

Key Responsibilities

Lead the response to high-severity incidents and facilitate post-incident analyses.
Troubleshoot intricate issues spanning applications, infrastructure, and networks.
Enhance Mean Time to Recovery (MTTR) through improved monitoring, alerting, and diagnostic tools.
Engage in the on-call rotation to support our production systems.

Performance & Scalability

Identify and address performance bottlenecks and scaling obstacles.
Conduct load testing and strategic capacity planning for high-traffic scenarios.

Infrastructure & Operations

Advance cloud-native infrastructure, deployment methodologies, and automation processes.
Boost resilience, fault tolerance, and recovery mechanisms across systems.

Observability

Create and enhance dashboards, alerts, metrics, logs, and traces.
Establish Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to improve system visibility.

Tooling & Automation

Craft tools that diminish operational toil and bolster reliability.
Contribute to infrastructure-as-code practices, CI/CD pipelines, and GitOps workflows.

Collaboration

Collaborate closely with engineering teams to ensure services are robust and production-ready.
Mentor engineers in reliability, troubleshooting, and operational best practices.

Bonus Skills

Experience with large-scale, high-traffic systems.
Familiarity with fault-tolerant design, disaster recovery (DR), and high availability (HA) patterns.
Knowledge of SLIs, SLOs, and error budgets.

Location Preference

We prefer candidates located within GMT 0 to +6 time zones to facilitate team collaboration and on-call coverage.

Requirements

Extensive experience with Kubernetes, service mesh technologies, and cloud platforms (AWS, GCP, or Azure).
In-depth knowledge of Linux, networking, distributed systems, and load balancing.
Practical experience with Terraform or similar Infrastructure-as-Code tools.
Proficiency with observability platforms such as Prometheus, Grafana, Loki, Mimir, or equivalent.
Strong skills in scripting or programming languages such as Python, Go, or Java.

Senior Site Reliability Engineer at Salla | Makkah

Unlock Your Potential

Experience Level

Qualifications

About the job

About Salla

Direct Appointment Setter at Southern National Roofing | Columbia, MD

Project Superintendent

Community Support Lead Care Manager at Pacific Health Group | Remote

Physical Therapist at Performance Optimal Health | New Canaan

Part-Time In-Home Veterinarian

Sales Support Specialist at Golden Lighting | Tallahassee, FL

New Home Sales Consultant at LGI Homes | Lebanon, TN

Medical Director - Licensed Psychiatrist

Recruiting Coordinator - Join Our Innovative Team

Experienced Litigation Paralegal - Remote

Senior Director of Digital Communications

Nutritional Cook for Early Childhood Center

FMS Analyst at ACT1 Federal | Patuxent River, MD

Automotive Technician Opportunity at Citrus Kia

Software Security Analyst at TP-Link Systems Inc. | Irvine, California

Network Intrusion Detection Engineer - Active TS/SCI with CI Poly

Tax Associate - Private Client

Lead Behavior Technician - Full-Time Position

Local Roofing Sales Representative - Roof Restoration Specialist

Senior Director of Inventory and Merchandise Planning

Senior Site Reliability Engineer at Salla | Makkah

Unlock Your Potential

Experience Level

Qualifications

About the job

About Salla

Senior Site Reliability Engineer at Salla | Makkah

Unlock Your Potential

Experience Level

Qualifications

About the job

About Salla

Senior Site Reliability Engineer at Salla | Makkah

Unlock Your Potential

Experience Level

Qualifications

About the job

About Salla