About the job
We are seeking a highly skilled Technical Cloud Architect to design, build, and oversee cloud platforms that truly scale. This pivotal role encompasses ownership of the cloud architecture from inception to execution, ensuring robust infrastructure, networking, security, automation, and reliability. As a key figure bridging engineering, operations, and leadership, you will be responsible for making critical technical decisions, justifying them, and ensuring their successful implementation.
Key Responsibilities:
Architecture & Design
• Design private, public, or hybrid cloud platforms using technologies such as VMware, CloudStack, OpenStack, and Kubernetes.
• Establish reference architectures for compute, storage, networking, and security.
• Own multi-AZ, high availability, and disaster recovery designs to eliminate single points of failure.
• Standardize platform components.
Infrastructure & Platforms
• Architect and optimize solutions across:
- Virtualization (VMware, KVM, CloudStack)
- Containers (Kubernetes, OpenShift, both managed and self-hosted)
- Storage (SAN, object storage, distributed storage such as Ceph, Pure, PowerStore)
- Networking (L2/L3, BGP, EVPN, SDN, load balancing)
• Drive performance, scalability, and cost efficiency.
Automation & IaC
• Implement Infrastructure as Code (Terraform/OpenTofu, Ansible, Helm).
• Define Git-based workflows (GitOps where applicable).
• Strive to eliminate manual provisioning wherever feasible.
Security & Governance
• Integrate security into the design:
- IAM, RBAC, network segmentation
- Secrets management (Vault, Bitwarden, KMS)
- Compliance, audit, and hardening standards
• Establish guardrails that facilitate rather than hinder.
Reliability & Operations
• Design systems for observability:
- Monitoring (Prometheus, Zabbix, Grafana)
- Logging (ELK, Loki)
- Effective alerting mechanisms
• Define Service Level Objectives (SLOs), Service Level Agreements (SLAs), and capacity models.
• Support incident reviews and root-cause analyses with a focus on solutions rather than blame.
Leadership & Enablement
• Serve as the technical authority for cloud-related decisions.
• Review and approve engineering and vendor designs.
• Mentor teams in platform, Site Reliability Engineering (SRE), and infrastructure.
• Challenge poor ideas constructively, even if they originate from management.
Required Skills & Experience
Core Technical Skills
• Strong Linux fundamentals.
• Deep knowledge of:
- Virtualization and cloud platforms
- Networking (routing, switching, load balancing)
- Storage architectures
- Hands-on experience with Kubernetes.
• Proficient in automation and scripting.
