Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Experience Level
Experience
Qualifications
Experience with cloud services (AWS, Azure, GCP)Proficiency in CI/CD tools and practicesStrong scripting skills (Python, Bash, etc.)Familiarity with containerization (Docker, Kubernetes)Knowledge of monitoring tools and practices
About the job
Join inetum2 as a DevOps/SRE Engineer and play a crucial role in optimizing and automating our software development and deployment processes. You will collaborate with cross-functional teams to improve system reliability, scalability, and performance.
About inetum2
inetum2 is a forward-thinking technology solutions provider, dedicated to delivering innovative digital services and solutions to our clients. Our team is passionate about leveraging technology to drive business success and transformation.
Join inetum2 as a DevOps/SRE Engineer and play a crucial role in optimizing and automating our software development and deployment processes. You will collaborate with cross-functional teams to improve system reliability, scalability, and performance.
Aghanim is hiring a Mid-Level/High-Level DevOps / SRE Engineer in Lisbon. This role focuses on managing and improving our production platform, which runs on Google Cloud Platform (GCP) and Google Kubernetes Engine (GKE). Cloudflare sits at the front, Datadog provides observability, and CI/CD pipelines run through GitHub Actions. Work closely with Senior and Principal engineers to strengthen reliability, expand monitoring, and reduce manual operational work. The systems you support handle high loads and must be ready for sudden traffic spikes. What You Will Do Platform Operations (GCP/GKE) Manage and support production systems on GCP, with a focus on GKE and other managed services. Carry out platform enhancements and operational tasks as directed by more senior engineers. Infrastructure as Code & Delivery Enablement Apply infrastructure changes using Terraform and, where needed, Terragrunt. Develop and maintain Helm charts and Kubernetes manifests. Improve reliability of GitHub Actions and CI/CD workflows, including deployment automation. Monitoring & Observability (Datadog) Create and manage Datadog dashboards and monitors to ensure effective alerting. Find and address monitoring gaps in key system components. Refine alerts to cut noise and improve signal quality. Incident Management Participate in incident response and operational support: triage, mitigation using runbooks, escalation, and follow-up remediation. Contribute to postmortem reviews with clear facts, timelines, and actionable remediation steps. Security Fundamentals (DevSecOps) Set up and operate security tools and monitoring systems. Help triage findings and implement solutions under supervision. Promote secure-by-default practices such as secrets management, access control, and baseline hardening. Cost Awareness Understand and manage operational costs for the platform.
Join Altersolutions as a Site Reliability Engineer (SRE) in Lisbon, where you will play a pivotal role in ensuring the reliability and performance of our cutting-edge systems. As an SRE, you will collaborate with cross-functional teams to enhance the stability and scalability of our services, while implementing best practices in monitoring, automation, and incident response.Your expertise will help us to strive for excellence in our product offerings, guaranteeing that our customers receive the highest level of service. If you are passionate about solving complex problems and want to be part of a dynamic team, we want to hear from you!
Join our dynamic team at inetum2 as a DevOps Engineer and contribute to innovative projects that enhance our operational efficiency. As a key player in our technology team, you'll collaborate closely with developers and operations to ensure seamless integration and deployment of applications.
We are seeking a talented and experienced Senior DevOps Engineer to join our dynamic team at inetum2. In this role, you will play a crucial part in designing, implementing, and managing our infrastructure and CI/CD pipelines. Your expertise will help us optimize our software development processes and ensure seamless deployment.As a key member of our team, you will collaborate with developers, system administrators, and other stakeholders to enhance our system's reliability and performance. You will also be responsible for troubleshooting and resolving issues in our development and production environments.
Join Altersolutions as a Senior DevOps Engineer and play a pivotal role in enhancing our cloud infrastructure. You will collaborate with cross-functional teams to design, implement, and maintain scalable software systems. Your expertise will ensure our operations run smoothly and efficiently.
Join our dynamic team at inetum2 as a DevOps Engineer, where you will play a crucial role in automating processes, improving system reliability, and enhancing the deployment pipeline. Your expertise will help our teams deliver high-quality software more efficiently.
Join Air Apps as a Site Reliability Engineer (SRE)At Air Apps, we are innovators at heart, committed to transforming the landscape of personal and entrepreneurial planning with our groundbreaking AI-powered Personal & Entrepreneurial Resource Planner (PRP). Established in 2018 in Lisbon, our family-founded company has grown to achieve over 100 million downloads globally, with offices in both Lisbon and San Francisco.We thrive on challenging the status quo and are passionate about leveraging AI to create solutions that have a meaningful impact on people's lives. Here, you will have the opportunity to unleash your creativity and contribute to products that empower users worldwide.We invite you to join our mission to redefine resource management and make a difference in the lives of individuals and entrepreneurs.Your RoleAs a Site Reliability Engineer (SRE), you will ensure the reliability, availability, and scalability of our systems. You will operate at the intersection of software development and operational excellence, implementing automation and performance optimization strategies to enhance system resilience.Key ResponsibilitiesDesign and implement scalable, reliable, and fault-tolerant systems in cloud environments.Develop and maintain observability tools for monitoring, logging, and alerting (e.g., Prometheus, Grafana, Datadog, ELK).Automate infrastructure provisioning and incident response using Infrastructure as Code (IaC) tools like Terraform or CloudFormation.Enhance system performance, scalability, and incident response workflows to maximize uptime.Collaborate closely with development and DevOps teams to improve system design and reliability.Conduct root cause analysis (RCA) and implement preventative measures to reduce failures.Ensure high availability through effective load balancing, failover, and disaster recovery strategies.Improve CI/CD pipelines to accelerate deployment speed while maintaining stability.Optimize cloud cost and resource usage for AWS, Azure, or Google Cloud Platform (GCP).Participate in on-call rotations to respond to incidents and maintain system integrity.
Aghanim is hiring a Senior/Principal DevOps Engineer in Lisbon. This position centers on owning and improving a fully cloud-native platform, built on Google Cloud Platform (GCP) and Cloudflare, and monitored through Datadog. The infrastructure is managed with Infrastructure as Code and automated CI/CD pipelines via GitHub Actions. Role Overview This is a hands-on role with significant responsibility. The Senior/Principal DevOps Engineer ensures the platform stays reliable during heavy traffic and rapid growth. The work includes meeting strict SLA/SLO targets, supporting scaling from 10 to 50 times current loads, and optimizing for both efficiency and cost as the company and its microservices expand. Main Responsibilities Cloud Infrastructure Management Oversee and improve production infrastructure on GCP and Cloudflare (cloud-only, no on-premises systems). Maintain high availability and performance for a SaaS platform serving both B2B and B2C customers. Scalability and Highload Management Design and operate systems that handle sudden traffic spikes, with increases up to 10–20 times within seconds. Develop strategies for scaling compute, network, and data layers: autoscaling, capacity planning, and safe degradation. SLA/SLO and Incident Management Monitor and take responsibility for reliability metrics: availability, latency, and error rates as defined by SLA/SLO. Lead incident response, from detection through mitigation, postmortem analysis, and implementing permanent solutions. Infrastructure as Code and Kubernetes Operations Build and maintain Infrastructure as Code using Terraform and Terragrunt when needed. Manage Kubernetes clusters on GKE, including upgrades, scaling, and security improvements. Create and maintain Helm charts and Kubernetes manifests. Observability with Datadog Implement and maintain observability systems in Datadog: metrics, logs, APM, dashboards, monitoring, and alerting.
airapps is looking for a Mobile DevOps Engineer in Lisbon to strengthen its technology team. This position centers on managing and improving deployment workflows for mobile applications. Role overview The Mobile DevOps Engineer will oversee mobile app deployment pipelines, focusing on smooth integration and delivery. The role calls for attention to detail and a drive to streamline processes. Key responsibilities Manage and optimize deployment processes for mobile applications Implement automation to support continuous integration and delivery Work with cloud services to improve operational efficiency Requirements Experience with automation tools and cloud platforms Background in mobile application deployment and integration
Join our dynamic team at Devoteam as a Senior DevOps Engineer in the exciting Telecom Sector. In this role, you will leverage your expertise to streamline our cloud operations, enhance deployment processes, and ensure high availability of our services. You will collaborate with cross-functional teams to implement innovative solutions that drive our projects forward.
airapps is looking for a DevOps Engineer with a focus on backend systems to join the team in Lisbon. This position centers on building and maintaining backend infrastructure that supports the company’s applications and services. Role overview The DevOps Engineer - Backend Specialist will take charge of implementing backend solutions that scale and remain reliable. Collaboration with development teams is a key part of the job, especially when refining deployment workflows and improving infrastructure. What you will do Develop and maintain backend systems to support business needs Work alongside developers to streamline deployment processes Contribute to infrastructure enhancements for better reliability and scalability Help ensure smooth, uninterrupted operation of applications and services Location This role is based in Lisbon.
Join Our Team at 1GLOBALAt 1GLOBAL, we are at the forefront of mobile connectivity, providing advanced solutions for enterprises and consumers worldwide. With our cutting-edge telecom platform, including our proprietary global mobile core network and innovative eSIM technology, we deliver seamless communication services across 40 countries.We proudly serve a diverse clientele, including leading banks, global consumer goods giants, and innovative digital businesses. Our robust infrastructure connects over 70 million users and devices, empowering our partners to thrive in the mobile ecosystem.As a rapidly expanding and profitable enterprise, we surpassed US$200 million in revenue in 2025, with profits exceeding US$25 million. Our growth allows us to continually invest in our platform and global reach. Founded in 2022 by seasoned technology entrepreneurs, we are reshaping the telecommunications landscape as a regulated Mobile Virtual Network Operator (MVNO) in 12 nations and as a telecom operator in another 28.With headquarters in the Netherlands and R&D centres in Lisbon, Berlin, and São Paulo, our team of nearly 500 professionals is dedicated to redefining global mobile connectivity through technological excellence and innovation.About the RoleWe are seeking a skilled DevOps Engineer focused on Cloud and On-premises Infrastructure to enhance our Technology Department. In this role, you will design, deploy, and manage our runtime infrastructure, ensuring it remains secure, scalable, and cost-effective. You will be responsible for implementing container orchestration and service mesh architecture, maintaining multi-account AWS Organization setups, and automating processes to reduce manual intervention. Your expertise will also include system monitoring, maintenance, and ensuring optimal performance of our systems and applications.
About the Role Renesas Electronics Corporation is hiring a Senior DevOps/Site Reliability Engineer in Lisbon. This position focuses on strengthening infrastructure and maintaining dependable services across the organization. What You Will Do Work closely with development teams to coordinate deployments and support ongoing projects. Drive improvements in system performance and reliability. Develop and implement monitoring solutions to catch issues early and maintain service uptime. Location Lisbon
Join our dynamic team at DaCodes, a leading software and digital transformation firm making significant impacts across various industries.With over a decade of experience, we pride ourselves on delivering innovative, technology-driven solutions through our talented team of over 220 #DaCoders, which includes developers, architects, UX/UI designers, project managers, and quality assurance testers. We collaborate with diverse clients across LATAM and the United States, consistently achieving outstanding results.At DaCodes, you will have the chance to advance your career, engage in diverse projects, and contribute to the development of cutting-edge, high-performance iOS applications.Our DaCoders are integral to our success, and you will have the opportunity to work with innovative startups and established global brands, applying your expertise to impactful projects.
Join our dynamic team at inetum2 as a Senior DevOps Engineer specializing in Kubernetes. In this role, you will leverage your expertise in DevOps practices to enhance our software development and deployment processes, ensuring high availability and scalability of our applications. Collaborate with cross-functional teams to design and implement robust CI/CD pipelines, automate infrastructure provisioning, and maintain system reliability.
Join our dynamic team at inetum2 as a DevOps Consultant, where you will leverage your expertise to enhance our clients' operational efficiency through innovative solutions. You will collaborate with cross-functional teams to implement best practices in DevOps, ensuring seamless integration and delivery.
Join our dynamic team as a Site Reliability Engineer, focusing on enhancing our cloud infrastructure. You will be responsible for ensuring the reliability and efficiency of our systems, collaborating closely with development and operations teams. Your expertise will help us maintain optimal performance while implementing innovative solutions.
About the RoleAs a critical member of the Site Reliability Engineering team at iCapital Network, you will play a pivotal role in ensuring that our platform consistently delivers dependable services to our esteemed clients. In this position, you will bridge the gap between software engineering and operational excellence by applying engineering principles to address infrastructure challenges. You will be tasked with designing and implementing scalable systems, architecting observability solutions for actionable insights, and developing automation strategies that enhance platform reliability. This role demands a systematic thinker who can effectively translate business needs into technical solutions and is passionate about fortifying complex systems.Responsibilities:Establish, implement, and refine service level objectives (SLOs) and service level indicators (SLIs) that align with customer and business expectations.Standardize monitoring and alerting practices through “monitors as code” (preferably using Terraform), incorporating quality gates such as severity, ownership, and runbook links.Develop and maintain observability standards encompassing metrics, logs, and traces, including instrumentation and dependency mapping patterns (OpenTelemetry where applicable).Lead technical evaluations and proofs of concept for observability platforms and integrations; set success criteria and outline the migration strategy for adoption.Define and implement reliability and operability standards for Kubernetes-based services, addressing scaling patterns, resource constraints, rollout safety, and establishing baseline dashboards and alerts during service onboarding.Drive automation efforts to reduce toil, enhance repeatability, and expedite recovery processes (incident workflows, runbooks, and remediation where suitable).Act as Incident Commander for high-severity incidents, facilitate postmortems, and promote continuous improvement through actionable items and measurable follow-through using established tooling workflows.
About Us at GoCardlessGoCardless is a pioneering global bank payment solutions provider, trusted by over 100,000 businesses ranging from innovative startups to well-established enterprises. Our platform enables seamless collection and transfer of payments through direct debit, real-time payments, and open banking technology.We process over US$130 billion in payments annually across more than 30 countries, simplifying the collection of both recurring and one-off payments without the hassle, stress, or burdensome fees. Leveraging AI-driven solutions, we enhance payment success rates while minimizing fraud. Our open banking connectivity with over 2,500 banks empowers our customers to make quicker, more informed financial decisions.Headquartered in the UK with offices in London and Leeds, our team also operates in Australia, France, Ireland, Latvia, Portugal, and the United States.At GoCardless, we prioritize supporting you! Our hiring process is designed to be inclusive and accessible. If you require additional support or adjustments, please connect with your Talent Partner — we are here to assist! Remember: while we have certain requirements, we encourage anyone excited about this role to apply!Platform Engineering at GoCardlessThe Platform Engineering team is a diverse, globally distributed group. Currently, we are positioned in London and Riga, with a new hub opening in Lisbon. We collaborate closely with all engineering teams to enable them to build, release, manage, and scale their products effectively.Our focus combines strategic project delivery with operational excellence, with responsibilities including:Project Delivery: Creating new platform components from initial design through to deployment.Operational Support: Maintaining the health and stability of our systems through on-call rotations and effective incident management.Business as Usual (BAU): Ongoing maintenance, improvements, and support for audits and compliance.Our technology stack comprises Golang, Python, Ruby, Terraform, Atlantis, AWS, GCP, Kubernetes, GKE, GitHub, GitHub Actions, ArgoCD, Grafana, Prometheus, and Elastic.
Join inetum2 as a DevOps/SRE Engineer and play a crucial role in optimizing and automating our software development and deployment processes. You will collaborate with cross-functional teams to improve system reliability, scalability, and performance.
Aghanim is hiring a Mid-Level/High-Level DevOps / SRE Engineer in Lisbon. This role focuses on managing and improving our production platform, which runs on Google Cloud Platform (GCP) and Google Kubernetes Engine (GKE). Cloudflare sits at the front, Datadog provides observability, and CI/CD pipelines run through GitHub Actions. Work closely with Senior and Principal engineers to strengthen reliability, expand monitoring, and reduce manual operational work. The systems you support handle high loads and must be ready for sudden traffic spikes. What You Will Do Platform Operations (GCP/GKE) Manage and support production systems on GCP, with a focus on GKE and other managed services. Carry out platform enhancements and operational tasks as directed by more senior engineers. Infrastructure as Code & Delivery Enablement Apply infrastructure changes using Terraform and, where needed, Terragrunt. Develop and maintain Helm charts and Kubernetes manifests. Improve reliability of GitHub Actions and CI/CD workflows, including deployment automation. Monitoring & Observability (Datadog) Create and manage Datadog dashboards and monitors to ensure effective alerting. Find and address monitoring gaps in key system components. Refine alerts to cut noise and improve signal quality. Incident Management Participate in incident response and operational support: triage, mitigation using runbooks, escalation, and follow-up remediation. Contribute to postmortem reviews with clear facts, timelines, and actionable remediation steps. Security Fundamentals (DevSecOps) Set up and operate security tools and monitoring systems. Help triage findings and implement solutions under supervision. Promote secure-by-default practices such as secrets management, access control, and baseline hardening. Cost Awareness Understand and manage operational costs for the platform.
Join Altersolutions as a Site Reliability Engineer (SRE) in Lisbon, where you will play a pivotal role in ensuring the reliability and performance of our cutting-edge systems. As an SRE, you will collaborate with cross-functional teams to enhance the stability and scalability of our services, while implementing best practices in monitoring, automation, and incident response.Your expertise will help us to strive for excellence in our product offerings, guaranteeing that our customers receive the highest level of service. If you are passionate about solving complex problems and want to be part of a dynamic team, we want to hear from you!
Join our dynamic team at inetum2 as a DevOps Engineer and contribute to innovative projects that enhance our operational efficiency. As a key player in our technology team, you'll collaborate closely with developers and operations to ensure seamless integration and deployment of applications.
We are seeking a talented and experienced Senior DevOps Engineer to join our dynamic team at inetum2. In this role, you will play a crucial part in designing, implementing, and managing our infrastructure and CI/CD pipelines. Your expertise will help us optimize our software development processes and ensure seamless deployment.As a key member of our team, you will collaborate with developers, system administrators, and other stakeholders to enhance our system's reliability and performance. You will also be responsible for troubleshooting and resolving issues in our development and production environments.
Join Altersolutions as a Senior DevOps Engineer and play a pivotal role in enhancing our cloud infrastructure. You will collaborate with cross-functional teams to design, implement, and maintain scalable software systems. Your expertise will ensure our operations run smoothly and efficiently.
Join our dynamic team at inetum2 as a DevOps Engineer, where you will play a crucial role in automating processes, improving system reliability, and enhancing the deployment pipeline. Your expertise will help our teams deliver high-quality software more efficiently.
Join Air Apps as a Site Reliability Engineer (SRE)At Air Apps, we are innovators at heart, committed to transforming the landscape of personal and entrepreneurial planning with our groundbreaking AI-powered Personal & Entrepreneurial Resource Planner (PRP). Established in 2018 in Lisbon, our family-founded company has grown to achieve over 100 million downloads globally, with offices in both Lisbon and San Francisco.We thrive on challenging the status quo and are passionate about leveraging AI to create solutions that have a meaningful impact on people's lives. Here, you will have the opportunity to unleash your creativity and contribute to products that empower users worldwide.We invite you to join our mission to redefine resource management and make a difference in the lives of individuals and entrepreneurs.Your RoleAs a Site Reliability Engineer (SRE), you will ensure the reliability, availability, and scalability of our systems. You will operate at the intersection of software development and operational excellence, implementing automation and performance optimization strategies to enhance system resilience.Key ResponsibilitiesDesign and implement scalable, reliable, and fault-tolerant systems in cloud environments.Develop and maintain observability tools for monitoring, logging, and alerting (e.g., Prometheus, Grafana, Datadog, ELK).Automate infrastructure provisioning and incident response using Infrastructure as Code (IaC) tools like Terraform or CloudFormation.Enhance system performance, scalability, and incident response workflows to maximize uptime.Collaborate closely with development and DevOps teams to improve system design and reliability.Conduct root cause analysis (RCA) and implement preventative measures to reduce failures.Ensure high availability through effective load balancing, failover, and disaster recovery strategies.Improve CI/CD pipelines to accelerate deployment speed while maintaining stability.Optimize cloud cost and resource usage for AWS, Azure, or Google Cloud Platform (GCP).Participate in on-call rotations to respond to incidents and maintain system integrity.
Aghanim is hiring a Senior/Principal DevOps Engineer in Lisbon. This position centers on owning and improving a fully cloud-native platform, built on Google Cloud Platform (GCP) and Cloudflare, and monitored through Datadog. The infrastructure is managed with Infrastructure as Code and automated CI/CD pipelines via GitHub Actions. Role Overview This is a hands-on role with significant responsibility. The Senior/Principal DevOps Engineer ensures the platform stays reliable during heavy traffic and rapid growth. The work includes meeting strict SLA/SLO targets, supporting scaling from 10 to 50 times current loads, and optimizing for both efficiency and cost as the company and its microservices expand. Main Responsibilities Cloud Infrastructure Management Oversee and improve production infrastructure on GCP and Cloudflare (cloud-only, no on-premises systems). Maintain high availability and performance for a SaaS platform serving both B2B and B2C customers. Scalability and Highload Management Design and operate systems that handle sudden traffic spikes, with increases up to 10–20 times within seconds. Develop strategies for scaling compute, network, and data layers: autoscaling, capacity planning, and safe degradation. SLA/SLO and Incident Management Monitor and take responsibility for reliability metrics: availability, latency, and error rates as defined by SLA/SLO. Lead incident response, from detection through mitigation, postmortem analysis, and implementing permanent solutions. Infrastructure as Code and Kubernetes Operations Build and maintain Infrastructure as Code using Terraform and Terragrunt when needed. Manage Kubernetes clusters on GKE, including upgrades, scaling, and security improvements. Create and maintain Helm charts and Kubernetes manifests. Observability with Datadog Implement and maintain observability systems in Datadog: metrics, logs, APM, dashboards, monitoring, and alerting.
airapps is looking for a Mobile DevOps Engineer in Lisbon to strengthen its technology team. This position centers on managing and improving deployment workflows for mobile applications. Role overview The Mobile DevOps Engineer will oversee mobile app deployment pipelines, focusing on smooth integration and delivery. The role calls for attention to detail and a drive to streamline processes. Key responsibilities Manage and optimize deployment processes for mobile applications Implement automation to support continuous integration and delivery Work with cloud services to improve operational efficiency Requirements Experience with automation tools and cloud platforms Background in mobile application deployment and integration
Join our dynamic team at Devoteam as a Senior DevOps Engineer in the exciting Telecom Sector. In this role, you will leverage your expertise to streamline our cloud operations, enhance deployment processes, and ensure high availability of our services. You will collaborate with cross-functional teams to implement innovative solutions that drive our projects forward.
airapps is looking for a DevOps Engineer with a focus on backend systems to join the team in Lisbon. This position centers on building and maintaining backend infrastructure that supports the company’s applications and services. Role overview The DevOps Engineer - Backend Specialist will take charge of implementing backend solutions that scale and remain reliable. Collaboration with development teams is a key part of the job, especially when refining deployment workflows and improving infrastructure. What you will do Develop and maintain backend systems to support business needs Work alongside developers to streamline deployment processes Contribute to infrastructure enhancements for better reliability and scalability Help ensure smooth, uninterrupted operation of applications and services Location This role is based in Lisbon.
Join Our Team at 1GLOBALAt 1GLOBAL, we are at the forefront of mobile connectivity, providing advanced solutions for enterprises and consumers worldwide. With our cutting-edge telecom platform, including our proprietary global mobile core network and innovative eSIM technology, we deliver seamless communication services across 40 countries.We proudly serve a diverse clientele, including leading banks, global consumer goods giants, and innovative digital businesses. Our robust infrastructure connects over 70 million users and devices, empowering our partners to thrive in the mobile ecosystem.As a rapidly expanding and profitable enterprise, we surpassed US$200 million in revenue in 2025, with profits exceeding US$25 million. Our growth allows us to continually invest in our platform and global reach. Founded in 2022 by seasoned technology entrepreneurs, we are reshaping the telecommunications landscape as a regulated Mobile Virtual Network Operator (MVNO) in 12 nations and as a telecom operator in another 28.With headquarters in the Netherlands and R&D centres in Lisbon, Berlin, and São Paulo, our team of nearly 500 professionals is dedicated to redefining global mobile connectivity through technological excellence and innovation.About the RoleWe are seeking a skilled DevOps Engineer focused on Cloud and On-premises Infrastructure to enhance our Technology Department. In this role, you will design, deploy, and manage our runtime infrastructure, ensuring it remains secure, scalable, and cost-effective. You will be responsible for implementing container orchestration and service mesh architecture, maintaining multi-account AWS Organization setups, and automating processes to reduce manual intervention. Your expertise will also include system monitoring, maintenance, and ensuring optimal performance of our systems and applications.
About the Role Renesas Electronics Corporation is hiring a Senior DevOps/Site Reliability Engineer in Lisbon. This position focuses on strengthening infrastructure and maintaining dependable services across the organization. What You Will Do Work closely with development teams to coordinate deployments and support ongoing projects. Drive improvements in system performance and reliability. Develop and implement monitoring solutions to catch issues early and maintain service uptime. Location Lisbon
Join our dynamic team at DaCodes, a leading software and digital transformation firm making significant impacts across various industries.With over a decade of experience, we pride ourselves on delivering innovative, technology-driven solutions through our talented team of over 220 #DaCoders, which includes developers, architects, UX/UI designers, project managers, and quality assurance testers. We collaborate with diverse clients across LATAM and the United States, consistently achieving outstanding results.At DaCodes, you will have the chance to advance your career, engage in diverse projects, and contribute to the development of cutting-edge, high-performance iOS applications.Our DaCoders are integral to our success, and you will have the opportunity to work with innovative startups and established global brands, applying your expertise to impactful projects.
Join our dynamic team at inetum2 as a Senior DevOps Engineer specializing in Kubernetes. In this role, you will leverage your expertise in DevOps practices to enhance our software development and deployment processes, ensuring high availability and scalability of our applications. Collaborate with cross-functional teams to design and implement robust CI/CD pipelines, automate infrastructure provisioning, and maintain system reliability.
Join our dynamic team at inetum2 as a DevOps Consultant, where you will leverage your expertise to enhance our clients' operational efficiency through innovative solutions. You will collaborate with cross-functional teams to implement best practices in DevOps, ensuring seamless integration and delivery.
Join our dynamic team as a Site Reliability Engineer, focusing on enhancing our cloud infrastructure. You will be responsible for ensuring the reliability and efficiency of our systems, collaborating closely with development and operations teams. Your expertise will help us maintain optimal performance while implementing innovative solutions.
About the RoleAs a critical member of the Site Reliability Engineering team at iCapital Network, you will play a pivotal role in ensuring that our platform consistently delivers dependable services to our esteemed clients. In this position, you will bridge the gap between software engineering and operational excellence by applying engineering principles to address infrastructure challenges. You will be tasked with designing and implementing scalable systems, architecting observability solutions for actionable insights, and developing automation strategies that enhance platform reliability. This role demands a systematic thinker who can effectively translate business needs into technical solutions and is passionate about fortifying complex systems.Responsibilities:Establish, implement, and refine service level objectives (SLOs) and service level indicators (SLIs) that align with customer and business expectations.Standardize monitoring and alerting practices through “monitors as code” (preferably using Terraform), incorporating quality gates such as severity, ownership, and runbook links.Develop and maintain observability standards encompassing metrics, logs, and traces, including instrumentation and dependency mapping patterns (OpenTelemetry where applicable).Lead technical evaluations and proofs of concept for observability platforms and integrations; set success criteria and outline the migration strategy for adoption.Define and implement reliability and operability standards for Kubernetes-based services, addressing scaling patterns, resource constraints, rollout safety, and establishing baseline dashboards and alerts during service onboarding.Drive automation efforts to reduce toil, enhance repeatability, and expedite recovery processes (incident workflows, runbooks, and remediation where suitable).Act as Incident Commander for high-severity incidents, facilitate postmortems, and promote continuous improvement through actionable items and measurable follow-through using established tooling workflows.
About Us at GoCardlessGoCardless is a pioneering global bank payment solutions provider, trusted by over 100,000 businesses ranging from innovative startups to well-established enterprises. Our platform enables seamless collection and transfer of payments through direct debit, real-time payments, and open banking technology.We process over US$130 billion in payments annually across more than 30 countries, simplifying the collection of both recurring and one-off payments without the hassle, stress, or burdensome fees. Leveraging AI-driven solutions, we enhance payment success rates while minimizing fraud. Our open banking connectivity with over 2,500 banks empowers our customers to make quicker, more informed financial decisions.Headquartered in the UK with offices in London and Leeds, our team also operates in Australia, France, Ireland, Latvia, Portugal, and the United States.At GoCardless, we prioritize supporting you! Our hiring process is designed to be inclusive and accessible. If you require additional support or adjustments, please connect with your Talent Partner — we are here to assist! Remember: while we have certain requirements, we encourage anyone excited about this role to apply!Platform Engineering at GoCardlessThe Platform Engineering team is a diverse, globally distributed group. Currently, we are positioned in London and Riga, with a new hub opening in Lisbon. We collaborate closely with all engineering teams to enable them to build, release, manage, and scale their products effectively.Our focus combines strategic project delivery with operational excellence, with responsibilities including:Project Delivery: Creating new platform components from initial design through to deployment.Operational Support: Maintaining the health and stability of our systems through on-call rotations and effective incident management.Business as Usual (BAU): Ongoing maintenance, improvements, and support for audits and compliance.Our technology stack comprises Golang, Python, Ruby, Terraform, Atlantis, AWS, GCP, Kubernetes, GKE, GitHub, GitHub Actions, ArgoCD, Grafana, Prometheus, and Elastic.