About the job
At Replit, we're revolutionizing software development by empowering individuals and businesses to create applications effortlessly through natural language. With a global user base of millions and over 500,000 business users, we are committed to breaking down traditional barriers in software creation.
About the Role:
Join our dynamic Infrastructure Engineering team, where you will play a pivotal role in ensuring the reliability, scalability, and performance of Replit's infrastructure that supports developers worldwide. As a Senior Infrastructure Engineer, you will be at the intersection of development and operations, focusing on automation and best practices that facilitate efficient scaling while ensuring high availability.
We are on the lookout for passionate Senior Infrastructure Engineers who excel in building and maintaining resilient systems at scale. Your objective will be to proactively identify and analyze reliability issues within our infrastructure, designing and implementing innovative software and systems to achieve significant improvements. You will create robust monitoring solutions, automate operational processes, and enhance infrastructure reliability, all while mentoring and fostering a culture of reliability within the broader engineering team at Replit.
Your Responsibilities:
Automate and Innovate: Architect and enhance automation to minimize manual effort and operational burdens. Design and maintain CI/CD pipelines and infrastructure automation using tools like Terraform or Pulumi. Develop self-healing systems capable of automatically addressing common failure scenarios.
Optimize Infrastructure Performance: Collaborate with core infrastructure and product teams to fine-tune and optimize our cloud deployments (Kubernetes, Docker, GCP). Identify and eliminate performance bottlenecks, implement capacity planning, and reduce latency across global regions.
Enhance Developer Experience: Design and execute improvements to our build, test, and deployment systems to accelerate, secure, and enhance software delivery for all engineers.
Foster Cross-Department Improvements: Work closely with service owners throughout Replit to identify challenges and collaborate on implementing enhancements in their specific services.
Develop Shared Tools: Create and manage centralized tools and automation that streamline the engineering lifecycle, from local development to production monitoring.

