About the job
Site Reliability Engineer
Overview:
Join Weedmaps as a Site Reliability Engineer and collaborate with teams across application development, infrastructure, and quality assurance to improve the performance, reliability, and scalability of our web services at Weedmaps.com. We are a fully cloud-native organization: all our services run in Docker containers on Kubernetes, hosted on AWS. Our culture promotes observability, proactive monitoring, and CI/CD automation, enabling us to release multiple production updates daily.
In this role, you will use your engineering expertise to improve system monitoring, streamline CI workflows, and refine our deployment pipelines. You will serve as a knowledge resource for development teams, guiding them in the use of standardized tools for metrics, logging, and deployment. You will also collaborate closely with development and infrastructure teams to identify key service metrics that go beyond the basics, and work with application teams to develop libraries that make their services easy to instrument.
Your Impact:
- Collaborate with stakeholders to establish best practices in monitoring and CI/CD pipelines.
- Troubleshoot issues in our CI and deployment pipelines.
- Promote and support a strong DevOps culture within Weedmaps.
- Identify automation opportunities and advocate for codification across all processes.
- Share best practices regarding collaboration, reliability, security, and performance with all partner teams.
- Take responsibility for the configuration and scaling of applications, ensuring adherence to organizational practices.
- Develop and enhance synthetic monitoring workflows.
