About the job
Groupon is a dynamic marketplace where customers discover exciting experiences and services every day and local businesses get the support they need to flourish. With over a million merchant partners globally, we connect more than 16 million customers with incredible deals across diverse categories. In a market dominated by e-commerce giants, we pride ourselves on being a unique platform dedicated to improving the performance of local businesses.
As we embark on an ambitious journey to transform our operations, we are relentless in our pursuit of impactful results. Even with thousands of employees spread across various continents, we maintain a culture that fosters innovation, embraces risk-taking, and celebrates achievements. Here, your impact can be immediate thanks to our scale and the speed of our transformation. We embody the "best of both worlds" ethos: big enough to provide resources and scale, yet small enough that individual contributions can lead to significant changes.
Location: Hybrid office model across Prague, Warsaw, Valencia, and Madrid
The Mission: AI-First Transformation
Why This Role?
We are moving from merely experimenting with AI to deploying it at massive scale. As we transition to an AI-First organization, we are establishing a centralized AIOps team to tackle a crucial challenge: turning AI features from disparate prototypes into high-performing, cost-efficient production systems.
As a Senior AIOps Engineer, you will not just manage servers; you will architect the "Golden Paths": the automated, reusable infrastructure that empowers our product teams to deliver LLMs, Vector Search, and AI Agents with unprecedented speed.
The Impact You’ll Make
- Architect the AI Stack: Design and maintain core infrastructure on Kubernetes, encompassing Vector Databases, LLM Gateways (LiteLLM), and workflow automation tools (n8n).
- Enable at Scale: Facilitate AI adoption by developing self-service "Golden Paths" with Terraform and Helm, so engineering teams can deploy RAG pipelines with a single click.
- Operational Excellence: Implement centralized observability, tracing (Langfuse), and governance to ensure our AI systems are reliable, auditable, and secure.

