About the job
About Traversal
Traversal stands at the forefront of AI Site Reliability Engineering (SRE) for enterprises, trusted by major corporations globally to effectively troubleshoot, remediate, and preemptively address complex production incidents. Our mission is to liberate engineers from constant firefighting, allowing them to concentrate on innovative, impactful work.
With deep roots in AI research, we are channeling scientific rigor and creativity into establishing a leading AI agent lab for enterprises. We take pride in assembling a talented and collegial team, including researchers from prestigious institutions such as MIT, Harvard, and Berkeley, along with world-class engineers from industry leaders like Citadel Securities, Cockroach Labs, and Datadog. Together, we tackle some of the toughest challenges in AI.
The Role
As a Full-Stack AI Engineer at Traversal, you will be instrumental in designing and building the essential product interfaces fueled by our AI Site Reliability Engineer. This position is perfect for engineers who relish the challenge of owning intricate systems from start to finish, encompassing everything from user interfaces to backend services and infrastructure. You’ll transform advanced AI capabilities into reliable, production-grade product experiences utilized by engineers managing large-scale systems. The ideal candidate will possess a blend of strong product intuition and profound engineering insight, and will be adept at navigating the entire tech stack—from distributed systems and data pipelines to dynamic front-end applications. Collaboration will be key as you work closely with product teams, AI researchers, and infrastructure engineers to define the platform architecture and develop features that facilitate real-time incident detection, root cause analysis, and automated remediation in complex production settings.
Responsibilities
Product Ownership: Lead the end-to-end development of core product functionalities, converting AI-driven insights into user-friendly workflows that empower engineers and minimize cognitive load.
Technical Architecture: Design scalable system architectures that encompass frontend, backend services, and data infrastructure to support real-time observability and automated operations.
API & Platform Development: Create and implement high-performance APIs and service layers that drive the product and ensure seamless integration between AI systems, backend services, and user interfaces.

