About the job
Latitude AI (lat.ai) is at the forefront of developing cutting-edge automated driving technologies, including Level 3 (L3) solutions, specifically designed for Ford vehicles on a large scale. We are motivated by the mission to transform the driving experience, making it safer, less stressful, and more enjoyable for everyone.
Joining the Latitude team means collaborating with top experts in machine learning, robotics, cloud platforms, mapping, sensor technology, compute systems, test operations, and systems and safety engineering, all working towards a common goal of positively impacting the driving experience for millions.
As a subsidiary of Ford Motor Company, we maintain operational independence to innovate and develop automated driving technology with the agility of a startup. Our headquarters is located in Pittsburgh, with additional engineering centers in Dearborn, Michigan, and Palo Alto, California.
Meet the Team:
As a Senior Site Reliability Engineer, you will play a vital role in building and maintaining our mission-critical systems. Your expertise in monitoring and automation will be essential to ensuring the health, reliability, scalability, and performance of our platforms.
The Site Reliability team collaborates closely with various engineering teams, including data ingestion and processing, mapping, labeling, triage, machine learning (detection, prediction, tracking), motion planning/control, offline simulation, and deployment, to establish uniform service observability and incident response.
Your Responsibilities:
- Develop monitoring solutions to ensure platform health and measurable reliability.
- Create alerting systems and runbooks for swift detection and remediation of platform issues.
- Troubleshoot complex issues across multiple components of the stack, implementing effective fixes to prevent recurrence.
- Participate in on-call rotations and foster a culture of continuous improvement through blameless postmortems.
