About the job
At Hitachi Vantara, we serve as the trusted foundation for data, empowering global innovators to achieve remarkable outcomes. Our robust, high-performance data infrastructure enables diverse clients—from financial institutions to entertainment venues—to harness the full potential of their data.
Take, for instance, the Las Vegas Sphere; it exemplifies how our solutions help organizations automate processes, optimize workflows, and elevate customer experiences. As we embark on our next growth phase, we seek passionate individuals who are eager to make a significant impact through data to join our diverse, global team.
The Role
As a Senior Site Reliability Engineer & DevOps Engineer, your key responsibilities will include:
- Implementing and maintaining CI/CD pipelines, and developing automated workflows for data, model training, and deployment.
- Automating the ML model lifecycle, including deployment, retraining, and updates.
- Managing infrastructure through cloud deployment (AWS) and overseeing data/model versioning and lineage.
- Ensuring system reliability and performance by optimizing model serving and resource utilization.
- Implementing security and governance measures to uphold data integrity and regulatory compliance.
- Collaborating with cross-functional teams to seamlessly integrate models into production environments.
- Embracing Agile practices for enhanced collaboration.
What You Bring
Required Skills:
- Proven experience with CI/CD, automation, and cloud computing (AWS).
- Proficiency in scripting and infrastructure as code.
- Familiarity with machine learning concepts and the ML lifecycle.
- Solid skills in Git, Python/Bash, and operating systems.
- Understanding of TCP/IP, DNS, firewalls, and VPNs.
- Hands-on experience with machine learning and deep learning.
- Familiarity with AI/ML libraries and frameworks.
- A collaborative mindset to work effectively with both technical and non-technical teams.
Soft Skills:
- Exceptional communication, problem-solving, and teamwork abilities.

