Machine Learning Infrastructure Engineer jobs in Sunnyvale – Page 2 | RoboApply Jobs

Results 21–40 of 600 for “Machine Learning Infrastructure Engineer” in Sunnyvale.

Meshy
Full-time|On-site|Sunnyvale

Join Meshy as an AI Infrastructure Engineer

Located in the heart of Silicon Valley, Meshy is a pioneering force in the realm of 3D generative AI. Our mission is to Unleash 3D Creativity, revolutionizing the content creation process. We empower both professional artists and enthusiastic hobbyists to effortlessly craft extraordinary 3D assets, converting text a…

Feb 11, 2026
Cerebras Systems
Full-time|On-site|Sunnyvale CA or Toronto Canada

Cerebras Systems is at the forefront of AI innovation, having developed the world's largest AI chip, which is 56 times greater in size than conventional GPUs. Our revolutionary wafer-scale architecture delivers the computational power of multiple GPUs on a single chip, simplifying programming to a single device experience. This unique approach enables Cerebras to provide unparalleled training and inference speeds, allowing machine learning professionals to seamlessly operate large-scale ML applications without the complexities of managing numerous GPUs or TPUs.

Our clientele includes leading model labs, global corporations, and pioneering AI-native startups. Recently, OpenAI formed a multi-year collaboration with Cerebras to harness 750 megawatts of capacity, revolutionizing critical workloads with ultra-fast inference capabilities.

Thanks to our innovative wafer-scale architecture, Cerebras Inference stands as the fastest Generative AI inference solution globally, boasting speeds over ten times faster than traditional GPU-based hyperscale cloud inference services. This significant enhancement in speed transforms user experiences with AI applications, facilitating real-time iterations and augmenting intelligence through additional agentic computation.

About The Role

In the capacity of a Senior Software Engineer within the ML Integration and Quality team, you will be instrumental in integrating and delivering all software and hardware components of the Cerebras AI platform. Your focus will be on software feature integration and quality assurance, including pre-deployment and production validation of Cerebras' training and inference solutions. You will advocate for superior testing practices, effective debugging methodologies, and exemplary cross-team communication to ensure the delivery of world-class products.

Feb 17, 2026
Wayve Technologies
Full-time|On-site|Sunnyvale

Join Wayve Technologies as a Principal Machine Learning Engineer, where you will lead innovative projects that push the boundaries of application software development. You will collaborate with cross-functional teams to design, implement, and optimize machine learning algorithms that enhance our software solutions.

Mar 18, 2026
Intuitive Surgical, Inc.
Full-time|On-site|Sunnyvale

Join our innovative team at Intuitive Surgical, Inc., where we are redefining healthcare through advanced robotics and computer vision. As a Machine Learning Engineer, you will be at the forefront of developing cutting-edge algorithms that enhance robotic performance and improve patient outcomes. This role offers an exciting opportunity to work on transformative technology that makes a real impact in the medical field.

Feb 11, 2026
ifm-us
Full-time|On-site|Sunnyvale, CA

About the Institute of Foundation Models

We are a pioneering research laboratory focused on developing, understanding, utilizing, and managing foundation models. Our mission is to propel research, cultivate the next generation of AI innovators, and create transformative impacts within a knowledge-driven economy.

Join our dynamic team and seize the opportunity to engage in groundbreaking foundation model training, collaborating with elite researchers, data scientists, and engineers to address the most pressing challenges in AI development. You will contribute to the creation of innovative AI solutions with the potential to revolutionize industries. Your strategic and creative problem-solving abilities will play a crucial role in establishing MBZUAI as a global center for high-performance computing in deep learning, fostering discoveries that will motivate future AI trailblazers.

The Role

As a Machine Learning Engineer at the Institute of Foundation Models, your main duty will be to design and implement cutting-edge machine learning models that tackle real-world issues, pushing the limits of artificial intelligence research. You will work collaboratively with diverse teams to deploy scalable solutions, furthering MBZUAI’s goal of driving significant AI advancements and solidifying the institution’s status as a leader in the international AI research community. Your expertise will be vital in enhancing the performance of large-scale machine learning models and aiding in the development of transformative AI tools that can reshape industries globally.

Mar 17, 2025
DoorDash, Inc.
Full-time|On-site|Sunnyvale, CA

Join our team at DoorDash as a Principal Machine Learning Engineer specializing in Ads and Promotions Delivery. In this pivotal role, you will leverage cutting-edge machine learning techniques to optimize our advertising strategies and enhance customer engagement. Collaborate with cross-functional teams to design and implement scalable machine learning models that drive business growth.

Mar 28, 2026
42dot
Full-time|On-site|Sunnyvale, United States

42dot is seeking a Senior Machine Learning Platform Engineer to support its work in autonomous driving technology. This position is based in Sunnyvale, United States.

Role overview

This role focuses on developing machine learning platforms that support autonomous vehicle systems. The work involves designing and building scalable infrastructure to handle complex ML workloads, with a strong emphasis on performance and reliability.

What you will do

· Lead the creation and enhancement of machine learning solutions for autonomous driving applications.
· Design, implement, and maintain ML platforms to ensure they meet high standards for scalability and reliability.

Requirements

· Extensive experience in building and maintaining machine learning platforms.
· Background in supporting ML solutions for autonomous vehicle technology or similar fields.
· Strong skills in designing scalable and high-performance systems.

Apr 29, 2026
Bosch Group
Internship|On-site|Sunnyvale

About the Internship

Bosch Group is seeking an Automated Driving Machine Learning Intern in Sunnyvale, California. This role offers hands-on experience with real projects in automated driving and machine learning. Interns will apply academic skills to practical challenges in the field.

Apr 15, 2026
Cerebras Systems
Full-time|On-site|Sunnyvale, CA

Cerebras Systems is at the forefront of AI technology, having developed the world's largest AI chip, which is 56 times larger than traditional GPUs. Our innovative wafer-scale architecture delivers the computational power equivalent to dozens of GPUs on a single chip while maintaining the programming simplicity of a single device. This unique approach enables Cerebras to provide unparalleled training and inference speeds, allowing machine learning practitioners to seamlessly run large-scale ML applications without the complexities of managing numerous GPUs or TPUs.

Cerebras proudly serves a diverse clientele, including leading model labs, global enterprises, and pioneering AI-native startups. Notably, OpenAI has recently formed a multi-year partnership with Cerebras to harness 750 megawatts of scale, revolutionizing key workloads with ultra high-speed inference.

Our groundbreaking wafer-scale architecture ensures that Cerebras Inference stands as the world's fastest solution for Generative AI inference, achieving speeds over ten times faster than GPU-based hyperscale cloud inference services. This remarkable increase in speed is transforming the user experience of AI applications, enabling real-time iterations and enhancing intelligence through additional agentic computation.

About The Role

Cerebras is expanding its Machine Learning team to spearhead a new initiative that aligns with our existing teams. We are seeking a Principal Investigator to collaborate with our ML leaders in shaping this new effort while building the team and enhancing our capabilities. This new team will work in concert with our current ML divisions: Field ML, which directly engages with customers, Applied ML, which develops new ML capabilities and applications, and Core ML, which adapts ML algorithms to leverage the unique features of Cerebras hardware. The new team may undertake similar or complementary responsibilities.

The new team will focus on areas such as:

· Post-training and reinforcement learning: Enhancing model deployment quality through advanced training, tuning, and reinforcement learning techniques, concentrating on specific downstream tasks;
· Dataset curation and optimization: Implementing strategies to gather and select high-quality data, facilitating quicker and higher-quality model training and tuning;
· LLM Pretraining: Engaging in...

Feb 17, 2026
Wayve Technologies
Full-time|On-site|Sunnyvale

Wayve Technologies is searching for a Tech Lead, Machine Learning Engineer to join the autonomous vehicle product engineering group in Sunnyvale. This leadership role centers on guiding a team that develops sophisticated machine learning models for future mobility technologies.

Role overview

The Tech Lead will direct the design and deployment of machine learning algorithms tailored for autonomous vehicle systems. The position also includes mentoring junior engineers and collaborating with colleagues across multiple disciplines to ensure projects are delivered on time and meet quality standards.

What you will do

· Lead the creation and implementation of machine learning solutions for autonomous vehicles
· Support and mentor junior engineers to strengthen their technical skills and project impact
· Coordinate with cross-functional teams to achieve project goals efficiently and with high quality

Requirements

· Strong background in machine learning and artificial intelligence
· Proven experience leading or mentoring engineering teams
· Genuine interest in the intersection of AI and autonomous vehicle technology

Apr 22, 2026
Cerebras Systems
Full-time|On-site|Sunnyvale CA or Toronto Canada

Cerebras Systems revolutionizes the AI landscape with the creation of the world’s largest AI chip, a remarkable 56 times larger than conventional GPUs. Our innovative wafer-scale architecture delivers the computational power of numerous GPUs on a single chip, simplifying programming efforts for users. This unique approach enables Cerebras to achieve unparalleled training and inference speeds, empowering machine learning practitioners to seamlessly execute large-scale ML applications without the complexities of managing hundreds of GPUs or TPUs.

Our clientele includes leading model laboratories, global enterprises, and pioneering AI-native startups. Notably, OpenAI recently announced a multi-year partnership with Cerebras to deploy 750 megawatts of scale, significantly enhancing key workloads with ultra-high-speed inference.

Thanks to our groundbreaking wafer-scale architecture, Cerebras Inference provides the fastest Generative AI inference solution globally, exceeding the performance of GPU-based hyperscale cloud inference services by over ten times. This significant speed enhancement transforms the user experience of AI applications, facilitating real-time iterations and augmented intelligence through additional agentic computation.

About The Role

We are on the lookout for a highly skilled and experienced AI Infrastructure Operations Engineer to oversee and manage our state-of-the-art machine learning compute clusters. In this role, you will have the unique opportunity to work with the world’s largest computer chip, the Wafer-Scale Engine (WSE), and the systems that leverage its extraordinary power.

You will play a pivotal role in ensuring the health, performance, and availability of our infrastructure, maximizing compute capacity, and supporting our expanding AI initiatives. This position requires an in-depth understanding of Linux-based systems, expertise in containerization technologies, and experience in monitoring and troubleshooting complex distributed systems. The ideal candidate is a proactive problem-solver with a strong background in large-scale compute infrastructure who is reliable and committed to customer success.

Feb 17, 2026
ifm-us
Full-time|On-site|Sunnyvale, CA

About the Institute of Foundation Models

We are a pioneering research lab focused on the development, understanding, application, and risk management of foundation models. Our mission is to propel research forward, cultivate the next generation of AI innovators, and make significant contributions to a knowledge-driven economy.

Join our dynamic team and engage in the heart of innovative foundation model training, collaborating with top-tier researchers, data scientists, and engineers. Tackle groundbreaking challenges in AI development and contribute to transformative AI solutions that have the potential to revolutionize industries. Your strategic and innovative problem-solving skills will be vital in establishing MBZUAI as a global center for high-performance computing in deep learning, enabling impactful discoveries that inspire the future of AI innovation.

Role Overview

Develop and Enhance Distributed Pre-Training Frameworks
· Implement DeepSpeed / FSDP / Megatron-LM on multi-node GPU clusters.
· Design robust launch scripts, resilient checkpoints, and job monitoring systems (e.g., NCCL/GLOO/GPU).

Transform Mathematical Concepts into High-Performance Production Code
· Prototype novel optimizers or attention mechanisms using PyTorch/NumPy/JAX or similar frameworks.
· Convert prototypes into efficient CUDA/Triton kernels with custom gradients and performance tests.

Enhance Training Efficiency and Stability
· Lead efforts in mixed-precision training, integrating bf16, fp8, etc., into regular workflows while assessing accuracy versus speed improvements and analyzing numerical stability.
· Utilize kernel fusion, communication tuning, and memory optimization to achieve state-of-the-art throughput.

Accelerate Research Progress
· Develop logging and metrics systems, along with experiment-tracking tools, to facilitate rapid iteration.
· Design ablation studies and statistical tests that validate or challenge new concepts.
· Guide interns and junior engineers through clear asynchronous design documentation and code reviews.

You will collaborate closely with researchers, deliver production code, and shape the landscape of large language models.

Jun 9, 2025
Cerebras Systems
Full-time|On-site|Sunnyvale CA or Toronto Canada

At Cerebras Systems, we are at the forefront of AI technology, developing the world's largest AI chip that is 56 times larger than conventional GPUs. Our innovative wafer-scale architecture enables the computational power of dozens of GPUs on a single chip, simplifying programming to the ease of handling one device. This unique design allows us to achieve unparalleled training and inference speeds, empowering machine learning practitioners to seamlessly deploy large-scale ML applications without the complexity of managing numerous GPUs or TPUs.

Our clientele includes leading model labs, global enterprises, and pioneering AI-native startups. Recently, OpenAI announced a multi-year partnership with Cerebras aimed at leveraging 750 megawatts of scale to revolutionize critical workloads through ultra-high-speed inference.

Thanks to our groundbreaking wafer-scale architecture, Cerebras Inference delivers the fastest Generative AI inference solution globally, exceeding GPU-based hyperscale cloud inference services by over tenfold. This significant boost in speed is transforming the user experience of AI applications, facilitating real-time iteration and enhancing intelligence through added agentic computation.

About The Role

As an Applied Machine Learning Research Scientist at Cerebras, you will be instrumental in converting modern machine learning methodologies into scalable, high-performance systems. This position focuses on the intersection of modeling and systems, emphasizing the efficient execution of existing algorithms rather than merely publishing new ones. Your efforts will significantly influence the training, optimization, and deployment of large language models (LLMs) on one of the most sophisticated AI platforms in existence.

You will collaborate closely with fellow researchers and senior engineers to enhance workflows for LLM pretraining, fine-tuning, and reinforcement learning-based post-training. Your responsibilities will encompass building training pipelines, debugging complex system behaviors, improving model quality, and refining data and evaluation strategies. Your contributions will have a direct and meaningful impact on advancing our capabilities in AI.

Mar 5, 2026
LinkedIn Corporation
Full-time|On-site|Sunnyvale

We are seeking a dynamic and experienced Manager for our AI and Machine Learning team at LinkedIn. In this role, you will lead a talented group of engineers and data scientists dedicated to developing cutting-edge solutions that enhance user experience and drive engagement across the platform. Your leadership will be crucial in shaping the direction of our AI initiatives, ensuring they align with our mission to connect the world's professionals.

The ideal candidate will possess a strong background in machine learning algorithms, data analysis, and software development, as well as exceptional communication skills to effectively collaborate with cross-functional teams. If you are passionate about leveraging AI to create impactful solutions, we want to hear from you!

Mar 24, 2026
Meshy
Internship|On-site|Sunnyvale

Join us at Meshy as a Machine Learning Systems Intern, where your passion for AI, graphics, and innovative product development will thrive in a collaborative environment.

What We're Looking For:
· Able to commit to a full-time internship for a minimum of 12 weeks.
· Aiming to transition to a full-time role at Meshy post-graduation (candidates graduating between September 2026 and September 2027 are preferred).
· Open to candidates pursuing undergraduate, master's, or PhD degrees.
· A solid foundation in technical skills, coupled with a drive for innovation and a willingness to tackle challenges.

Your Role

As a key contributor to our team, you will assist in developing the most extensive end-to-end 3D native machine learning systems. This role encompasses the entire ML framework, from pretraining to fine-tuning and inference. We seek individuals with robust hands-on engineering capabilities, a thirst for knowledge, and the ability to excel in a dynamic, ownership-driven setting.

About Us

At Meshy, we envision a world where 3D creation knows no limits. Our mission is to unleash creativity by offering a comprehensive 3D content pipeline, which includes transforming text and images into 3D models, texturing, editing, and animation rigging. We have cultivated a thriving community for creators, providing a platform to share work, draw inspiration, and utilize assets across projects. Recognized as the leader in 3D generative AI (top-ranked in the 2024 A16Z Games survey), our technology is embraced by industry giants like Meta, Square Enix, and DeepMind, impacting sectors like gaming, film, 3D printing, and robotics.

Your Next Challenge

3D is at the forefront of Generative AI, presenting unique challenges in training and inference. Your journey with Meshy will involve a full stack of AI responsibilities, including debugging and monitoring hardware platforms, creating training frameworks, scaling high-throughput 3D data pipelines, and collaborating on innovative model architectures with our research team.

Jan 11, 2026
Cylake Inc.
Full-time|$150K/yr - $250K/yr|On-site|Sunnyvale

Your Contribution

Become an integral part of a dynamic team dedicated to developing the next generation of cybersecurity solutions from the ground up. Work alongside industry experts with a proven history of innovation as you design, construct, and launch groundbreaking products that will make a significant impact in the field. This role offers you the chance to enhance your career and skills as part of a world-class organization from the very outset.

Job Responsibilities

You will play a pivotal role in architecting and implementing the platform layer, from the Bootloader to system software, for a large-scale embedded system. This encompasses image and software lifecycle management, including packaging, upgrades, high availability, and telemetry/debug infrastructure. You will have the chance to design and implement this system from the ground up.

Mar 5, 2026
Applied Intuition, Inc.
Full-time|$126K/yr - $423K/yr|On-site|Sunnyvale, California, United States

About Applied Intuition

Applied Intuition, Inc. is at the forefront of advancing physical AI technologies. Established in 2017 and currently valued at $15 billion, this Silicon Valley powerhouse is dedicated to developing the essential digital infrastructure that will empower intelligent operations in every vehicle and machine worldwide. Our innovative solutions cater to the automotive, defense, trucking, construction, mining, and agriculture sectors, focusing on three pivotal areas: tools and infrastructure, operating systems, and autonomous capabilities. Our reputation is underscored by the trust placed in us by 18 of the top 20 global automakers and the United States military, among others. Our headquarters is in Sunnyvale, California, with additional offices in Washington, D.C.; San Diego; Ft. Walton Beach, Florida; Ann Arbor, Michigan; London; Stuttgart; Munich; Stockholm; Bangalore; Seoul; and Tokyo. Discover more at applied.co.

We promote a collaborative in-office culture and expect our team members to primarily work from their Applied Intuition office five days a week. However, we also value flexibility, allowing our employees to manage their schedules responsibly, which may include occasional remote work, starting the day with morning meetings from home, or leaving early to meet family obligations.

About the Role and Team

We are seeking enthusiastic Research Scientists to join our dynamic Research Group at Applied Intuition. Our mission is to develop pioneering technologies that drive the evolution of physical AI, particularly in two transformative applications: end-to-end autonomous driving and general-purpose robotics. Our team comprises distinguished experts from leading institutions and companies, celebrated for their remarkable contributions to both academia and industry, including eight Best Paper awards at prestigious conferences such as CVPR and ICRA. Learn more about our research initiatives at appliedintuition.com/research.

With access to industry-leading tools and infrastructure, our researchers can leverage millions of miles of data from extensive fleets and implement their innovative methods across diverse autonomous and robotic systems, including self-driving vehicles and autonomous machinery.

Feb 13, 2026
Coram AI
Full-time|On-site|Sunnyvale

Join Coram AI, where we are redefining video security for a modern landscape. Our innovative, cloud-native platform harnesses computer vision and artificial intelligence to empower businesses with enhanced safety, informed decision-making, and rapid operational responses, ranging from real-time alerts to effortless clip sharing and comprehensive visibility across multiple sites.

As a member of our dynamic and agile team, you will embrace clarity, craftsmanship, and impactful contributions. Every team member's voice matters; together, we deliver significant results and shape the future of AI in making the world safer and more interconnected.

About the Role:

At Coram AI, our infrastructure transcends the conventional cloud-based stack. Alongside our AWS and Kubernetes framework, we manage an extensive array of IoT devices remotely. We are seeking a skilled engineer to take charge of a substantial segment of our edge and cloud architecture that supports our IoT product line. The role covers not only infrastructure but also developing and maintaining our proprietary in-house software.

Joining our team means tackling intriguing challenges at the crossroads of user experience, machine learning, and infrastructure. It embodies a commitment to excellence, continuous learning, and delivering exceptional products to our clients in a high-energy startup environment.

Key Responsibilities:
· Develop and maintain production-grade software for our custom edge infrastructure stack.
· Provision and manage resources within AWS.
· Oversee provisioning and management for hundreds of thousands of deployed connected IoT devices.
· Create CI/CD and automation pipelines for various components of the stack.
· Implement observability and telemetry across our cloud applications and edge devices.
· Assist in maintaining compliance with various security standards (e.g., SOC2, HIPAA).
· Enhance developer productivity by optimizing development workflows.

This is an onsite role located in Sunnyvale.

Qualifications:
· Minimum of 3 years of experience in developing production infrastructure on AWS using infrastructure as code tools like Pulumi or Terraform.
· Proficient in Docker and Kubernetes, especially EKS.
· At least 3 years of experience with programming languages such as Python, Go, or similar.

Feb 18, 2026
Applied Intuition, Inc.
Full-time|On-site|Sunnyvale, California, United States

About Applied Intuition

Applied Intuition, Inc. is at the forefront of advancing physical AI. Established in 2017 and currently valued at $15 billion, our Silicon Valley-based company is dedicated to building the digital infrastructure necessary to infuse intelligence into every mobile machine globally. We cater to various sectors, including automotive, defense, trucking, construction, mining, and agriculture, focusing on three primary domains: tools and infrastructure, operating systems, and autonomy. Trusted by 18 of the world's top 20 automakers, along with the U.S. military and its allies, Applied Intuition is committed to delivering cutting-edge solutions that empower physical intelligence. Our headquarters is located in Sunnyvale, California, with additional offices across the globe, including Washington, D.C.; San Diego; Ft. Walton Beach, Florida; Ann Arbor, Michigan; London; Stuttgart; Munich; Stockholm; Bangalore; Seoul; and Tokyo. Discover more at applied.co.

We uphold a strong in-office culture, expecting our team members to work primarily from their Applied Intuition office five days a week. Nonetheless, we value flexibility and trust our employees to manage their schedules responsibly, which may include occasional remote work, morning meetings from home, or early departures for family commitments.

About the Role

Join us in designing the exceptional infrastructure that powers every intelligent machine. In this role, you will be responsible for implementing and enhancing core application libraries and frameworks utilized by engineers across our organization, as well as scaling our developer ecosystem throughout the build and CI infrastructure of our monorepo. If you thrive on tasks that enhance your peers' productivity and can collaborate effectively across functions to meet the demands of a rapidly evolving business, you will be an ideal fit. At Applied Intuition, we empower engineers to take ownership of both technical and product decisions, engage closely with users to gather feedback, and contribute to a dynamic, thoughtful team culture.

The Developer Frameworks team is pivotal in ensuring our engineering talent can operate swiftly and confidently. As the company scales, developer velocity becomes increasingly critical, and you will have the chance to make a significant impact on the speed and success of our projects.

Jan 14, 2026
Wayve
Full-time|On-site|Sunnyvale

Join Wayve as a Cloud Infrastructure Engineer and play a pivotal role in shaping the future of autonomous driving technology. In this dynamic position, you will design, implement, and maintain scalable cloud infrastructure solutions that support our innovative projects. You will collaborate with cross-functional teams to ensure high availability and performance of our cloud services.

Mar 30, 2026
