Senior AI Engineer - Machine Learning at Gauss Labs | Palo Alto
Experience Level
Senior
Similar jobs
Gauss Labs is seeking a dynamic and skilled Senior AI Engineer to pioneer transformative Industrial AI solutions, setting new standards for artificial intelligence in the manufacturing sector. Our collaborations with leading manufacturing clients provide unparalleled access to extensive real-time data derived from their operations. Leveraging advanced AI tec…
About Us
Hippocratic AI stands at the forefront of generative AI in the healthcare sector. Our innovative platform is the only one capable of engaging in safe, autonomous clinical conversations with patients, supported by our proprietary LLMs in the Polaris constellation, boasting an impressive accuracy rate of over 99.9%.
Why Join Our Team
Revolutionize healthcare with safety-centric AI. We are pioneering the world's first healthcare-specific, safety-oriented LLM, a groundbreaking platform focused on enhancing patient outcomes on a global scale. This is a unique opportunity to contribute to category creation.
Collaborate with visionaries. Co-founded by CEO Munjal Shah alongside a distinguished team of physicians, hospital executives, AI innovators, and researchers from esteemed institutions such as El Camino Health, Johns Hopkins, Washington University in St. Louis, Stanford, Google, Meta, Microsoft, and NVIDIA.
Supported by top-tier investors. We recently secured a $126M Series C funding round at a valuation of $3.5B, led by Avenir Growth, bringing our total funding to $404M with contributions from notable investors like CapitalG, General Catalyst, a16z, Kleiner Perkins, and others.
Build alongside experts in healthcare and AI. Join a team of professionals dedicated to enhancing care, advancing science, and creating transformative technologies that ensure our platform is robust, reliable, and revolutionary.
Location Requirement
We believe collaboration sparks the best ideas. To foster rapid teamwork and a vibrant company culture, this position requires daily presence in our Palo Alto office, five days a week, unless stated otherwise.
About the Role
In healthcare AI, evaluation is crucial: if it can't be measured, it can't be deployed. You will develop systems that assess the safety, accuracy, and readiness of our models for real-world patient interactions: evaluation frameworks, synthetic data pipelines, automated benchmarks, and LLM-as-judge systems. This role presents a high-impact engineering opportunity where your contributions directly influence what is launched into production.
What You’ll Do
- Create and implement evaluation frameworks focused on LLM safety, clinical accuracy, and conversational quality.
- Build synthetic data generation pipelines to rigorously test models across varied clinical scenarios.
- Develop scalable automated and human-in-the-loop evaluation pipelines.
Join Our Innovative Team
Nubank is a leading digital financial platform, serving over 122 million customers across Brazil, Mexico, and Colombia. Our mission is to simplify financial services and empower individuals, marking the start of a vibrant future in Latin America. As a publicly listed company on the New York Stock Exchange (NYSE: NU), we leverage cutting-edge technology and data intelligence to create financial products that are not only accessible but also user-friendly. Our achievements have earned us recognition from prestigious rankings, such as Time 100 Companies, Fast Company’s Most Innovative Companies, and Forbes World’s Best Bank. Explore more about us on our institutional page here.
About the Role
At AI Core, we are expanding our AI initiatives to become the backbone of Nubank's key decision-making systems. We are in search of talented Machine Learning Engineers to spearhead impactful research projects that connect advanced AI technologies with real-world financial systems. Your role will involve tackling intricate challenges using Deep Learning and Foundation Models, ensuring our solutions are scalable, efficient, and yield tangible business outcomes.
As a Machine Learning Engineer (MLE), your responsibilities will include:
- Leading and executing complex applied research initiatives independently, focusing on building and optimizing architectures (e.g., Transformers, GNNs) for critical applications such as Credit, Recommendation Systems, Generative AI, and real-time inference.
- Resolving challenging and ambiguous modeling problems that necessitate collaboration across various teams (Data, Infrastructure, Product), delivering innovative solutions with a clear emphasis on medium-term impact.
- Connecting the research and production worlds by designing architectures that comply with MLOps constraints, ensuring models are optimized for latency, interpretability, and cost-effectiveness.
We invite you to be part of our journey to revolutionize the financial landscape.
Role Overview:
Join Nace.AI as a Machine Learning Engineer, where you will be instrumental in transforming advanced machine learning research into scalable, production-ready applications. Collaborating with interdisciplinary teams, you will pinpoint areas where machine learning can enhance product offerings, design robust model-centric architectures, and guarantee their smooth integration into practical applications. This role demands a harmonious blend of theoretical insight and hands-on engineering, focusing on creating dependable, maintainable, and impactful AI-driven features that align with Nace.AI's strategic goals.
Key Responsibilities:
- Develop and sustain complete ML systems, including synthetic data pipelines, model training, debugging, and performance assessment.
- Enhance large language models (LLMs) and utilize meta-learning strategies to boost model generalization and efficiency.
- Refine existing Nace.AI models by integrating breakthroughs from the latest ML research.
Join our dynamic AI R&D team as an AI Scientist focused on Machine Learning. In this pivotal role, you will lead the development and implementation of advanced deep learning models to address real-world temporal modeling challenges in the manufacturing sector. We are in search of a candidate with extensive practical R&D experience, firmly rooted in robust theoretical principles and possessing deep expertise across various AI disciplines. The ideal candidate will exhibit a profound understanding of cutting-edge machine learning algorithms and techniques, alongside a proven record of contributions to top-tier conferences such as NeurIPS, ICML, ICLR, KDD, CVPR, or ICCV. A solid foundation in computer science and engineering is essential. Familiarity with collaborating alongside software engineering teams to scale and commercialize ML solutions will be highly regarded. This high-impact role merges foundational research, system-level design, and hands-on implementation, allowing you to work closely with cross-functional teams to create innovative solutions that drive strategic decisions and deliver significant business value.
Gauss Labs Talent Pool
Thank you for expressing your interest in joining Gauss Labs! By submitting your application to our Talent Pool, you are taking the first step towards potential career opportunities in the future. We value your qualifications and will reach out to you if a position that matches your skills becomes available. If you have any questions, please do not hesitate to contact our Talent Acquisition Team at recruiting@gausslabs.ai.
About Voltai
At Voltai, we are pioneering the future of artificial intelligence by developing world models and agents capable of learning, evaluating, planning, experimenting, and interacting with the physical world. Our initial focus is on understanding and creating advanced hardware, electronic systems, and semiconductors, utilizing AI to design and innovate beyond human cognitive boundaries.
About Our Team
Our remarkable team is backed by esteemed Silicon Valley investors, Stanford University, and industry leaders including CEOs and Presidents of Google, AMD, Broadcom, and Marvell. We boast a diverse group of former Stanford professors, SAIL researchers, Olympiad medalists, CTOs of prominent tech firms, and high-ranking officials with experience in national security and foreign policy.
What We Are Looking For
- Exceptional AI/ML engineering skills, ideally from top-tier programs in Computer Science, Electrical Engineering, Mathematics, or Physics.
- Demonstrated success in delivering AI/ML projects from initial concept through to production deployment.
- Hands-on experience in fine-tuning and deploying large language models (LLMs) within production environments.
- Experience working with multi-modal models that integrate text, image, or audio inputs.
Bonus Points
- Experience in competitive programming.
- Contributions to open-source projects.
- Recognition through awards or publications in leading journals and conferences.
- Ability to thrive in a dynamic, fast-paced startup environment.
About Nightfall:
Nightfall is an innovative, AI-driven platform specializing in unified data loss prevention and insider risk management. We secure sensitive data across various environments, including SaaS applications, Generative AI tools, email, and endpoint devices. Trusted by numerous clients, from AI pioneers to Fortune 10 banks, Nightfall empowers organizations to innovate safely, mitigating the risks associated with data loss and intellectual property exposure. Our intelligent platform automates data loss prevention, allowing security teams to focus on strategic initiatives by resolving security violations proactively and providing real-time training to users.
Our endeavors are supported by top-tier venture capital firms such as Bain Capital Ventures, Venrock, WestBridge Capital, and Pear VC, alongside cybersecurity leaders including Frederic Kerrest, Maynard Webb, Ryan Carlson, and Kevin Mandia.
About the Role:
We seek a highly skilled technical leader to join our expanding team at Nightfall. As the Lead AI/ML Data Scientist within the AI Engineering organization, you will be pivotal in developing ML/NLP models and Generative AI solutions that enhance our Data Loss Prevention (DLP) and security products. This role involves spearheading research and applying advanced machine learning techniques to address security challenges, while guiding ML and backend engineers in deploying systems into production. You will also be instrumental in shaping the future architecture of our AI platform.
This position is hybrid, requiring three days in the office at our Palo Alto, California location, and represents a fantastic opportunity for those passionate about data science and machine learning engineering.
At Inflection AI, we are dedicated to leveraging the transformative capabilities of artificial intelligence to enhance human well-being and productivity.
The future of AI will be characterized by agents we can trust to act on our behalf.
We are at the forefront of this evolution with our human-centric AI models that integrate emotional intelligence (EQ) with cognitive intelligence (IQ), shifting interactions from mere transactions to meaningful relationships, thereby generating lasting value for individuals and organizations alike.
Our initiatives manifest in two primary forms:
- Pi, your personal AI, designed to be a compassionate companion that enriches everyday life through practical support and insights.
- Platform: large language models (LLMs) and APIs that empower developers, agents, and enterprises to infuse Pi-level emotional intelligence into experiences where empathy and understanding are crucial.
We are building towards a future of AI agents that foster trust, enhance understanding, and create aligned, long-term value for everyone.
About the Role
As a Model Training Engineer, you will be responsible for designing, building, and scaling post-training pipelines that transform general LLMs into brand-fluent, production-ready assistants. Your innovations in fine-tuning and preference optimization techniques (RLHF, DPO, GRPO, RLAIF) will significantly enhance reliability, alignment, and cost-effectiveness.
Grindr LLC
Join us at Grindr in a hybrid position based in our Palo Alto or San Francisco offices, with in-office attendance required on Tuesdays and Thursdays.
Why This Role is Exciting:
As a pivotal figure at Grindr, you will lead our transformative AI journey. This is your opportunity to leverage state-of-the-art machine learning techniques to revolutionize the way millions within the LGBTQ+ community connect, whether through engaging conversations, casual meetups, or meaningful relationships. Our commitment to machine learning is strong, and you will play an essential role in shaping our strategy and execution on this unique global platform.
- Impact from Day One: You will be instrumental in establishing foundational systems in an early-stage ML environment, charting the roadmap for our long-term strategy.
- Innovative Recommendations: Design and scale recommendation platforms that connect millions to their next significant experience, tailored to diverse user intents.
- Conversational Insights: Employ large language models (LLMs) to extract insights and establish best practices for conversational AI, enhancing user engagement with precision.
Key Responsibilities:
- Develop and manage large-scale recommendation systems to serve millions of users while balancing performance and innovation.
- Utilize advanced LLMs to analyze extensive conversation data, enhancing connections among users.
- Prototype, iterate, and deploy production-ready ML solutions addressing real user challenges.
- Provide technical guidance across teams, collaborating with engineering, data science, and product teams to turn innovative ideas into reality.
- Assess and incorporate emerging AI tools and techniques organization-wide to maintain a leading-edge technology stack.
Qualifications We Seek:
- Over 10 years of experience in building ML systems, particularly in developing 0-to-1 systems, platform architecture, and pioneering new capabilities. Familiarity with recommendation systems is advantageous.
- Proven track record of delivering scalable solutions, with proficiency in Python and popular ML frameworks.
- A proactive mindset and the ability to work in a fast-paced, dynamic environment.
About Voltai
Voltai is at the forefront of developing sophisticated world models and intelligent agents capable of learning, evaluating, planning, experimenting, and interacting with the physical environment. Our initial focus is on the realms of hardware development, electronics systems, and semiconductors where artificial intelligence can surpass human cognitive capabilities in design and creation.
About the Team
Supported by leading investors from Silicon Valley, as well as Stanford University, our team includes distinguished individuals such as former Stanford professors, SAIL researchers, Olympiad medalists, and executives from industry giants like Google, AMD, Broadcom, and Cadence. Our diverse expertise encompasses technology, defense, and policy, with a mission to innovate and lead in the field of AI and hardware.
About this Role
As a Lab Automation Engineer, you will be responsible for designing, implementing, and managing the automation infrastructure that supports Voltai’s hardware validation laboratories. This role involves building systems that facilitate automated testing, characterization, and qualification of silicon and board-level designs by seamlessly integrating software, robotics, and data pipelines for ongoing validation and enhancement.
At Rhoda AI, we are pioneering the development of a comprehensive full-stack platform for the next generation of humanoid robots. Our innovative approach encompasses high-performance, software-defined hardware along with foundational and video world models that empower our robotic systems. Our robots are engineered as versatile generalists, adept at navigating intricate, real-world scenarios, including those not encountered during training. Collaborating with a distinguished research team from Stanford, Berkeley, Harvard, and other leading institutions, we operate at the forefront of large-scale learning, robotics, and systems engineering. With over $400M in funding, we are aggressively investing in research and development, hardware innovation, and scaling up manufacturing to bring our vision to life.
We are on the lookout for a Staff / Principal Machine Learning Engineer to take charge of our training platform. This pivotal system is essential for ensuring that large-scale training is reliable, reproducible, and straightforward to execute. You will play a crucial role in defining the lifecycle of training jobs, including their launch, tracking, recovery, and debugging across our clusters. Your contributions will enable researchers to innovate rapidly without infrastructure hindrances.
In this role, you will be at the heart of enhancing research efficiency: when a training job fails, your system will allow for automatic recovery; when experiments become challenging to reproduce, you will implement effective solutions; and when GPU hours are squandered, you will ensure visibility and preventative measures are in place.
AI Residency
Location: Palo Alto, CA (on-site)
About 1X
At 1X, we are pioneering the development of humanoid robots designed to collaborate with humans, addressing labor shortages and enhancing productivity.
About the Role
The AI Residency offers a unique fixed-term opportunity (3–6 months) to engage in transformative AI and robotics initiatives alongside our dedicated team. As a resident, you will contribute to building critical infrastructure for simulation, data management, and machine learning, directly translating research concepts into practical applications. This is your chance to play a vital role in advancing deployed robotic systems while gaining invaluable hands-on experience at the intersection of AI and robotics.
At Rhoda AI, we are pioneering a comprehensive platform for the next generation of humanoid robots. Our cutting-edge technology encompasses high-performance, software-defined hardware alongside sophisticated foundational models and video world models. Our robots are crafted to be versatile and capable of navigating complex, real-world scenarios that may not be covered in traditional training. We are at the forefront of large-scale learning, robotics, and systems integration, boasting a research team that includes esteemed professionals from Stanford, Berkeley, Harvard, and other prestigious institutions. With over $400 million raised, we are committed to significantly investing in research and development, hardware innovation, and scaling up our manufacturing processes to realize our vision.
We are seeking a talented mid-to-senior Application Engineer who will be instrumental in translating real-world customer use cases into effective intelligent robot deployments. You will engage in the design, prototyping, and enhancement of grippers, end-of-arm tooling, workstations, and integration hardware. Additionally, you will develop efficient strategies for robot learning and data collection, operating at the confluence of robotics, artificial intelligence, hardware, and deployment operations.
About Pathway
Pathway is revolutionizing artificial intelligence with the introduction of the world’s first post-transformer model that mimics human thought processes. Our innovative architecture surpasses traditional Transformer models, providing enterprises with unparalleled transparency into model operations. By integrating this foundational model with the fastest data processing engine available, Pathway empowers organizations to transcend mere incremental optimization and achieve genuinely contextualized, experience-driven intelligence. Trusted by prestigious clients including NATO, La Poste, and Formula 1 racing teams, we are at the forefront of AI advancements.
Led by visionary CEO Zuzanna Stamirowska, a complexity scientist, our team includes AI trailblazers such as CTO Jan Chorowski, who pioneered the application of Attention in speech and collaborated with Nobel laureate Geoff Hinton at Google Brain, and CSO Adrian Kosowski, a distinguished computer scientist and quantum physicist who earned his PhD at just 20 years old.
Supported by prominent investors and advisors like Lukasz Kaiser, co-author of the Transformer architecture (the “T” in ChatGPT) and a key researcher in OpenAI's reasoning models, Pathway is headquartered in Palo Alto, California.
The Opportunity
We are on the lookout for passionate Machine Learning/AI Software Engineering interns with a solid foundation in machine learning model research.
Your Responsibilities
- Assist in training Large Language Models (LLMs)
- Conduct benchmarking of LLMs
- Prepare and evaluate training datasets
- Collaborate with the core Pathway Research Team
Your contributions will significantly impact the advancement of the AI landscape.
Hippocratic AI
Hippocratic AI builds generative AI technology for healthcare, focusing on safe, autonomous clinical conversations with patients. The company’s proprietary large language models, known as the Polaris constellation, drive this platform and have achieved an accuracy rate above 99.9%. As a Senior Staff AI Engineer based in Palo Alto, this role centers on advancing healthcare through a safety-first approach. The team is working to launch the first healthcare-specific, safety-centric large language model, with the goal of improving patient outcomes worldwide.
Collaboration and Team
The team includes experienced physicians, hospital executives, AI researchers, and innovators from leading institutions such as El Camino Health, Johns Hopkins, Washington University in St. Louis, Stanford, Google, Meta, Microsoft, and NVIDIA. Co-founder and CEO Munjal Shah leads a group committed to scientific progress and reliable technology in healthcare AI.
Funding and Support
Hippocratic AI is backed by a strong group of healthcare and AI investors. The company recently closed a $126M Series C funding round at a $3.5B valuation, bringing total funding to $404M. Investors include Avenir Growth, CapitalG, General Catalyst, a16z, Kleiner Perkins, Premji Invest, UHS, Cincinnati Children’s, WellSpan Health, John Doerr, Rick Klausner, and others.
Location Requirement
This position requires working onsite at the Palo Alto office five days a week. The company values in-person collaboration to strengthen team culture and accelerate innovation.
Mistral AI
About Mistral AI
At Mistral AI, we harness the transformative power of artificial intelligence to streamline tasks, save valuable time, and foster enhanced creativity and learning. Our innovative technology is crafted to effortlessly integrate into everyday work environments.
We are committed to democratizing AI by offering high-performance, optimized, open-source models, products, and solutions. Our extensive AI platform caters to both enterprise and individual needs, featuring products like Le Chat, La Plateforme, Mistral Code, and Mistral Compute, creating cutting-edge intelligence accessible to all users.
As a vibrant and collaborative team, we are driven by our passion for AI and its potential to revolutionize society. Our diverse workforce excels in competitive settings and is dedicated to fostering innovation. With teams distributed across France, the USA, the UK, Germany, and Singapore, we pride ourselves on our creativity, humility, and team spirit.
Join us in shaping the future of AI at a pioneering company. Together, we can create a lasting impact. Discover more about our culture at https://mistral.ai/careers.
Role Overview
About the Research Engineering Team
The Research Engineering team operates across Platform (shared infrastructure & clean coding practices) and Embedded (integrated within research squads). Our engineers have the flexibility to navigate the research↔production spectrum as their interests and needs evolve.
As a Machine Learning Research Engineer, you will be responsible for building and optimizing large-scale learning systems that underpin our open-weight models. Collaborating closely with Research Scientists, you may join either:
- Platform RE Team: Focus on enhancing our shared training frameworks, data pipelines, and tools utilized across all teams; or
- Embedded RE Team: Become part of a research squad (Alignment, Pre-training, Multimodal, etc.) to turn innovative ideas into scalable, repeatable code.
Key Responsibilities
• Support researchers by managing the complex aspects of large-scale ML pipelines and developing robust tools.
• Bridge cutting-edge research with production: integrate checkpoints, optimize evaluations, and create accessible APIs.
• Conduct experiments utilizing the latest deep-learning techniques (sparsification on 70B+ models, distributed training across thousands of GPUs).
• Design, implement, and benchmark ML algorithms; produce clear and efficient code in Python.
• Deliver prototypes that evolve into production-grade components for Le Chat and our enterprise API.
Hippocratic AI
About Us
At Hippocratic AI, we are pioneers in the realm of generative AI for healthcare. Our innovative system enables safe and autonomous clinical dialogues with patients, achieving over 99.9% accuracy through our proprietary large language models (LLMs) within the Polaris constellation.
Why Join Our Team
Transform healthcare with safety in mind. We are developing the world’s first healthcare-specific, safety-centric LLM, a revolutionary platform aimed at enhancing patient outcomes globally. This represents a new category in healthcare technology.
Collaborate with industry leaders. Co-founded by CEO Munjal Shah alongside a team of esteemed physicians, hospital executives, AI innovators, and researchers from top institutions such as El Camino Health, Johns Hopkins, Stanford, Google, and NVIDIA.
Supported by premier investors. Recently, we secured $126M in Series C funding at a $3.5B valuation, led by Avenir Growth, contributing to a total funding of $404M from notable investors including CapitalG, General Catalyst, and a16z.
Work with the best minds in healthcare and AI. Join a team dedicated to improving healthcare, advancing science, and creating transformative technologies that ensure our platform is powerful and trusted.
About the Role
As an AI Engineer at Hippocratic AI, you will be instrumental in advancing voice-based generative AI in healthcare. Your responsibilities will include designing and developing intelligent systems that power our clinically safe healthcare agents, integrating large language models with real-time voice technology and human-centered design.
This is a hands-on, cross-functional position, requiring close collaboration with AI researchers, product managers, and clinical experts to deploy advanced language and speech models. Your contributions will significantly shape how patients and providers interact safely with generative AI.
At Rhoda AI, we are pioneering the development of a comprehensive foundation for the next generation of humanoid robots. Our focus spans high-performance, software-defined hardware to advanced foundational models and video world models that govern robot functionality. Our robots are engineered to be versatile, capable of navigating intricate, real-world environments and tackling scenarios not previously encountered in training. We stand at the crossroads of large-scale learning, robotics, and systems, bolstered by a research team comprising experts from prestigious institutions such as Stanford, Berkeley, and Harvard. Our ambition is not merely to add features; we are crafting a revolutionary computing platform for physical tasks, underpinned by over $400 million in funding, driving aggressive investments in research & development, hardware innovation, and scaling up manufacturing to bring our vision to fruition.
Role Overview
We are in search of a Principal Machine Learning Systems Engineer to take charge of our training systems' performance from start to finish. You will be instrumental in defining the scaling of our model training, enhancing efficiency, scalability, and accuracy across extensive multimodal training environments. This is a pivotal systems role, not merely focused on infrastructure support. Your contributions will significantly influence our compute utilization efficiency, scalability of models across thousands of GPUs, and the speed of research iterations.
Your Responsibilities
Oversee training performance from start to finish
- Analyze and enhance the performance of large-scale multimodal training encompassing vision, video, proprioception, actions, and language.
- Create systematic performance attributions by breaking down step-time into compute, communication, and input pipeline, along with scaling curves for various cluster sizes and identifying key bottlenecks.
- Drive quantifiable improvements across:
  - Distributed efficiency (e.g., communication and compute overlap, bucketization, topology-aware mapping, and parallelism strategies).
  - Compute efficiency (e.g., identifying kernel hotspots, operator fusion, attention optimization, and minimizing framework/runtime overhead).
  - Memory efficiency (e.g., activation checkpointing, sequence packing, and reducing fragmentation).
Design training systems rather than just tuning them
- Define and refine parallelism strategies including data, tensor, pipeline, sharding, and hybrid approaches.
- Enhance execution efficiency through communication scheduling, graph capture, execution optimization, and runtime enhancements.
- Contribute to the overall system architecture with innovative solutions.
Join Array Labs, a pioneering company dedicated to constructing cutting-edge radar systems that empower humanity to interpret and respond effectively to changes in our physical environment.
We are embarking on an ambitious project to deploy a synchronized fleet of radar satellites, aimed at generating a highly detailed 3D representation of the Earth that is continuously updated. This initiative will facilitate quicker, more informed decision-making for both governmental and commercial entities involved in disaster management, infrastructure robustness, and critical geopolitical intelligence.
Our team designs and constructs our satellites with a comprehensive end-to-end approach, resulting in the creation of the world's most sophisticated Earth observation satellites. Through our fleet, we aim to deliver unparalleled accuracy, extensive coverage, and rapid responsiveness, providing essential insights precisely where they are needed most.
About the Role
As a Senior Electrical Design Engineer, you will be instrumental in designing and validating the digital and mixed-signal electronics that underpin Array's radar payloads and satellite platforms. Your responsibilities will encompass high-speed digital interfaces, precision clocking and timing distribution, power regulation and control circuits, system monitoring and telemetry hardware, as well as fault-tolerant satellite avionics. You will oversee the hardware development lifecycle, including design, prototyping, initial testing, and qualification, collaborating closely with RF, antenna, mechanical, and systems engineers. The hardware you create will significantly influence system performance, power efficiency, stability, and overall reliability in orbit.
Gauss Labs is seeking a dynamic and skilled Senior AI Engineer to pioneer transformative Industrial AI solutions, setting new standards for artificial intelligence in the manufacturing sector. Our collaborations with leading manufacturing clients provide unparalleled access to extensive real-time data derived from their operations. Leveraging advanced AI tec…
About Us: Hippocratic AI stands at the forefront of generative AI in the healthcare sector. Our innovative platform is the only one capable of engaging in safe, autonomous clinical conversations with patients, supported by our proprietary LLMs in the Polaris constellation, boasting an impressive accuracy rate of over 99.9%. Why Join Our Team: Revolutionize healthcare with safety-centric AI. We are pioneering the world's first healthcare-specific, safety-oriented LLM, a groundbreaking platform focused on enhancing patient outcomes on a global scale. This is a unique opportunity to contribute to category creation. Collaborate with visionaries. Co-founded by CEO Munjal Shah alongside a distinguished team of physicians, hospital executives, AI innovators, and researchers from esteemed institutions such as El Camino Health, Johns Hopkins, Washington University in St. Louis, Stanford, Google, Meta, Microsoft, and NVIDIA. Supported by top-tier investors. We recently secured a $126M Series C funding round at a valuation of $3.5B, led by Avenir Growth, bringing our total funding to $404M with contributions from notable investors like CapitalG, General Catalyst, a16z, Kleiner Perkins, and others. Build alongside experts in healthcare and AI. Join a team of professionals dedicated to enhancing care, advancing science, and creating transformative technologies that ensure our platform is robust, reliable, and revolutionary. Location Requirement: We believe collaboration sparks the best ideas. To foster rapid teamwork and a vibrant company culture, this position requires daily presence in our Palo Alto office, five days a week, unless stated otherwise. About the Role: In healthcare AI, evaluation is crucial: if it can't be measured, it can't be deployed. You will develop systems that assess the safety, accuracy, and readiness of our models for real-world patient interactions: evaluation frameworks, synthetic data pipelines, automated benchmarks, and LLM-as-judge systems. This role presents a high-impact engineering opportunity where your contributions directly influence what is launched into production. What You’ll Do: Create and implement evaluation frameworks focused on LLM safety, clinical accuracy, and conversational quality. Build synthetic data generation pipelines to rigorously test models across varied clinical scenarios. Develop scalable automated and human-in-the-loop evaluation pipelines.
Join Our Innovative Team Nubank is a leading digital financial platform, serving over 122 million customers across Brazil, Mexico, and Colombia. Our mission is to simplify financial services and empower individuals, marking the start of a vibrant future in Latin America. As a publicly listed company on the New York Stock Exchange (NYSE: NU), we leverage cutting-edge technology and data intelligence to create financial products that are not only accessible but also user-friendly. Our achievements have earned us recognition from prestigious rankings, such as Time 100 Companies, Fast Company’s Most Innovative Companies, and Forbes World’s Best Bank. Explore more about us on our institutional page here. About the Role At AI Core, we are expanding our AI initiatives to become the backbone of Nubank's key decision-making systems. We are in search of talented Machine Learning Engineers to spearhead impactful research projects that connect advanced AI technologies with real-world financial systems. Your role will involve tackling intricate challenges using Deep Learning and Foundation Models, ensuring our solutions are scalable, efficient, and yield tangible business outcomes. As a Machine Learning Engineer (MLE), your responsibilities will include: Leading and executing complex applied research initiatives independently, focusing on building and optimizing architectures (e.g., Transformers, GNNs) for critical applications such as Credit, Recommendation Systems, Generative AI, and real-time inference. Resolving challenging and ambiguous modeling problems that necessitate collaboration across various teams (Data, Infrastructure, Product), delivering innovative solutions with a clear emphasis on medium-term impact. Connecting the research and production worlds by designing architectures that comply with MLOps constraints, ensuring models are optimized for latency, interpretability, and cost-effectiveness. 
We invite you to be part of our journey to revolutionize the financial landscape.
Role Overview: Join Nace.AI as a Machine Learning Engineer, where you will be instrumental in transforming advanced machine learning research into scalable, production-ready applications. Collaborating with interdisciplinary teams, you will pinpoint areas where machine learning can enhance product offerings, design robust model-centric architectures, and guarantee their smooth integration into practical applications. This role demands a harmonious blend of theoretical insight and hands-on engineering, focusing on creating dependable, maintainable, and impactful AI-driven features that align with Nace.AI's strategic goals. Key Responsibilities: Develop and sustain complete ML systems, including synthetic data pipelines, model training, debugging, and performance assessment. Enhance large language models (LLMs) and utilize meta-learning strategies to boost model generalization and efficiency. Refine existing Nace.AI models by integrating breakthroughs from the latest ML research.
Join our dynamic AI R&D team as an AI Scientist focused on Machine Learning. In this pivotal role, you will lead the development and implementation of advanced deep learning models to address real-world temporal modeling challenges in the manufacturing sector. We are in search of a candidate with extensive practical R&D experience, firmly rooted in robust theoretical principles and possessing deep expertise across various AI disciplines. The ideal candidate will exhibit a profound understanding of cutting-edge machine learning algorithms and techniques, alongside a proven record of contributions to top-tier conferences such as NeurIPS, ICML, ICLR, KDD, CVPR, or ICCV. A solid foundation in computer science and engineering is essential. Familiarity with collaborating alongside software engineering teams to scale and commercialize ML solutions will be highly regarded. This high-impact role merges foundational research, system-level design, and hands-on implementation, allowing you to work closely with cross-functional teams to create innovative solutions that drive strategic decisions and deliver significant business value.
Gauss Labs Talent Pool: Thank you for expressing your interest in joining Gauss Labs! By submitting your application to our Talent Pool, you are taking the first step towards potential career opportunities in the future. We value your qualifications and will reach out to you if a position that matches your skills becomes available. If you have any questions, please do not hesitate to contact our Talent Acquisition Team at recruiting@gausslabs.ai.
About Voltai: At Voltai, we are pioneering the future of artificial intelligence by developing world models and agents capable of learning, evaluating, planning, experimenting, and interacting with the physical world. Our initial focus is on understanding and creating advanced hardware, electronic systems, and semiconductors, utilizing AI to design and innovate beyond human cognitive boundaries. About Our Team: Our remarkable team is backed by esteemed Silicon Valley investors, Stanford University, and industry leaders including CEOs and Presidents of Google, AMD, Broadcom, and Marvell. We boast a diverse group of former Stanford professors, SAIL researchers, Olympiad medalists, CTOs of prominent tech firms, and high-ranking officials with experience in national security and foreign policy. What We Are Looking For: Exceptional AI/ML engineering skills, ideally from top-tier programs in Computer Science, Electrical Engineering, Mathematics, or Physics. Demonstrated success in delivering AI/ML projects from initial concept through to production deployment. Hands-on experience in fine-tuning and deploying large language models (LLMs) within production environments. Experience working with multi-modal models that integrate text, image, or audio inputs. Bonus Points: Experience in competitive programming. Contributions to open-source projects. Recognition through awards or publications in leading journals and conferences. Ability to thrive in a dynamic, fast-paced startup environment.
About Nightfall: Nightfall is an innovative, AI-driven platform specializing in unified data loss prevention and insider risk management. We secure sensitive data across various environments, including SaaS applications, Generative AI tools, email, and endpoint devices. Trusted by numerous clients, from AI pioneers to Fortune 10 banks, Nightfall empowers organizations to innovate safely, mitigating the risks associated with data loss and intellectual property exposure. Our intelligent platform automates data loss prevention, allowing security teams to focus on strategic initiatives by resolving security violations proactively and providing real-time training to users. Our endeavors are supported by top-tier venture capital firms such as Bain Capital Ventures, Venrock, WestBridge Capital, and Pear VC, alongside cybersecurity leaders including Frederic Kerrest, Maynard Webb, Ryan Carlson, and Kevin Mandia. About the Role: We seek a highly skilled technical leader to join our expanding team at Nightfall. As the Lead AI/ML Data Scientist within the AI Engineering organization, you will be pivotal in developing ML/NLP models and Generative AI solutions that enhance our Data Loss Prevention (DLP) and security products. This role involves spearheading research and applying advanced machine learning techniques to address security challenges, while guiding ML and backend engineers in deploying systems into production. You will also be instrumental in shaping the future architecture of our AI platform. This position is hybrid, requiring three days in the office at our Palo Alto, California location, and represents a fantastic opportunity for those passionate about data science and machine learning engineering.
At Inflection AI, we are dedicated to leveraging the transformative capabilities of artificial intelligence to enhance human well-being and productivity. The future of AI will be characterized by agents we can trust to act on our behalf. We are at the forefront of this evolution with our human-centric AI models that integrate emotional intelligence (EQ) with cognitive intelligence (IQ), shifting interactions from mere transactions to meaningful relationships, thereby generating lasting value for individuals and organizations alike. Our initiatives manifest in two primary forms: Pi, your personal AI, designed to be a compassionate companion that enriches everyday life through practical support and insights; and Platform, the large language models (LLMs) and APIs that empower developers, agents, and enterprises to infuse Pi-level emotional intelligence into experiences where empathy and understanding are crucial. We are building towards a future of AI agents that foster trust, enhance understanding, and create aligned, long-term value for everyone. About the Role: As a Model Training Engineer, you will be responsible for designing, building, and scaling post-training pipelines that transform general LLMs into brand-fluent, production-ready assistants. Your innovations in fine-tuning and preference optimization techniques (RLHF, DPO, GRPO, RLAIF) will significantly enhance reliability, alignment, and cost-effectiveness.
Grindr LLC
Join us at Grindr in a hybrid position based in our Palo Alto or San Francisco offices, with in-office attendance required on Tuesdays and Thursdays. Why This Role is Exciting: As a pivotal figure at Grindr, you will lead our transformative AI journey. This is your opportunity to leverage state-of-the-art machine learning techniques to revolutionize the way millions within the LGBTQ+ community connect, whether through engaging conversations, casual meetups, or meaningful relationships. Our commitment to machine learning is strong, and you will play an essential role in shaping our strategy and execution on this unique global platform. Impact from Day One: You will be instrumental in establishing foundational systems in an early-stage ML environment, charting the roadmap for our long-term strategy. Innovative Recommendations: Design and scale recommendation platforms that connect millions to their next significant experience, tailored to diverse user intents. Conversational Insights: Employ large language models (LLMs) to extract insights and establish best practices for conversational AI, enhancing user engagement with precision. Key Responsibilities: Develop and manage large-scale recommendation systems to serve millions of users while balancing performance and innovation. Utilize advanced LLMs to analyze extensive conversation data, enhancing connections among users. Prototype, iterate, and deploy production-ready ML solutions addressing real user challenges. Provide technical guidance across teams, collaborating with engineering, data science, and product teams to turn innovative ideas into reality. Assess and incorporate emerging AI tools and techniques organization-wide to maintain a leading-edge technology stack. Qualifications We Seek: Over 10 years of experience in building ML systems, particularly in developing 0-to-1 systems, platform architecture, and pioneering new capabilities. Familiarity with recommendation systems is advantageous. Proven track record of delivering scalable solutions, with proficiency in Python and popular ML frameworks. A proactive mindset and the ability to work in a fast-paced, dynamic environment.
About Voltai: Voltai is at the forefront of developing sophisticated world models and intelligent agents capable of learning, evaluating, planning, experimenting, and interacting with the physical environment. Our initial focus is on the realms of hardware development, electronics systems, and semiconductors where artificial intelligence can surpass human cognitive capabilities in design and creation. About the Team: Supported by leading investors from Silicon Valley, as well as Stanford University, our team includes distinguished individuals such as former Stanford professors, SAIL researchers, Olympiad medalists, and executives from industry giants like Google, AMD, Broadcom, and Cadence. Our diverse expertise encompasses technology, defense, and policy, with a mission to innovate and lead in the field of AI and hardware. About this Role: As a Lab Automation Engineer, you will be responsible for designing, implementing, and managing the automation infrastructure that supports Voltai’s hardware validation laboratories. This role involves building systems that facilitate automated testing, characterization, and qualification of silicon and board-level designs by seamlessly integrating software, robotics, and data pipelines for ongoing validation and enhancement.
At Rhoda AI, we are pioneering the development of a comprehensive full-stack platform for the next generation of humanoid robots. Our innovative approach encompasses high-performance, software-defined hardware along with foundational and video world models that empower our robotic systems. Our robots are engineered as versatile generalists, adept at navigating intricate, real-world scenarios, including those not encountered during training. Collaborating with a distinguished research team from Stanford, Berkeley, Harvard, and other leading institutions, we operate at the forefront of large-scale learning, robotics, and systems engineering. With over $400M in funding, we are aggressively investing in research and development, hardware innovation, and scaling up manufacturing to bring our vision to life. We are on the lookout for a Staff / Principal Machine Learning Engineer to take charge of our training platform. This pivotal system is essential for ensuring that large-scale training is reliable, reproducible, and straightforward to execute. You will play a crucial role in defining the lifecycle of training jobs, including their launch, tracking, recovery, and debugging across our clusters. Your contributions will enable researchers to innovate rapidly without infrastructure hindrances. In this role, you will be at the heart of enhancing research efficiency: when a training job fails, your system will allow for automatic recovery; when experiments become challenging to reproduce, you will implement effective solutions; and when GPU hours are squandered, you will ensure visibility and preventative measures are in place.
AI Residency. Location: Palo Alto, CA (on-site). About 1X: At 1X, we are pioneering the development of humanoid robots designed to collaborate with humans, addressing labor shortages and enhancing productivity. About the Role: The AI Residency offers a unique fixed-term opportunity (3–6 months) to engage in transformative AI and robotics initiatives alongside our dedicated team. As a resident, you will contribute to building critical infrastructure for simulation, data management, and machine learning, directly translating research concepts into practical applications. This is your chance to play a vital role in advancing deployed robotic systems while gaining invaluable hands-on experience at the intersection of AI and robotics.
At Rhoda AI, we are pioneering a comprehensive platform for the next generation of humanoid robots. Our cutting-edge technology encompasses high-performance, software-defined hardware alongside sophisticated foundational models and video world models. Our robots are crafted to be versatile and capable of navigating complex, real-world scenarios that may not be covered in traditional training. We are at the forefront of large-scale learning, robotics, and systems integration, boasting a research team that includes esteemed professionals from Stanford, Berkeley, Harvard, and other prestigious institutions. With over $400 million raised, we are committed to significantly investing in research and development, hardware innovation, and scaling up our manufacturing processes to realize our vision. We are seeking a talented mid-to-senior Application Engineer who will be instrumental in translating real-world customer use cases into effective intelligent robot deployments. You will engage in the design, prototyping, and enhancement of grippers, end-of-arm tooling, workstations, and integration hardware. Additionally, you will develop efficient strategies for robot learning and data collection, operating at the confluence of robotics, artificial intelligence, hardware, and deployment operations.
About Pathway: Pathway is revolutionizing artificial intelligence with the introduction of the world’s first post-transformer model that mimics human thought processes. Our innovative architecture surpasses traditional Transformer models, providing enterprises with unparalleled transparency into model operations. By integrating this foundational model with the fastest data processing engine available, Pathway empowers organizations to transcend mere incremental optimization and achieve genuinely contextualized, experience-driven intelligence. Trusted by prestigious clients including NATO, La Poste, and Formula 1 racing teams, we are at the forefront of AI advancements. Led by visionary CEO Zuzanna Stamirowska, a complexity scientist, our team includes AI trailblazers such as CTO Jan Chorowski, who pioneered the application of Attention in speech and collaborated with Nobel laureate Geoff Hinton at Google Brain, and CSO Adrian Kosowski, a distinguished computer scientist and quantum physicist who earned his PhD at just 20 years old. Supported by prominent investors and advisors like Lukasz Kaiser, co-author of the Transformer architecture (the “T” in ChatGPT) and a key researcher in OpenAI's reasoning models, Pathway is headquartered in Palo Alto, California. The Opportunity: We are on the lookout for passionate Machine Learning/AI Software Engineering interns with a solid foundation in machine learning model research. Your Responsibilities: Assist in training Large Language Models (LLMs). Conduct benchmarking of LLMs. Prepare and evaluate training datasets. Collaborate with the core Pathway Research Team. Your contributions will significantly impact the advancement of the AI landscape.
Hippocratic AI
Hippocratic AI builds generative AI technology for healthcare, focusing on safe, autonomous clinical conversations with patients. The company’s proprietary large language models, known as the Polaris constellation, drive this platform and have achieved an accuracy rate above 99.9%. As a Senior Staff AI Engineer based in Palo Alto, this role centers on advancing healthcare through a safety-first approach. The team is working to launch the first healthcare-specific, safety-centric large language model, with the goal of improving patient outcomes worldwide. Collaboration and Team The team includes experienced physicians, hospital executives, AI researchers, and innovators from leading institutions such as El Camino Health, Johns Hopkins, Washington University in St. Louis, Stanford, Google, Meta, Microsoft, and NVIDIA. Co-founder and CEO Munjal Shah leads a group committed to scientific progress and reliable technology in healthcare AI. Funding and Support Hippocratic AI is backed by a strong group of healthcare and AI investors. The company recently closed a $126M Series C funding round at a $3.5B valuation, bringing total funding to $404M. Investors include Avenir Growth, CapitalG, General Catalyst, a16z, Kleiner Perkins, Premji Invest, UHS, Cincinnati Children’s, WellSpan Health, John Doerr, Rick Klausner, and others. Location Requirement This position requires working onsite at the Palo Alto office five days a week. The company values in-person collaboration to strengthen team culture and accelerate innovation.
Mistral AI
About Mistral AI: At Mistral AI, we harness the transformative power of artificial intelligence to streamline tasks, save valuable time, and foster enhanced creativity and learning. Our innovative technology is crafted to effortlessly integrate into everyday work environments. We are committed to democratizing AI by offering high-performance, optimized, open-source models, products, and solutions. Our extensive AI platform caters to both enterprise and individual needs, featuring products like Le Chat, La Plateforme, Mistral Code, and Mistral Compute, creating cutting-edge intelligence accessible to all users. As a vibrant and collaborative team, we are driven by our passion for AI and its potential to revolutionize society. Our diverse workforce excels in competitive settings and is dedicated to fostering innovation. With teams distributed across France, the USA, the UK, Germany, and Singapore, we pride ourselves on our creativity, humility, and team spirit. Join us in shaping the future of AI at a pioneering company. Together, we can create a lasting impact. Discover more about our culture at https://mistral.ai/careers. Role Overview. About the Research Engineering Team: The Research Engineering team operates across Platform (shared infrastructure & clean coding practices) and Embedded (integrated within research squads). Our engineers have the flexibility to navigate the research↔production spectrum as their interests and needs evolve. As a Machine Learning Research Engineer, you will be responsible for building and optimizing large-scale learning systems that underpin our open-weight models. Collaborating closely with Research Scientists, you may join either: - Platform RE Team: Focus on enhancing our shared training frameworks, data pipelines, and tools utilized across all teams; or - Embedded RE Team: Become part of a research squad (Alignment, Pre-training, Multimodal, etc.) to turn innovative ideas into scalable, repeatable code. Key Responsibilities: • Support researchers by managing the complex aspects of large-scale ML pipelines and developing robust tools. • Bridge cutting-edge research with production: integrate checkpoints, optimize evaluations, and create accessible APIs. • Conduct experiments utilizing the latest deep-learning techniques (sparsification on 70B+ models, distributed training across thousands of GPUs). • Design, implement, and benchmark ML algorithms; produce clear and efficient code in Python. • Deliver prototypes that evolve into production-grade components for Le Chat and our enterprise API.
Hippocratic AI
About Us: At Hippocratic AI, we are pioneers in the realm of generative AI for healthcare. Our innovative system enables safe and autonomous clinical dialogues with patients, achieving over 99.9% accuracy through our proprietary large language models (LLMs) within the Polaris constellation. Why Join Our Team: Transform healthcare with safety in mind. We are developing the world’s first healthcare-specific, safety-centric LLM, a revolutionary platform aimed at enhancing patient outcomes globally. This represents a new category in healthcare technology. Collaborate with industry leaders. Co-founded by CEO Munjal Shah alongside a team of esteemed physicians, hospital executives, AI innovators, and researchers from top institutions such as El Camino Health, Johns Hopkins, Stanford, Google, and NVIDIA. Supported by premier investors. Recently, we secured $126M in Series C funding at a $3.5B valuation, led by Avenir Growth, contributing to a total funding of $404M from notable investors including CapitalG, General Catalyst, and a16z. Work with the best minds in healthcare and AI. Join a team dedicated to improving healthcare, advancing science, and creating transformative technologies that ensure our platform is powerful and trusted. About the Role: As an AI Engineer at Hippocratic AI, you will be instrumental in advancing voice-based generative AI in healthcare. Your responsibilities will include designing and developing intelligent systems that power our clinically safe healthcare agents, integrating large language models with real-time voice technology and human-centered design. This is a hands-on, cross-functional position, requiring close collaboration with AI researchers, product managers, and clinical experts to deploy advanced language and speech models. Your contributions will significantly shape how patients and providers interact safely with generative AI.
At Rhoda AI, we are pioneering the development of a comprehensive foundation for the next generation of humanoid robots. Our focus spans high-performance, software-defined hardware to advanced foundational models and video world models that govern robot functionality. Our robots are engineered to be versatile, capable of navigating intricate, real-world environments and tackling scenarios not previously encountered in training. We stand at the crossroads of large-scale learning, robotics, and systems, bolstered by a research team comprising experts from prestigious institutions such as Stanford, Berkeley, and Harvard. Our ambition is not merely to add features; we are crafting a revolutionary computing platform for physical tasks, underpinned by over $400 million in funding, driving aggressive investments in research & development, hardware innovation, and scaling up manufacturing to bring our vision to fruition. Role Overview: We are in search of a Principal Machine Learning Systems Engineer to take charge of our training systems' performance from start to finish. You will be instrumental in defining the scaling of our model training, enhancing efficiency, scalability, and accuracy across extensive multimodal training environments. This is a pivotal systems role, not merely focused on infrastructure support. Your contributions will significantly influence our compute utilization efficiency, scalability of models across thousands of GPUs, and the speed of research iterations. Your Responsibilities: Oversee training performance from start to finish. Analyze and enhance the performance of large-scale multimodal training encompassing vision, video, proprioception, actions, and language. Create systematic performance attributions by breaking down step-time into compute, communication, and input pipeline, along with scaling curves for various cluster sizes and identifying key bottlenecks. Drive quantifiable improvements across: distributed efficiency (e.g., communication and compute overlap, bucketization, topology-aware mapping, and parallelism strategies); compute efficiency (e.g., identifying kernel hotspots, operator fusion, attention optimization, and minimizing framework/runtime overhead); and memory efficiency (e.g., activation checkpointing, sequence packing, and reducing fragmentation). Design training systems rather than just tuning them. Define and refine parallelism strategies including data, tensor, pipeline, sharding, and hybrid approaches. Enhance execution efficiency through communication scheduling, graph capture, execution optimization, and runtime enhancements. Contribute to the overall system architecture with innovative solutions.
