Applied AI Operations Lead
Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Experience Level
Mid to Senior
Similar jobs
Browse all companies, explore by city & role, or SEO search pages. View directory listings: all jobs, search results, location & role pages.
Canvas Medical
Company OverviewAt Canvas Medical, we are revolutionizing everyday healthcare through our innovative EMR platform designed for healthcare automation. Our mission is to empower care teams with advanced software solutions that streamline data integration, automate workflows, and foster collaboration among developers and clinicians to tackle the most pressing c…
lilasciences
Join lilasciences as a Product Lead specializing in Software and Applied AI, where you will spearhead innovative projects that leverage cutting-edge AI technologies. In this pivotal role, you will collaborate with cross-functional teams to define product vision, strategy, and execution, ensuring alignment with user needs and market trends.
Arcade
Join Our Innovative Team at ArcadeAt Arcade, we are revolutionizing the way physical products are created with our cutting-edge AI platform. We empower individuals to turn their creative ideas into tangible products seamlessly, utilizing natural language and generative AI. Our mission is to democratize product design, making it as effortless as sharing a post online.Backed by a remarkable $42M in funding from industry-leading investors including Reid Hoffman and Ashton Kutcher, our company is a rising star in the tech landscape. Guided by our founder Mariam Naficy and a team steeped in AI and design expertise, we are at the forefront of a new frontier that merges AI, personal expression, and on-demand manufacturing.Your Role as an Applied AI EngineerWe are on the lookout for an Applied AI Engineer to enhance our generative AI capabilities. This position combines hands-on model development with the integration of advanced AI techniques into our production systems. You will collaborate with diverse teams to conduct research, experiment with models, and implement AI-driven products.
Applied Compute
About UsAt Applied Compute, we are pioneering the development of Specific Intelligence for enterprises, creating agents that continuously learn from a company’s processes, data, expertise, and objectives. Our mission is to bridge the gap between isolated AI capabilities and their effective application within real business environments. Traditional AI systems often fall short as they lack the ability to adapt based on feedback. Our innovative continual learning layer captures context, memory, and decision-making processes across the enterprise, enabling specialized agents to engage in meaningful work.What Excites Us: We operate at the exciting intersection of product development and cutting-edge research. Our product team designs the platform that empowers a new generation of digital coworkers, while our research team drives advancements in post-training and reinforcement learning to enhance user experiences. As an applied research engineer, you will work directly with clients to implement models in production, combining robust product development with deep research insights to facilitate AI integration in enterprises.Meet Our Team: Our diverse team consists of engineers, researchers, and operators, many of whom are former founders. We have previously built reinforcement learning infrastructure at OpenAI, established data foundations at Scale AI, and contributed to significant systems at companies like Together, Two Sigma, and Watershed. We collaborate with Fortune 50 clients, including DoorDash, Mercor, and Cognition, and are proud to be backed by reputable investors such as Benchmark, Sequoia, and Lux.Who Thrives Here: We seek individuals who are passionate about applying innovative research and complex systems to solve real-world challenges. You should feel comfortable navigating new environments rapidly—be it a fresh codebase, a client’s data architecture, or an unfamiliar problem domain. A genuine enjoyment for customer interaction, empathy, and a deep understanding of their operational workflows are essential. Candidates with entrepreneurial backgrounds, extensive side projects, or a proven track record of end-to-end ownership typically excel in our environment.
Join Anthropic as an Engagement Manager on our Applied AI team, where you will spearhead the delivery of cutting-edge AI solutions for Fortune 500 companies. In this pivotal role, you will collaborate with customers to create bespoke AI agents that enhance their core business processes. You will oversee the entire project lifecycle from the signed Statement of Work (SOW) to production deployment, coordinating cross-functional teams that include Engineering, Product, Design, and key customer stakeholders. This position goes beyond traditional project management; you will adeptly navigate complex enterprise environments, eliminate technical and organizational obstacles, and drive measurable business outcomes while upholding our commitment to safety and reliability. Work closely with Forward Deployed Engineers (FDEs) to manage stakeholder relationships and organizational intricacies, ensuring seamless delivery of AI innovations. Additionally, you will champion our mission in the field and develop the frameworks that enable scalability in our growing initiatives.
Quizlet Inc.
Quizlet Inc. is looking for an Applied AI Engineer to create AI-driven features that support student learning. This position centers on developing and deploying machine learning solutions aimed at making study experiences more effective and engaging for a global user base. What you will do Design and implement machine learning models to enhance Quizlet’s educational tools Work on features that help students study more efficiently and enjoyably Locations Denver, CO New York, NY San Francisco, CA Seattle, WA
Applied Compute
ABOUT USAt Applied Compute, we are pioneers in developing Specific Intelligence for enterprises, creating agents that learn continuously from a company’s processes, data, expertise, and objectives. Our mission is to establish a continual learning platform that captures context, memory, and decision traces throughout the organization, enabling specialized agents to perform meaningful tasks.Why Join Us: Our team operates at a unique intersection of innovation. Our product team is responsible for crafting a platform that serves as the backbone for a new generation of digital coworkers. Meanwhile, our research team explores the cutting edge of post-training and reinforcement learning to enhance product experiences. Our applied research engineers collaborate closely with clients to deploy agents effectively in real-world scenarios. This synergy of robust product development, extensive research, and direct client engagement is essential for us to revolutionize AI in the enterprise landscape.Our Team: Comprising engineers, researchers, and operations experts, our team includes many former founders with extensive experience. We have developed RL infrastructure at OpenAI, data foundations at Scale AI, and other systems at companies like Two Sigma and Watershed. We proudly serve Fortune 50 clients and are supported by top-tier investors including Kleiner Perkins, Benchmark, Sequoia, Lux, and Greenoaks.Who Thrives Here: We seek individuals passionate about utilizing cutting-edge research and complex systems to address real-world challenges. Comfort navigating diverse environments, whether it’s a new codebase, unfamiliar customer data architecture, or unexplored problem domains, is essential. Our team values genuine client engagement — listening, empathizing, and understanding the realities of work in their organizations. Those with entrepreneurial spirits, rich project experiences, or proven capabilities to manage tasks end-to-end will excel in our environment.THE POSITIONAs a Software Engineer, you will be instrumental in building the products and interfaces utilized by customers and internal teams. You will manage the entire application platform stack, from collaborative human-AI workspace systems to backend workflows orchestrating sandboxed agent sessions, and the continual learning SDK that provides engineers with oversight of the agent development lifecycle.
Demandbase
Join Demandbase as an Applied AI Scientist, where you'll have the opportunity to push the boundaries of AI technology in a dynamic and innovative environment. As a key member of our team, your role will involve developing and implementing advanced AI solutions that drive impactful business results.We're looking for a passionate individual who thrives on solving complex problems and is eager to contribute to AI advancements. You will collaborate with cross-functional teams to integrate AI models into our existing frameworks and enhance our product offerings.
Join mesh as an AI Operations Lead in our San Francisco office. In this pivotal role, you will oversee the implementation and optimization of AI-driven solutions, ensuring operational excellence and innovation. You will collaborate with cross-functional teams to drive efficiency and improve outcomes across various projects. Your expertise will help shape our AI strategies and enhance our operational capabilities.
Artisan
About ArtisanAt Artisan, we are pioneers in creating fully autonomous AI employees – not mere chatbots or copilots, but digital workers capable of performing meaningful tasks.Our flagship product, Ava, is an AI-powered Business Development Representative (BDR) utilized by hundreds of companies. Ava excels at researching leads, crafting and sending emails in clients' unique voices, managing complex outbound sequences, autonomously optimizing her performance, and even addressing objections while scheduling meetings. She is not just a tool; she is a collaborative teammate.As a proud Y Combinator W24 company, we have successfully raised over $35 million in funding and are currently achieving over $8 million in annual recurring revenue (ARR). We are now embarking on the development of Ava 2.0, which will redefine the capabilities of AI employees. The engineering challenges involved are substantial, and the scope of our project is vast.Role OverviewJoin our innovative team as the third Applied AI Engineer at Artisan! You will be instrumental in pushing the boundaries of AI employee capabilities and guiding our product's future direction.Evaluate LLMs: Select optimal models for various tasks while considering cost, latency, reliability, and accuracy.Architect Prompt Frameworks: Design agent behaviors for Ava's essential workflows such as email generation, chat interactions, meeting scheduling, prospect research, and more.Optimize Multi-Step Agent Chains: Implement retrieval-augmented generation (RAG), integrate web searches, and utilize tools like CRMs and APIs.Drive Infrastructure Decisions: Lead choices related to routing, orchestration, evaluation loops, and persistent memory for agents.Build Safety and Trust Mechanisms: Collaborate with the product team to design user guardrails, fail-safes, and success metrics.Explore Emerging Modalities: Investigate and deploy voice AI, talking head technology, and multi-modal reasoning to enhance Ava's human-like interactions.Design Autonomous Agent Workflows: Create workflows that enable strategic decision-making, real-time self-optimization, and measurable outcomes.Location: San Francisco, New York, or Remote USATeam: AIReports to: CPTO, Sam Stallings
Abby Care
Join Our Mission at Abby CareAt Abby Care, we are dedicated to transforming the landscape of family caregiving, addressing one of the most significant challenges of our time. Our goal is to empower over 50 million unpaid family caregivers across the United States by providing them with the training and support they need to get compensated for the invaluable care they provide at home.We are developing a cutting-edge, tech-driven family-first care platform designed to enhance care delivery, improve health outcomes, and ensure a superior experience for families nationwide. As we expand our impact, we are seeking passionate individuals to join our team. With partnerships alongside leading insurance providers, healthcare organizations, and community groups, we are backed by top-tier, mission-driven venture capitalists who share our vision of supporting families throughout the country.Our team comprises high-caliber professionals with backgrounds at renowned companies such as Uber, Scale AI, DoorDash, Dropbox, and Meta. Together, we are reimagining family caregiving.Your Role as an Applied AI EngineerWe are on the lookout for a skilled and motivated Applied AI Engineer to join our dynamic team. Reporting to the VP, Head of Engineering, this full-time position is based in San Francisco, CA, with an in-person presence required four days a week.In this role, you will play a crucial part in designing and building AI-driven products that serve as intelligent copilots for families and healthcare providers. You will create AI systems that facilitate family caregiving, enhance clinician support in care delivery, and explore innovative home-based care models. Your work will involve hands-on engagement with AI models and real-world clinical and operational data, contributing to the evolution of AI-assisted caregiving and shaping the future of care as a key early engineer.
Automat
Join the Visionaries at Automat!At Automat, we are a collective of innovative technologists hailing from prestigious backgrounds at Google Creative Lab and Samsung’s innovation Think Tank. Our mission is to transform the future of business through intelligent agents that enhance operational efficiency and eventually manage enterprises autonomously. We are leaders in redefining Enterprise AI Agents and Intelligent Document Processing, creating cutting-edge tools and infrastructure to seamlessly integrate AI advancements into practical applications.We cherish curiosity, teamwork, and impactful contributions over strict credentials. If you are passionate about AI and automation, excel in dynamic and creative settings, and yearn to engage in groundbreaking projects, we invite you to apply!While this is a senior role, we believe that unique talents and perspectives can surpass traditional experience metrics. We seek an individual who can make an immediate impact from day one and contribute to our team’s learning journey.Your RoleAs an Applied AI Engineer, you will operate at the crossroads of product development, engineering, and research. You will work hand-in-hand with skilled engineers and end users to create innovative tools for our internal teams and external clientele. Your contributions will help shape technical solutions, guide strategic initiatives, and enrich our company culture.Key Responsibilities:Collaborate with product and engineering teams to gain insights into customer challengesResearch and implement real-world applications of advanced AI modelsDesign and execute impactful solutions for both internal tools and customer-facing productsInfluence engineering practices, guide product development, and foster a collaborative cultureWhat Motivates You:A passion for curiosity, creativity, and quick adaptationCreating full-stack applications that bring AI to lifeKeeping abreast of the latest trends in AI and LLM prompting techniquesChallenging the limits of what is achievable with AI agentsThriving in fast-paced, collaborative environments
WorkOS
About WorkOS WorkOS is revolutionizing the developer landscape by creating cutting-edge tools and APIs that empower businesses to achieve Enterprise Ready status. Our platform is the backbone for authentication, identity management, authorization, and other essential infrastructures, enabling developers to securely scale their products for large enterprises.Having recently secured a $100M Series C funding round, valuing the company at $2B, we are supported by leading investors such as Meritech, Sapphire, Greenoaks, Craft, Abstract, and Audacious. WorkOS is proud to serve many of the fastest-growing AI companies, including OpenAI, Cursor, Perplexity, Vercel, and Plaid.As AI technology evolves, WorkOS stands at the forefront of Human and Agent Authentication, Identity, and Access Control, helping businesses navigate crucial questions about agent identity and permissions. Our rapidly expanding customer base features hundreds of innovative software companies developing the next generation of enterprise-ready solutions.About the RoleWe are assembling an Applied AI team dedicated to significantly enhancing productivity across Engineering, Sales, Support, and Operations while also developing AI functionalities for our clients.As an Applied AI Engineer, you will design and deploy transformative AI systems that redefine how WorkOS develops, markets, and supports its software. You will focus on creating robust tools and workflows that are essential for daily operations, while also exploring novel AI capabilities for our users.Join a small, high-ownership team that:Identifies challenges based on measurable impact.Rapidly transitions from idea to prototype to production in days or weeks.Quickly adapts to evolving models, tools, and best practices.This role offers substantial visibility across the organization.What You’ll DoCollaborate closely with Engineering, Sales, Support, and Operations to identify high-impact internal challenges where AI can enhance efficiency or facilitate new workflows.Design and construct tools that integrate seamlessly into daily operations—agents, automations, and workflows that are reliable, observable, and easy to maintain.Leverage LLMs, embeddings, retrieval systems, and tool-calling to connect with documents, Slack, GitHub, CRM systems, analytics, support frameworks, and internal services.Transform repetitive, multi-step manual tasks into streamlined, AI-driven processes that span various applications and data sources.Stay informed about emerging models and tools, conduct focused experiments, and assist the team in continuous improvement.
Role Overview:As an Applied AI Engineer at Mem0, you'll lead the charge from concept to execution. Your mission is to transform abstract customer use cases into tangible proofs-of-concept that vividly demonstrate Mem0's capabilities. This entails rapid full-stack prototyping, leveraging AI tools, and rigorously experimenting with memory retrieval methodologies until the end-to-end use case is operational. Collaborating closely with Research and Backend teams, you'll clearly communicate trade-offs and deliver successful prototypes ready for production refinement.Your Responsibilities:Develop Proofs of Concept: Create comprehensive demos (UI + APIs + data) that seamlessly integrate Mem0 into customer workflows.Innovate in Memory Retrieval: Experiment with various embeddings, indexing techniques, hybrid search, re-ranking, chunking/windowing, prompts, and caching to achieve optimal task-level quality and latency.Collaborate on Prototypes: Implement groundbreaking ideas and techniques sourced from academic research, compare them against established baselines, and adopt the most effective solutions.Design Evaluation Frameworks: Establish small gold sets and lightweight metrics to assess POC success; equip demos with basic telemetry.Integrate AI Tools: Combine LLMs, vector databases, Mem0 SDKs/APIs, and third-party services into cohesive workflows.Foster Close Collaboration: Work alongside Backend teams to establish clean contracts and data models; collaborate with Research on hypotheses; share insights and future steps.Document and Handoff: Produce clear documentation, scripts, and templates to facilitate rapid production transition by Engineering.Minimum Qualifications:Full-Stack Proficiency: Experience with Next.js/React for front-end development and Python backends (FastAPI/Django/Flask) or Node.js as required.Proficient in Python and TypeScript/JavaScript; capable of building APIs, wiring data models, and deploying quick demos.Hands-on experience with the LLM/RAG stack: embeddings, vector databases, retrieval strategies, and prompt engineering.Proven ability to rapidly prototype: transforming ideas into demos within days, not months; clear documentation of outcomes and trade-offs.Capacity to design small, impactful evaluations for use cases (quality + latency) and iterate based on data-driven evidence.Outstanding communication skills with Research and Backend teams; ability to produce crisp specifications, readable code, and honest status updates.
Paraform
Role Overview Paraform is hiring an Applied AI Engineer in San Francisco. This role focuses on building and deploying AI systems that directly serve real users. The ideal candidate brings 2-5 years of experience, a strong grasp of modern LLM-based technologies, and a track record of turning advanced models into reliable product features. Success in this role depends on sound product sense and the ability to weigh trade-offs between LLM and traditional machine learning approaches. Experience with LLM-powered applications, retrieval systems, agentic workflows, or automation is valuable. Familiarity with classic ML techniques, such as ranking, recommendation, or classification, will help in designing hybrid systems that balance performance, cost, and reliability. What You Will Do Design and build AI systems to improve matchmaking, ranking, and automation in the Paraform marketplace. Develop LLM-driven features, including retrieval pipelines and agentic workflows, to streamline recruiter and company interactions. Own systems end-to-end: from data pipelines and model design to deployment, monitoring, and iteration in production. Work closely with product managers, ML engineers, and full-stack teams to deliver AI capabilities that shape marketplace outcomes. Create evaluation frameworks to measure real-world performance, reliability, and business impact, not just offline metrics. Set best practices for building and maintaining production AI systems, balancing model quality, cost, latency, and maintainability. Advance the integration of AI into product experiences across the platform. What We Look For 2-5 years of experience at an AI-focused startup (Series A through D). Background working on products with a broad user base, beyond single-enterprise deployments. Proficient in Python and Typescript. Experience developing agentic systems that drive measurable business or user outcomes. Comfort with ambiguity and building in 0 to 1 environments. Ability to communicate technical trade-offs clearly to non-technical stakeholders.
LangChain
About Us:At LangChain, we strive to revolutionize the accessibility of intelligent agents. Our goal is to provide a robust framework for agent engineering that empowers developers to transition from initial prototypes to production-ready AI agents that can be trusted by teams. What started as a suite of widely embraced open-source tools has evolved into a comprehensive platform designed for the building, assessment, deployment, and management of agents at scale.Currently, our tools, including LangChain, LangGraph, LangSmith, and Agent Builder, are utilized by various teams delivering tangible AI solutions across both startups and large corporations. Millions of developers depend on LangChain to empower AI initiatives at prestigious companies like Replit, Clay, Coinbase, Workday, Lyft, Cloudflare, Harvey, Rippling, Vanta, and 35% of the Fortune 500.With $125M raised in our Series B round from IVP, Sequoia, Benchmark, CapitalG, and Sapphire Ventures, we are in a phase of rapid growth, continuously innovating new products while ensuring every team member has a significant impact on our development and collaborative processes. LangChain is a place where your contributions can influence the real-world application of this transformative technology.About the Role:We are seeking talented Applied AI Engineers to assist in creating AI agents that enhance various aspects of LangChain, from Marketing and Go-To-Market strategies to Recruiting, Support, Internal Tools, and our Core Product.In this position, you will take ownership of specific problem areas, collaborating closely with relevant teams to design, construct, and deploy production-grade agents, workflows, and applications that revolutionize our operational processes. Your contributions will directly support LangChain’s vision of making intelligent, autonomous software a reality for both our internal teams and our clients. Some projects may be open source, contributing to the LangChain and LangGraph ecosystems while establishing new benchmarks for AI development practices.If you are a full-stack software engineer eager to implement AI agents in practical use cases and witness their impact on business outcomes, we invite you to apply.
Eragon
Job OverviewJoin our team at Eragon as an Applied AI Intern, where you will play a crucial role in developing and deploying advanced AI systems. Collaborating closely with engineers and researchers, you'll contribute to transforming AI models from theoretical concepts into impactful real-world applications.This internship offers a unique opportunity for hands-on experience across various aspects of modeling, data management, and systems integration, working on projects that directly reach users.Main ResponsibilitiesModel Development: Assist in refining, assessing, and implementing machine learning models to address practical challenges.System Implementation: Collaborate in building and integrating AI-driven features into live production systems.Data & Pipelines: Engage with datasets to facilitate training, evaluation, and iterative improvements.Experimentation: Conduct experiments, analyze outcomes, and enhance model performance through iterative testing.Evaluation & Monitoring: Contribute to the development of evaluation frameworks and assist in monitoring system performance metrics.Cross-Functional Collaboration: Work alongside engineering and product teams to aid in the development of new features.QualificationsEducation: Currently pursuing a Bachelor’s or Master’s degree in Computer Science, Engineering, or a related discipline.Technical Skills: Proficient in Python with familiarity in machine learning frameworks such as PyTorch or TensorFlow.ML Fundamentals: Strong understanding of fundamental machine learning concepts and workflows.Problem-Solving Skills: Demonstrated ability to dissect problems and contribute to effective solutions.Curiosity & Initiative: A keen desire to learn and make meaningful contributions in a fast-paced setting.Preferred QualificationsExperience in machine learning projects, internships, or research roles.Familiarity with large language models, AI agents, or data pipelines.Demonstrated experience in building projects beyond academic coursework.Genuine interest in real-world AI applications.
Distyl AI
About Distyl AIDistyl AI specializes in creating high-performance AI systems that enhance the fundamental operational processes of Fortune 500 companies. Through a strategic alliance with OpenAI, proprietary software accelerators, and extensive expertise in enterprise AI, we deliver effective AI solutions with swift time-to-value, often within a quarter.Our innovations have empowered Fortune 500 clients in various sectors, including insurance, consumer packaged goods, and non-profit organizations. Joining our team means you will assist organizations in recognizing, developing, and extracting value from their Generative AI investments, frequently for the first time. We prioritize customer needs, working backward from the client's challenges and ensuring we generate financial benefits while enhancing the experiences of end-users.Distyl is guided by seasoned leaders from top-tier companies like Palantir and Apple and enjoys backing from prominent investors including Lightspeed, Khosla, Coatue, Dell Technologies Capital, Nat Friedman (Former CEO of GitHub), Brad Gerstner (Founder and CEO of Altimeter), along with board members from numerous Fortune 500 firms.What We Are Looking ForAt Distyl, we are at the forefront of leveraging AI within enterprises. We seek imaginative researchers who aspire to go beyond incremental enhancements on benchmarks and are eager to redefine the application of software in innovative ways.Our researchers hail from diverse academic disciplines but possess a robust research background, operate in an AI-centric manner, and would find conventional research environments unfulfilling.Key ResponsibilitiesThe AI Systems team is dedicated to architecting complex, comprehensive solutions that integrate perception, reasoning, planning, and execution. Researchers amalgamate various components (LLMs, retrievers, evaluators, memory systems, and execution agents) into resilient, scalable systems that deliver consistent performance across dynamic enterprise workflows.Researchers in AI Systems examine the principles governing intricate system interactions. They analyze coordination, information flow, and emergent behavior across multiple agents and models. Their research reveals the foundational mechanics of robustness, composability, and alignment, ultimately establishing the design paradigm for constructing intelligent systems.
OpenEvidence
RoleAs a Software Engineer specializing in Applied AI at OpenEvidence, you will be instrumental in developing comprehensive systems that harness cutting-edge AI models to create impactful, user-centric products.We seek exceptional builders who thrive outside conventional boundaries. Our engineers engage across various projects and products, taking ownership to maximize their impact.About UsOpenEvidence stands as the leading medical AI platform globally.In just over a year, over 40% of US clinicians have adopted our solution, driven by product-led growth and word-of-mouth recommendations. We are a $12 billion company with a talented 30-person engineering team hailing from prestigious institutions like MIT, Harvard, and Stanford. We believe that transformative products emerge from a small cadre of exceptional, autonomous builders who are empowered to take ownership and drive swift progress. We are expanding our team to capture an extraordinary opportunity to establish a benchmark platform for medical AI.If you are a top-tier engineer or scientist eager to innovate at the forefront of technology and deliver meaningful results that affect millions of lives, we want to connect with you.CultureWe believe that work should reflect a world-class commitment. Innovating from 0 to 1 and scaling from 1 to 1000 is akin to a professional sport, and we set the bar at uncompromising excellence. We understand that groundbreaking technologies are only possible through complete ownership. Significant achievements arise when individuals take decisive action.Who are you?If you are seeking a typical 9-5 job or merely wish to write papers, this position is not for you. If you are ready to dive in, get hands-on, face challenges head-on, and create something impactful that can reach millions and generate significant revenue, this role might be your perfect fit.The ideal candidate is a brilliant builder—intelligent, driven, resourceful, independent, meticulous, motivated, hardworking, and humble. Does that sound rare? It is indeed rare; we currently have only 30 such individuals, and we are eager to find more.LocationAll full-time engineering roles require in-person attendance five days a week in San Francisco or Miami.
ABOUT BASETENAt Baseten, we empower the leading AI companies of today, including Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer, by providing essential inference capabilities. Our unique blend of applied AI research, adaptable infrastructure, and intuitive developer tools enables innovators at the cutting edge of AI to seamlessly transition advanced models into production. With our recent success in securing a $300M Series E funding round, backed by notable investors such as BOND, IVP, Spark Capital, Greylock, and Conviction, we're on an exciting growth trajectory. Join our team and contribute to the platform that engineers rely on to launch AI-driven products.THE ROLEAs an Applied AI Inference Engineer at Baseten, you'll collaborate closely with clients to design, develop, and implement high-performance AI applications using our platform. You will guide customers through the entire process, from initial concept to deployment, transforming vague business objectives into dependable, observable solutions that meet defined quality, latency, and cost metrics.This position is ideal for innovative engineers eager to gain insight into how modern organizations scale AI adoption. You will thrive if you enjoy a multifaceted role that intersects product development, software engineering, performance optimization, and direct customer engagement.It’s essential to note that this position requires hands-on coding and software development, while also encompassing elements of product management, technical customer success, and pre-sales engineering.EXAMPLE INITIATIVESExplore insights from our Forward Deployed Engineering team through these blog posts: Forward Deployed Engineering on the frontier of AIThe fastest, most accurate Whisper transcriptionDeploy production-ready model servers from Docker imagesDeploy custom ComfyUI workflows as APIs...
Sign in to browse more jobs
Create account — see all 5,361 results
Browse all companies, explore by city & role, or SEO search pages. View directory listings: all jobs, search results, or location & role pages.
Canvas Medical
Company OverviewAt Canvas Medical, we are revolutionizing everyday healthcare through our innovative EMR platform designed for healthcare automation. Our mission is to empower care teams with advanced software solutions that streamline data integration, automate workflows, and foster collaboration among developers and clinicians to tackle the most pressing c…
lilasciences
Join lilasciences as a Product Lead specializing in Software and Applied AI, where you will spearhead innovative projects that leverage cutting-edge AI technologies. In this pivotal role, you will collaborate with cross-functional teams to define product vision, strategy, and execution, ensuring alignment with user needs and market trends.
Arcade
Join Our Innovative Team at ArcadeAt Arcade, we are revolutionizing the way physical products are created with our cutting-edge AI platform. We empower individuals to turn their creative ideas into tangible products seamlessly, utilizing natural language and generative AI. Our mission is to democratize product design, making it as effortless as sharing a post online.Backed by a remarkable $42M in funding from industry-leading investors including Reid Hoffman and Ashton Kutcher, our company is a rising star in the tech landscape. Guided by our founder Mariam Naficy and a team steeped in AI and design expertise, we are at the forefront of a new frontier that merges AI, personal expression, and on-demand manufacturing.Your Role as an Applied AI EngineerWe are on the lookout for an Applied AI Engineer to enhance our generative AI capabilities. This position combines hands-on model development with the integration of advanced AI techniques into our production systems. You will collaborate with diverse teams to conduct research, experiment with models, and implement AI-driven products.
Applied Compute
About UsAt Applied Compute, we are pioneering the development of Specific Intelligence for enterprises, creating agents that continuously learn from a company’s processes, data, expertise, and objectives. Our mission is to bridge the gap between isolated AI capabilities and their effective application within real business environments. Traditional AI systems often fall short as they lack the ability to adapt based on feedback. Our innovative continual learning layer captures context, memory, and decision-making processes across the enterprise, enabling specialized agents to engage in meaningful work.What Excites Us: We operate at the exciting intersection of product development and cutting-edge research. Our product team designs the platform that empowers a new generation of digital coworkers, while our research team drives advancements in post-training and reinforcement learning to enhance user experiences. As an applied research engineer, you will work directly with clients to implement models in production, combining robust product development with deep research insights to facilitate AI integration in enterprises.Meet Our Team: Our diverse team consists of engineers, researchers, and operators, many of whom are former founders. We have previously built reinforcement learning infrastructure at OpenAI, established data foundations at Scale AI, and contributed to significant systems at companies like Together, Two Sigma, and Watershed. We collaborate with Fortune 50 clients, including DoorDash, Mercor, and Cognition, and are proud to be backed by reputable investors such as Benchmark, Sequoia, and Lux.Who Thrives Here: We seek individuals who are passionate about applying innovative research and complex systems to solve real-world challenges. You should feel comfortable navigating new environments rapidly—be it a fresh codebase, a client’s data architecture, or an unfamiliar problem domain. A genuine enjoyment for customer interaction, empathy, and a deep understanding of their operational workflows are essential. Candidates with entrepreneurial backgrounds, extensive side projects, or a proven track record of end-to-end ownership typically excel in our environment.
Join Anthropic as an Engagement Manager on our Applied AI team, where you will spearhead the delivery of cutting-edge AI solutions for Fortune 500 companies. In this pivotal role, you will collaborate with customers to create bespoke AI agents that enhance their core business processes. You will oversee the entire project lifecycle from the signed Statement of Work (SOW) to production deployment, coordinating cross-functional teams that include Engineering, Product, Design, and key customer stakeholders. This position goes beyond traditional project management; you will adeptly navigate complex enterprise environments, eliminate technical and organizational obstacles, and drive measurable business outcomes while upholding our commitment to safety and reliability. Work closely with Forward Deployed Engineers (FDEs) to manage stakeholder relationships and organizational intricacies, ensuring seamless delivery of AI innovations. Additionally, you will champion our mission in the field and develop the frameworks that enable scalability in our growing initiatives.
Quizlet Inc.
Quizlet Inc. is looking for an Applied AI Engineer to create AI-driven features that support student learning. This position centers on developing and deploying machine learning solutions aimed at making study experiences more effective and engaging for a global user base. What you will do Design and implement machine learning models to enhance Quizlet’s educational tools Work on features that help students study more efficiently and enjoyably Locations Denver, CO New York, NY San Francisco, CA Seattle, WA
Applied Compute
ABOUT USAt Applied Compute, we are pioneers in developing Specific Intelligence for enterprises, creating agents that learn continuously from a company’s processes, data, expertise, and objectives. Our mission is to establish a continual learning platform that captures context, memory, and decision traces throughout the organization, enabling specialized agents to perform meaningful tasks.Why Join Us: Our team operates at a unique intersection of innovation. Our product team is responsible for crafting a platform that serves as the backbone for a new generation of digital coworkers. Meanwhile, our research team explores the cutting edge of post-training and reinforcement learning to enhance product experiences. Our applied research engineers collaborate closely with clients to deploy agents effectively in real-world scenarios. This synergy of robust product development, extensive research, and direct client engagement is essential for us to revolutionize AI in the enterprise landscape.Our Team: Comprising engineers, researchers, and operations experts, our team includes many former founders with extensive experience. We have developed RL infrastructure at OpenAI, data foundations at Scale AI, and other systems at companies like Two Sigma and Watershed. We proudly serve Fortune 50 clients and are supported by top-tier investors including Kleiner Perkins, Benchmark, Sequoia, Lux, and Greenoaks.Who Thrives Here: We seek individuals passionate about utilizing cutting-edge research and complex systems to address real-world challenges. Comfort navigating diverse environments, whether it’s a new codebase, unfamiliar customer data architecture, or unexplored problem domains, is essential. Our team values genuine client engagement — listening, empathizing, and understanding the realities of work in their organizations. Those with entrepreneurial spirits, rich project experiences, or proven capabilities to manage tasks end-to-end will excel in our environment.THE POSITIONAs a Software Engineer, you will be instrumental in building the products and interfaces utilized by customers and internal teams. You will manage the entire application platform stack, from collaborative human-AI workspace systems to backend workflows orchestrating sandboxed agent sessions, and the continual learning SDK that provides engineers with oversight of the agent development lifecycle.
Demandbase
Join Demandbase as an Applied AI Scientist, where you'll have the opportunity to push the boundaries of AI technology in a dynamic and innovative environment. As a key member of our team, your role will involve developing and implementing advanced AI solutions that drive impactful business results.We're looking for a passionate individual who thrives on solving complex problems and is eager to contribute to AI advancements. You will collaborate with cross-functional teams to integrate AI models into our existing frameworks and enhance our product offerings.
Join mesh as an AI Operations Lead in our San Francisco office. In this pivotal role, you will oversee the implementation and optimization of AI-driven solutions, ensuring operational excellence and innovation. You will collaborate with cross-functional teams to drive efficiency and improve outcomes across various projects. Your expertise will help shape our AI strategies and enhance our operational capabilities.
Artisan
About ArtisanAt Artisan, we are pioneers in creating fully autonomous AI employees – not mere chatbots or copilots, but digital workers capable of performing meaningful tasks.Our flagship product, Ava, is an AI-powered Business Development Representative (BDR) utilized by hundreds of companies. Ava excels at researching leads, crafting and sending emails in clients' unique voices, managing complex outbound sequences, autonomously optimizing her performance, and even addressing objections while scheduling meetings. She is not just a tool; she is a collaborative teammate.As a proud Y Combinator W24 company, we have successfully raised over $35 million in funding and are currently achieving over $8 million in annual recurring revenue (ARR). We are now embarking on the development of Ava 2.0, which will redefine the capabilities of AI employees. The engineering challenges involved are substantial, and the scope of our project is vast.Role OverviewJoin our innovative team as the third Applied AI Engineer at Artisan! You will be instrumental in pushing the boundaries of AI employee capabilities and guiding our product's future direction.Evaluate LLMs: Select optimal models for various tasks while considering cost, latency, reliability, and accuracy.Architect Prompt Frameworks: Design agent behaviors for Ava's essential workflows such as email generation, chat interactions, meeting scheduling, prospect research, and more.Optimize Multi-Step Agent Chains: Implement retrieval-augmented generation (RAG), integrate web searches, and utilize tools like CRMs and APIs.Drive Infrastructure Decisions: Lead choices related to routing, orchestration, evaluation loops, and persistent memory for agents.Build Safety and Trust Mechanisms: Collaborate with the product team to design user guardrails, fail-safes, and success metrics.Explore Emerging Modalities: Investigate and deploy voice AI, talking head technology, and multi-modal reasoning to enhance Ava's human-like interactions.Design Autonomous Agent Workflows: Create workflows that enable strategic decision-making, real-time self-optimization, and measurable outcomes.Location: San Francisco, New York, or Remote USATeam: AIReports to: CPTO, Sam Stallings
Abby Care
Join Our Mission at Abby CareAt Abby Care, we are dedicated to transforming the landscape of family caregiving, addressing one of the most significant challenges of our time. Our goal is to empower over 50 million unpaid family caregivers across the United States by providing them with the training and support they need to get compensated for the invaluable care they provide at home.We are developing a cutting-edge, tech-driven family-first care platform designed to enhance care delivery, improve health outcomes, and ensure a superior experience for families nationwide. As we expand our impact, we are seeking passionate individuals to join our team. With partnerships alongside leading insurance providers, healthcare organizations, and community groups, we are backed by top-tier, mission-driven venture capitalists who share our vision of supporting families throughout the country.Our team comprises high-caliber professionals with backgrounds at renowned companies such as Uber, Scale AI, DoorDash, Dropbox, and Meta. Together, we are reimagining family caregiving.Your Role as an Applied AI EngineerWe are on the lookout for a skilled and motivated Applied AI Engineer to join our dynamic team. Reporting to the VP, Head of Engineering, this full-time position is based in San Francisco, CA, with an in-person presence required four days a week.In this role, you will play a crucial part in designing and building AI-driven products that serve as intelligent copilots for families and healthcare providers. You will create AI systems that facilitate family caregiving, enhance clinician support in care delivery, and explore innovative home-based care models. Your work will involve hands-on engagement with AI models and real-world clinical and operational data, contributing to the evolution of AI-assisted caregiving and shaping the future of care as a key early engineer.
Automat
Join the Visionaries at Automat!At Automat, we are a collective of innovative technologists hailing from prestigious backgrounds at Google Creative Lab and Samsung’s innovation Think Tank. Our mission is to transform the future of business through intelligent agents that enhance operational efficiency and eventually manage enterprises autonomously. We are leaders in redefining Enterprise AI Agents and Intelligent Document Processing, creating cutting-edge tools and infrastructure to seamlessly integrate AI advancements into practical applications.We cherish curiosity, teamwork, and impactful contributions over strict credentials. If you are passionate about AI and automation, excel in dynamic and creative settings, and yearn to engage in groundbreaking projects, we invite you to apply!While this is a senior role, we believe that unique talents and perspectives can surpass traditional experience metrics. We seek an individual who can make an immediate impact from day one and contribute to our team’s learning journey.Your RoleAs an Applied AI Engineer, you will operate at the crossroads of product development, engineering, and research. You will work hand-in-hand with skilled engineers and end users to create innovative tools for our internal teams and external clientele. Your contributions will help shape technical solutions, guide strategic initiatives, and enrich our company culture.Key Responsibilities:Collaborate with product and engineering teams to gain insights into customer challengesResearch and implement real-world applications of advanced AI modelsDesign and execute impactful solutions for both internal tools and customer-facing productsInfluence engineering practices, guide product development, and foster a collaborative cultureWhat Motivates You:A passion for curiosity, creativity, and quick adaptationCreating full-stack applications that bring AI to lifeKeeping abreast of the latest trends in AI and LLM prompting techniquesChallenging the limits of what is achievable with AI agentsThriving in fast-paced, collaborative environments
WorkOS
About WorkOS WorkOS is revolutionizing the developer landscape by creating cutting-edge tools and APIs that empower businesses to achieve Enterprise Ready status. Our platform is the backbone for authentication, identity management, authorization, and other essential infrastructures, enabling developers to securely scale their products for large enterprises.Having recently secured a $100M Series C funding round, valuing the company at $2B, we are supported by leading investors such as Meritech, Sapphire, Greenoaks, Craft, Abstract, and Audacious. WorkOS is proud to serve many of the fastest-growing AI companies, including OpenAI, Cursor, Perplexity, Vercel, and Plaid.As AI technology evolves, WorkOS stands at the forefront of Human and Agent Authentication, Identity, and Access Control, helping businesses navigate crucial questions about agent identity and permissions. Our rapidly expanding customer base features hundreds of innovative software companies developing the next generation of enterprise-ready solutions.About the RoleWe are assembling an Applied AI team dedicated to significantly enhancing productivity across Engineering, Sales, Support, and Operations while also developing AI functionalities for our clients.As an Applied AI Engineer, you will design and deploy transformative AI systems that redefine how WorkOS develops, markets, and supports its software. You will focus on creating robust tools and workflows that are essential for daily operations, while also exploring novel AI capabilities for our users.Join a small, high-ownership team that:Identifies challenges based on measurable impact.Rapidly transitions from idea to prototype to production in days or weeks.Quickly adapts to evolving models, tools, and best practices.This role offers substantial visibility across the organization.What You’ll DoCollaborate closely with Engineering, Sales, Support, and Operations to identify high-impact internal challenges where AI can enhance efficiency or facilitate new workflows.Design and construct tools that integrate seamlessly into daily operations—agents, automations, and workflows that are reliable, observable, and easy to maintain.Leverage LLMs, embeddings, retrieval systems, and tool-calling to connect with documents, Slack, GitHub, CRM systems, analytics, support frameworks, and internal services.Transform repetitive, multi-step manual tasks into streamlined, AI-driven processes that span various applications and data sources.Stay informed about emerging models and tools, conduct focused experiments, and assist the team in continuous improvement.
Role Overview:As an Applied AI Engineer at Mem0, you'll lead the charge from concept to execution. Your mission is to transform abstract customer use cases into tangible proofs-of-concept that vividly demonstrate Mem0's capabilities. This entails rapid full-stack prototyping, leveraging AI tools, and rigorously experimenting with memory retrieval methodologies until the end-to-end use case is operational. Collaborating closely with Research and Backend teams, you'll clearly communicate trade-offs and deliver successful prototypes ready for production refinement.Your Responsibilities:Develop Proofs of Concept: Create comprehensive demos (UI + APIs + data) that seamlessly integrate Mem0 into customer workflows.Innovate in Memory Retrieval: Experiment with various embeddings, indexing techniques, hybrid search, re-ranking, chunking/windowing, prompts, and caching to achieve optimal task-level quality and latency.Collaborate on Prototypes: Implement groundbreaking ideas and techniques sourced from academic research, compare them against established baselines, and adopt the most effective solutions.Design Evaluation Frameworks: Establish small gold sets and lightweight metrics to assess POC success; equip demos with basic telemetry.Integrate AI Tools: Combine LLMs, vector databases, Mem0 SDKs/APIs, and third-party services into cohesive workflows.Foster Close Collaboration: Work alongside Backend teams to establish clean contracts and data models; collaborate with Research on hypotheses; share insights and future steps.Document and Handoff: Produce clear documentation, scripts, and templates to facilitate rapid production transition by Engineering.Minimum Qualifications:Full-Stack Proficiency: Experience with Next.js/React for front-end development and Python backends (FastAPI/Django/Flask) or Node.js as required.Proficient in Python and TypeScript/JavaScript; capable of building APIs, wiring data models, and deploying quick demos.Hands-on experience with the LLM/RAG stack: embeddings, vector databases, retrieval strategies, and prompt engineering.Proven ability to rapidly prototype: transforming ideas into demos within days, not months; clear documentation of outcomes and trade-offs.Capacity to design small, impactful evaluations for use cases (quality + latency) and iterate based on data-driven evidence.Outstanding communication skills with Research and Backend teams; ability to produce crisp specifications, readable code, and honest status updates.
Paraform
Role Overview Paraform is hiring an Applied AI Engineer in San Francisco. This role focuses on building and deploying AI systems that directly serve real users. The ideal candidate brings 2-5 years of experience, a strong grasp of modern LLM-based technologies, and a track record of turning advanced models into reliable product features. Success in this role depends on sound product sense and the ability to weigh trade-offs between LLM and traditional machine learning approaches. Experience with LLM-powered applications, retrieval systems, agentic workflows, or automation is valuable. Familiarity with classic ML techniques, such as ranking, recommendation, or classification, will help in designing hybrid systems that balance performance, cost, and reliability. What You Will Do Design and build AI systems to improve matchmaking, ranking, and automation in the Paraform marketplace. Develop LLM-driven features, including retrieval pipelines and agentic workflows, to streamline recruiter and company interactions. Own systems end-to-end: from data pipelines and model design to deployment, monitoring, and iteration in production. Work closely with product managers, ML engineers, and full-stack teams to deliver AI capabilities that shape marketplace outcomes. Create evaluation frameworks to measure real-world performance, reliability, and business impact, not just offline metrics. Set best practices for building and maintaining production AI systems, balancing model quality, cost, latency, and maintainability. Advance the integration of AI into product experiences across the platform. What We Look For 2-5 years of experience at an AI-focused startup (Series A through D). Background working on products with a broad user base, beyond single-enterprise deployments. Proficient in Python and Typescript. Experience developing agentic systems that drive measurable business or user outcomes. Comfort with ambiguity and building in 0 to 1 environments. Ability to communicate technical trade-offs clearly to non-technical stakeholders.
LangChain
About Us:At LangChain, we strive to revolutionize the accessibility of intelligent agents. Our goal is to provide a robust framework for agent engineering that empowers developers to transition from initial prototypes to production-ready AI agents that can be trusted by teams. What started as a suite of widely embraced open-source tools has evolved into a comprehensive platform designed for the building, assessment, deployment, and management of agents at scale.Currently, our tools, including LangChain, LangGraph, LangSmith, and Agent Builder, are utilized by various teams delivering tangible AI solutions across both startups and large corporations. Millions of developers depend on LangChain to empower AI initiatives at prestigious companies like Replit, Clay, Coinbase, Workday, Lyft, Cloudflare, Harvey, Rippling, Vanta, and 35% of the Fortune 500.With $125M raised in our Series B round from IVP, Sequoia, Benchmark, CapitalG, and Sapphire Ventures, we are in a phase of rapid growth, continuously innovating new products while ensuring every team member has a significant impact on our development and collaborative processes. LangChain is a place where your contributions can influence the real-world application of this transformative technology.About the Role:We are seeking talented Applied AI Engineers to assist in creating AI agents that enhance various aspects of LangChain, from Marketing and Go-To-Market strategies to Recruiting, Support, Internal Tools, and our Core Product.In this position, you will take ownership of specific problem areas, collaborating closely with relevant teams to design, construct, and deploy production-grade agents, workflows, and applications that revolutionize our operational processes. Your contributions will directly support LangChain’s vision of making intelligent, autonomous software a reality for both our internal teams and our clients. Some projects may be open source, contributing to the LangChain and LangGraph ecosystems while establishing new benchmarks for AI development practices.If you are a full-stack software engineer eager to implement AI agents in practical use cases and witness their impact on business outcomes, we invite you to apply.
Eragon
Job OverviewJoin our team at Eragon as an Applied AI Intern, where you will play a crucial role in developing and deploying advanced AI systems. Collaborating closely with engineers and researchers, you'll contribute to transforming AI models from theoretical concepts into impactful real-world applications.This internship offers a unique opportunity for hands-on experience across various aspects of modeling, data management, and systems integration, working on projects that directly reach users.Main ResponsibilitiesModel Development: Assist in refining, assessing, and implementing machine learning models to address practical challenges.System Implementation: Collaborate in building and integrating AI-driven features into live production systems.Data & Pipelines: Engage with datasets to facilitate training, evaluation, and iterative improvements.Experimentation: Conduct experiments, analyze outcomes, and enhance model performance through iterative testing.Evaluation & Monitoring: Contribute to the development of evaluation frameworks and assist in monitoring system performance metrics.Cross-Functional Collaboration: Work alongside engineering and product teams to aid in the development of new features.QualificationsEducation: Currently pursuing a Bachelor’s or Master’s degree in Computer Science, Engineering, or a related discipline.Technical Skills: Proficient in Python with familiarity in machine learning frameworks such as PyTorch or TensorFlow.ML Fundamentals: Strong understanding of fundamental machine learning concepts and workflows.Problem-Solving Skills: Demonstrated ability to dissect problems and contribute to effective solutions.Curiosity & Initiative: A keen desire to learn and make meaningful contributions in a fast-paced setting.Preferred QualificationsExperience in machine learning projects, internships, or research roles.Familiarity with large language models, AI agents, or data pipelines.Demonstrated experience in building projects beyond academic coursework.Genuine interest in real-world AI applications.
Distyl AI
About Distyl AIDistyl AI specializes in creating high-performance AI systems that enhance the fundamental operational processes of Fortune 500 companies. Through a strategic alliance with OpenAI, proprietary software accelerators, and extensive expertise in enterprise AI, we deliver effective AI solutions with swift time-to-value, often within a quarter.Our innovations have empowered Fortune 500 clients in various sectors, including insurance, consumer packaged goods, and non-profit organizations. Joining our team means you will assist organizations in recognizing, developing, and extracting value from their Generative AI investments, frequently for the first time. We prioritize customer needs, working backward from the client's challenges and ensuring we generate financial benefits while enhancing the experiences of end-users.Distyl is guided by seasoned leaders from top-tier companies like Palantir and Apple and enjoys backing from prominent investors including Lightspeed, Khosla, Coatue, Dell Technologies Capital, Nat Friedman (Former CEO of GitHub), Brad Gerstner (Founder and CEO of Altimeter), along with board members from numerous Fortune 500 firms.What We Are Looking ForAt Distyl, we are at the forefront of leveraging AI within enterprises. We seek imaginative researchers who aspire to go beyond incremental enhancements on benchmarks and are eager to redefine the application of software in innovative ways.Our researchers hail from diverse academic disciplines but possess a robust research background, operate in an AI-centric manner, and would find conventional research environments unfulfilling.Key ResponsibilitiesThe AI Systems team is dedicated to architecting complex, comprehensive solutions that integrate perception, reasoning, planning, and execution. Researchers amalgamate various components (LLMs, retrievers, evaluators, memory systems, and execution agents) into resilient, scalable systems that deliver consistent performance across dynamic enterprise workflows.Researchers in AI Systems examine the principles governing intricate system interactions. They analyze coordination, information flow, and emergent behavior across multiple agents and models. Their research reveals the foundational mechanics of robustness, composability, and alignment, ultimately establishing the design paradigm for constructing intelligent systems.
OpenEvidence
RoleAs a Software Engineer specializing in Applied AI at OpenEvidence, you will be instrumental in developing comprehensive systems that harness cutting-edge AI models to create impactful, user-centric products.We seek exceptional builders who thrive outside conventional boundaries. Our engineers engage across various projects and products, taking ownership to maximize their impact.About UsOpenEvidence stands as the leading medical AI platform globally.In just over a year, over 40% of US clinicians have adopted our solution, driven by product-led growth and word-of-mouth recommendations. We are a $12 billion company with a talented 30-person engineering team hailing from prestigious institutions like MIT, Harvard, and Stanford. We believe that transformative products emerge from a small cadre of exceptional, autonomous builders who are empowered to take ownership and drive swift progress. We are expanding our team to capture an extraordinary opportunity to establish a benchmark platform for medical AI.If you are a top-tier engineer or scientist eager to innovate at the forefront of technology and deliver meaningful results that affect millions of lives, we want to connect with you.CultureWe believe that work should reflect a world-class commitment. Innovating from 0 to 1 and scaling from 1 to 1000 is akin to a professional sport, and we set the bar at uncompromising excellence. We understand that groundbreaking technologies are only possible through complete ownership. Significant achievements arise when individuals take decisive action.Who are you?If you are seeking a typical 9-5 job or merely wish to write papers, this position is not for you. If you are ready to dive in, get hands-on, face challenges head-on, and create something impactful that can reach millions and generate significant revenue, this role might be your perfect fit.The ideal candidate is a brilliant builder—intelligent, driven, resourceful, independent, meticulous, motivated, hardworking, and humble. Does that sound rare? It is indeed rare; we currently have only 30 such individuals, and we are eager to find more.LocationAll full-time engineering roles require in-person attendance five days a week in San Francisco or Miami.
ABOUT BASETENAt Baseten, we empower the leading AI companies of today, including Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer, by providing essential inference capabilities. Our unique blend of applied AI research, adaptable infrastructure, and intuitive developer tools enables innovators at the cutting edge of AI to seamlessly transition advanced models into production. With our recent success in securing a $300M Series E funding round, backed by notable investors such as BOND, IVP, Spark Capital, Greylock, and Conviction, we're on an exciting growth trajectory. Join our team and contribute to the platform that engineers rely on to launch AI-driven products.THE ROLEAs an Applied AI Inference Engineer at Baseten, you'll collaborate closely with clients to design, develop, and implement high-performance AI applications using our platform. You will guide customers through the entire process, from initial concept to deployment, transforming vague business objectives into dependable, observable solutions that meet defined quality, latency, and cost metrics.This position is ideal for innovative engineers eager to gain insight into how modern organizations scale AI adoption. You will thrive if you enjoy a multifaceted role that intersects product development, software engineering, performance optimization, and direct customer engagement.It’s essential to note that this position requires hands-on coding and software development, while also encompassing elements of product management, technical customer success, and pre-sales engineering.EXAMPLE INITIATIVESExplore insights from our Forward Deployed Engineering team through these blog posts: Forward Deployed Engineering on the frontier of AIThe fastest, most accurate Whisper transcriptionDeploy production-ready model servers from Docker imagesDeploy custom ComfyUI workflows as APIs...
Sign in to browse more jobs
Create account — see all 5,361 results
Browse all companies, explore by city & role, or SEO search pages. View directory listings: all jobs, search results, or location & role pages.
