Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Experience Level
Entry Level
Qualifications
The ideal candidate will have experience in software development, particularly in billing systems and internal tools. Proficiency in programming languages such as Python and JavaScript is essential. Familiarity with cloud technologies and APIs will be a strong advantage. Excellent problem-solving skills and the ability to work in a fast-paced environment are key.
About the job
Join Baseten as a Software Engineer focused on Billing and Internal Tooling, where you will play a crucial role in developing and enhancing our internal systems. You will collaborate with cross-functional teams to create efficient billing solutions and streamline internal processes. Your contributions will directly impact our operational efficiency and customer satisfaction.
About Baseten
Baseten is a leading technology company based in San Francisco, dedicated to enhancing business processes through innovative software solutions. We foster a collaborative and dynamic work environment, promoting growth and creativity among our team members.
Join baseten as a Kubernetes Systems Engineer, where you will play a pivotal role in shaping the future of machine learning and data deployment. Your expertise in operating systems and Kubernetes will help streamline our infrastructure and enhance our services. Collaborate with a talented team to drive innovation and efficiency.
Join baseten as a Data Engineer and be at the forefront of data-driven innovation. In this role, you will design and implement robust data pipelines, ensuring the efficient processing and analysis of data to empower our products and decision-making processes. Collaborate with cross-functional teams to understand their data needs, while striving for optimization and scalability in data architectures.
Join Baseten as an Onboarding Program Manager where you will play a vital role in shaping the onboarding experience for our new team members. You will be responsible for developing and implementing effective onboarding programs that enhance employee engagement and retention.
About Baseten Baseten supports leading AI companies, including Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer, by delivering essential inference capabilities. The platform brings together advanced AI research, flexible infrastructure, and developer-friendly tools, helping teams move models from the lab into production. Backed by a recent $300M Series E funding round and investors such as BOND, IVP, Spark Capital, Greylock, and Conviction, Baseten is growing quickly in its mission to become the platform engineers trust for building and shipping AI products. Role Overview The Integrated Marketing Manager will shape and run multi-channel marketing campaigns to drive a qualified pipeline and strengthen Baseten’s go-to-market approach. This role calls for a strategic marketer with hands-on experience in AI, comfortable guiding campaigns from initial idea through launch and measurement, and collaborating across teams and channels. What You Will Do Develop and execute full-funnel campaign programs that include content, paid media, email outreach, events, and web initiatives Increase awareness, engagement, and pipeline growth as Baseten scales through FY’27 Work closely with cross-functional teams to ensure campaigns align with business goals and market needs Analyze campaign performance and apply insights to improve future efforts Location This position is based in San Francisco.
Join Baseten as a Software Engineer focusing on GPU Networking and Distributed Systems. In this pivotal role, you'll collaborate with talented engineers and researchers to develop cutting-edge solutions that leverage GPU technology for high-performance networking operations. Your contributions will be instrumental in shaping the future of distributed systems, enhancing performance, scalability, and reliability.
Join Baseten as a Software Engineer focused on Billing and Internal Tooling, where you will play a crucial role in developing and enhancing our internal systems. You will collaborate with cross-functional teams to create efficient billing solutions and streamline internal processes. Your contributions will directly impact our operational efficiency and customer satisfaction.
Join Baseten as an Account Executive in the Industries division, where you'll play a pivotal role in driving growth and building strong client relationships. In this position, you will leverage your expertise to engage with prospective customers, understand their needs, and offer tailored solutions that align with their objectives. Ideal candidates will possess exceptional communication skills, a strong sales acumen, and a passion for technology.
ABOUT BASETENAt Baseten, we empower cutting-edge AI companies, including Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer, to achieve mission-critical inference. By merging advanced AI research with flexible infrastructure and intuitive developer tools, we facilitate the deployment of innovative AI models into production. Having recently secured a $300M Series E funding round from esteemed investors like BOND, IVP, Spark Capital, Greylock, and Conviction, we are on a rapid growth trajectory. Join our team and contribute to building a platform that engineers rely on to launch AI products successfully.THE ROLEAs a key member of Baseten's Platform Team, you will play a crucial role in developing internal infrastructure to support our engineering division. While our product offers infrastructure for AI advancements, your primary focus will be on crafting robust internal systems that enhance productivity, collaboration, and work quality across engineering teams, leveraging exceptional tools, efficient workflows, and resilient development settings.If you have a passion for elegant solutions—such as streamlined monorepos, rapid CI pipelines, and well-designed shared libraries—you will excel at Baseten.RESPONSIBILITIESDevelop a range of tools customized to meet the diverse needs of engineering teams.Enhance monorepo functionality and create project templates to ensure consistency and efficiency.Design and implement shared libraries focused on system observability.Optimize the speed, reliability, and thoroughness of our CI pipelines.Assist in designing and maintaining Terraform modules for effective infrastructure management.Provide innovative solutions to improve visibility within continuous delivery (CD) processes.Proactively support engineering teams, ensuring they have the necessary resources and tools for maximum productivity.REQUIREMENTSProficiency in Go and/or Python programming languages.Experience with Kubernetes and Docker tools (e.g., Helm, Docker, Kubernetes).Demonstrated experience managing and working with large monorepos.Strong problem-solving skills with an emphasis on efficient software delivery.Familiarity with CI/CD methodologies and tools.Excellent communication and collaboration skills.
Join Baseten as a Software Engineer focused on developing cutting-edge products that push the boundaries of technology. In this role, you will collaborate with a dynamic team to design, implement, and maintain innovative software solutions that meet the needs of our users. You will have the opportunity to work on exciting projects that utilize the latest technologies and methodologies.
Full-time|$300K/yr - $300K/yr|On-site|San Francisco
ABOUT BASETENAt Baseten, we empower innovative AI companies like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer to execute mission-critical inference with ease. By merging advanced AI research with flexible infrastructure and robust developer tools, we enable organizations at the forefront of AI to seamlessly deploy cutting-edge models into production. Fueled by rapid growth and a recent $300M Series E investment from industry leaders such as BOND, IVP, Spark Capital, Greylock, and Conviction, we're building the essential platform that engineers trust to launch AI products.THE ROLEAs a Software Engineer on our Core Product team, you will play a pivotal role in developing and enhancing the core Baseten platform, empowering users to effortlessly deploy and derive value from machine learning models. Given our developer-centric approach, you will engage with a vast array of components, including CLI tools, REST APIs, and the web application. The Core Product team leads all new product innovations within Baseten.EXAMPLE INITIATIVESAs part of our Core Product team, you will tackle exciting projects such as:Chains for multi-component workflowsAsynchronous inferenceModel APIs for cutting-edge modelsModel training optimized for production inferenceRESPONSIBILITIESDevelop and implement new features and products for the teamDesign intuitive APIs and abstractions to effectively address customer needsQuickly resolve bugs and customer issues with a proactive approachWork across the technology stack; you'll engage with both React Components and Kubernetes PodsCollaborate closely with product managers and cross-functional teams to drive product success
ABOUT BASETENAt Baseten, we are at the forefront of AI innovation, providing critical inference solutions for leading AI companies like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer. Our platform combines advanced AI research, adaptable infrastructure, and intuitive developer tools, empowering organizations to deploy state-of-the-art models effectively. With rapid growth and a recent $300M Series E funding round backed by top-tier investors including BOND, IVP, Spark Capital, Greylock, and Conviction, we invite you to join our mission in building the platform of choice for engineers delivering AI products.THE ROLE:As a member of Baseten’s Model Performance (MP) team, you will play a pivotal role in ensuring our platform’s model APIs are not only fast and reliable but also cost-effective. Your primary focus will be on developing and optimizing the infrastructure that supports our hosted API endpoints for cutting-edge open-source models. This role involves working with distributed systems, model serving, and enhancing the developer experience. You will collaborate with a small, dynamic team at the intersection of product development, model performance, and infrastructure, defining how developers interact with AI models on a large scale.RESPONSIBILITIES:Design, develop, and maintain the Model APIs surface, focusing on advanced inference features such as structured outputs (JSON mode, grammar-constrained generation), tool/function calling, and multi-modal serving.Profile and optimize TensorRT-LLM kernels, analyze CUDA kernel performance, create custom CUDA operators, and enhance memory allocation patterns for maximum efficiency across multi-GPU setups.Implement performance improvements across various runtimes based on a deep understanding of their internals, including speculative decoding, guided generation for structured outputs, and custom scheduling algorithms for high-performance serving.Develop robust benchmarking frameworks to evaluate real-world performance across diverse model architectures, batch sizes, sequence lengths, and hardware configurations.Enhance performance across runtimes (e.g., TensorRT, TensorRT-LLM) through techniques such as speculative decoding, quantization, batching, and KV-cache reuse.Integrate deep observability mechanisms (metrics, traces, logs) and establish repeatable benchmarks to assess speed, reliability, and quality.
Join our innovative team at Baseten as a Software Engineer - AI Enablement. In this role, you will work on cutting-edge AI technologies and help build tools that empower developers to harness the full potential of artificial intelligence.We are looking for passionate engineers who thrive in a collaborative environment and are eager to tackle challenging problems. You will be responsible for designing and implementing scalable AI solutions, working closely with cross-functional teams to deliver impactful results.
Join Baseten as a Software Engineer focused on enhancing our Developer Ecosystem. You will be instrumental in crafting solutions that enable developers to build, train, and deploy machine learning models seamlessly. Your role will involve collaborating with cross-functional teams to innovate and optimize our platform, ensuring it meets the needs of our users.
ABOUT BASETENAt Baseten, we are at the forefront of enabling transformative AI solutions for some of the world's leading companies, including Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer. Our innovative platform combines cutting-edge AI research, adaptable infrastructure, and developer-friendly tools to facilitate the production of advanced models. Recently, we celebrated our rapid growth with a successful $300M Series E funding round from notable investors like BOND, IVP, Spark Capital, Greylock, and Conviction. We invite you to join our dynamic team and contribute to the evolution of AI product deployment.THE ROLEAs a Senior Software Engineer specializing in Model Training at Baseten, you will play a pivotal role in constructing the infrastructure essential for the large-scale training and fine-tuning of foundational AI models. Your responsibilities will include designing and implementing distributed training systems, optimizing GPU utilization, and establishing scalable pipelines that empower Baseten and our clientele to adapt models with efficiency and reliability. This role demands a high level of technical expertise and hands-on involvement: you will be responsible for critical components of our training stack, collaborate with product and infrastructure teams to identify customer needs, and drive advancements in scalable training infrastructure.EXAMPLE WORK:Training open-source models that surpass GPT-5 capabilities for a leading digital insurerExploring specialized, continuously learning models as the future of AIOverview of our training documentationResearch initiatives we've undertakenRESPONSIBILITIESDesign, construct, and sustain distributed training infrastructures for large foundation modelsDevelop scalable pipelines for fine-tuning and training across diverse GPU/accelerator clustersEnhance training performance through optimization of algorithms and infrastructureCollaborate closely with cross-functional teams to align technical solutions with business objectivesStay abreast of advancements in the field of machine learning and AI to continually improve our training processes
ABOUT BASETENBaseten is at the forefront of AI technology, empowering leading-edge companies like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer to seamlessly integrate advanced AI models into their operations. Our unique blend of applied AI research, adaptable infrastructure, and intuitive developer tools enables innovators to bring their most ambitious AI products to life. With our recent $300M Series E funding from top-tier investors such as BOND, IVP, Spark Capital, Greylock, and Conviction, we are poised for rapid growth. Join us in shaping the platform that engineers rely on to deploy transformative AI solutions.THE ROLEAre you driven by a passion for enhancing artificial intelligence applications? We are seeking a proactive Software Engineer specializing in ML performance to join our energetic team. This position is perfect for backend engineers who thrive in a fast-paced startup environment and are eager to make substantial contributions to the realm of Large Language Model (LLM) Inference. If you're enthusiastic about optimizing open-source ML models, we can't wait to hear from you!EXAMPLE INITIATIVESAs a member of our Model Performance team, you will have the opportunity to work on exciting projects, including:Baseten Embeddings Inference: The quickest embeddings solution availableThe Baseten Inference StackDriving model performance optimizationRESPONSIBILITIESDevelop, refine, and implement advanced techniques (quantization, speculative decoding, kv cache reuse, chunked prefill, and LoRA) for ML model inference and infrastructure.Conduct thorough investigations into the codebases of TensorRT, PyTorch, TensorRT-LLM, vllm, sglang, CUDA, and other libraries to troubleshoot and resolve ML performance issues.Scale and apply optimization techniques across a diverse array of ML models, with a focus on large language models.
Full-time|$300K/yr - $300K/yr|On-site|San Francisco
ABOUT BASETENAt Baseten, we empower leading AI companies such as Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer with our state-of-the-art inference solutions. Our unique blend of applied AI research, versatile infrastructure, and intuitive developer tools allows organizations at the forefront of AI innovation to deploy cutting-edge models effectively. Recently, we have experienced significant growth, securing a $300M Series E funding round, backed by renowned investors like BOND, IVP, Spark Capital, Greylock, and Conviction. Become a part of our journey to create the ultimate platform for engineers to launch AI products seamlessly.THE ROLEAs a Senior Software Engineer focused on our Enterprise Platform, you will play a pivotal role in designing and developing robust infrastructure and platform features tailored for our enterprise clientele and cloud partners. Your contributions will encompass enabling self-hosted and single-tenant environments, implementing region-aware request routing, and ensuring enterprise-grade data security and integration capabilities.EXAMPLE INITIATIVESJoin our Infrastructure team and tackle exciting projects such as:Multi-cloud capacity managementOptimizing inference on B200 GPUsImplementing multi-node inference solutionsLeveraging fractional H100 GPUs for efficient model servingRESPONSIBILITIESDesign and implement infrastructure and platform features customized for enterprise clients, covering self-hosted clusters, single-tenant environments, and cross-cloud orchestration.Lead strategic initiatives to enhance secure and scalable private connectivity solutions.Craft and execute solutions that address complex regulatory and compliance requirements for enterprise environments.
Baseten is looking for a Strategic Finance Specialist to focus on Go-To-Market (GTM) strategies in San Francisco. This position centers on analyzing financial data and supporting key decisions that shape GTM initiatives. Role overview This role works closely with teams across the company to improve growth and efficiency. The Strategic Finance Specialist uses financial insights to guide GTM projects and influence business outcomes. What you will do Analyze financial data related to GTM strategies Support strategic decision-making with clear financial insights Collaborate with cross-functional teams to enhance GTM initiatives Help drive growth and operational efficiency through financial expertise Location This position is based in San Francisco.
Company Overview:Specter is revolutionizing how businesses perceive their physical environments by developing a software-defined control plane. Our mission is to enhance the security of American enterprises by providing them with comprehensive visibility over their physical assets.We are pioneering a connected hardware-software ecosystem that leverages multi-modal wireless mesh sensing technology, reducing the deployment costs and time for sensors by a factor of ten. Our platform aims to be the perception engine for a company’s physical presence, facilitating real-time visibility of perimeters and enabling autonomous operational management.Founded by passionate innovators from Anduril, Tesla, Uber, and the U.S. Special Forces, our co-founders, Xerxes and Philip, are dedicated to empowering our partners in the rapidly evolving landscape of physical AI and robotics.
About braintrustBraintrust is at the forefront of AI observability. By merging evaluation and observability into a singular workflow, we empower developers with the insights needed to comprehend AI behavior in production environments, along with the tools to enhance it.Leading teams at Notion, Stripe, Zapier, Vercel, and Ramp utilize Braintrust to compare models, test prompts, and monitor regressions — transforming production data into superior AI with each new release.About the roleWe are in search of a passionate software engineer dedicated to crafting high-performance data processing systems. Our clientele consists of large enterprises handling complex, semi-structured data, which they require for real-time processing and analysis. Our distinct architecture enables these organizations to keep data on-premises while creating intricate visualizations that load without delay. Explore our Brainstore blog post.If you have experience with database systems, compilers, networks, or storage systems and aspire to pivot your expertise into the AI sector, this role could be your ideal fit. You will significantly influence foundational system architecture, technology selection, and implementation. Our founding team possesses extensive knowledge in database and ML systems, and you will have the autonomy to collaborate closely with them while exploring your innovative ideas.Your ResponsibilitiesAs a systems engineer at Braintrust, you’ll contribute to the core systems that empower Braintrust’s capability to process and query vast amounts of unstructured data at an enterprise scale. Key areas of responsibility include:Enhancing the storage, indexing, and query execution performance of Brainstore.Developing Braintrust's btql query language.Optimizing query patterns to boost performance across our platform.QualificationsDeep understanding of systems programming (C++ or Rust, concurrency, databases, operating systems).Experience in founding or working at startups is advantageous.Familiarity with writing prompts or experimenting with GPT models and applications.BenefitsComprehensive medical, dental, and vision insurance.Daily lunch, snacks, and beverages provided.Flexible time off policy.Competitive salary with equity options.
Midstream is an innovative, AI-driven financial operating system tailored for healthcare systems. Founded by a team of seasoned entrepreneurs and supported by prestigious investors, we empower finance, supply chain, and managed care teams with real-time insights into margin risks, enabling them to act swiftly to protect their margins.Designed specifically for the complexities of healthcare, Midstream converts structured, unstructured, and external data into immediate, contract-aware insights that enhance decision-making. Our AI-driven agents integrate spending and revenue operations across the entire back-office, continuously learning and adapting to ensure a level of intelligence that surpasses any standalone solution.We are revolutionizing the pace of healthcare finance, compressing lengthy processes into minutes and transforming retrospective insights into proactive foresight. Midstream is at the forefront of this change.The OpportunityJoin Midstream at a pivotal moment in our growth and contribute to establishing the technical backbone of our platform. In this role, you will operate at the intersection of product development, systems architecture, and cloud infrastructure, crafting and building distributed systems that enable our agile team to operate efficiently as we expand.You will collaborate closely with engineers and leadership to identify current infrastructural pain points and anticipate potential challenges before they arise. From multi-tenant architectures to security protocols, you will transform uncertainty into robust systems that minimize operational burdens and enhance engineering productivity.We are seeking a software engineer with a systems-oriented mindset who is passionate about long-term maintainability and developer experience, while also enjoying the process of creating backend services and delivering tangible software solutions. The ideal candidate will be adept at reasoning about distributed systems, making practical compromises, and developing infrastructure that seamlessly integrates into the background, allowing the team to focus on delivering impactful product value.What You’ll DoDevelop shared platform patterns and tools that enable engineers to launch new backend services and workflows efficiently, securely, and reliably.Enhance the reliability of our production systems by improving observability, debugging capabilities, and resilience as we scale.Architect infrastructure that is clean, reviewable, and repeatable, minimizing unique configurations and facilitating rapid iteration.Establish clear, scalable multi-tenant boundaries across data, compute, and identity to support...