Experience Level
Mid to Senior
Qualifications
Proven experience as a DevOps Engineer or similar role. Strong knowledge of CI/CD tools such as Jenkins, GitLab CI, or CircleCI. Experience with cloud services (AWS, Azure, GCP) and configuration management tools (Ansible, Puppet, Chef). Solid understanding of containerization technologies (Docker, Kubernetes). Excellent problem-solving skills and ability to work in a fast-paced environment. Strong communication and collaboration skills.
About the job
We are seeking a talented and experienced DevOps Lead/Architect to join our dynamic team at Sonsoft Inc. in San Francisco. In this pivotal role, you will be responsible for designing and implementing robust Continuous Integration (CI) and Continuous Deployment (CD) pipelines that enhance our software development lifecycle.
The ideal candidate will have a deep understanding of DevOps practices, with a focus on automation, monitoring, and cloud infrastructure. You will collaborate closely with cross-functional teams to ensure efficient delivery of high-quality software products.
About Sonsoft Inc.
Sonsoft Inc. is a leading technology solutions provider based in San Francisco, dedicated to delivering innovative software and IT services to clients across various industries. Our team is passionate about technology and committed to driving success through collaboration and excellence.
Full-time|$230K/yr - $490K/yr|On-site|San Francisco
About the Role
Join the Engineering Acceleration Delivery / Continuous Deployment team at OpenAI, where we develop and maintain systems designed to securely deploy OpenAI’s infrastructure and product code into production.
Our team is responsible for the deployment platform, release pipelines, and safety mechanisms that empower engineers across OpenAI to make rapid changes while minimizing operational risks. Our goal is to streamline production deployments, enhancing speed, safety, and autonomy.
This position is a unique opportunity to work at the convergence of developer productivity, distributed systems reliability, and large-scale infrastructure orchestration.
In This Role, You Will
Architect and implement continuous deployment infrastructure that efficiently manages changes across multiple Kubernetes clusters and global regions.
Create systems for progressive delivery, incorporating techniques like canary releases, staged rollouts, and automated rollback processes.
Enhance engineering velocity by reducing friction within the release pipeline and automating operational workflows.
Collaborate with product and infrastructure teams to ensure their services are deployable, observable, and resilient at scale.
Refine and adopt deployment methodologies such as GitOps, infrastructure-as-code, and progressive delivery patterns.
Develop systems that automatically assess deployment health through metrics, logs, traces, and alerts to identify regressions and initiate safe rollbacks.
Create systems that facilitate agent-assisted or fully autonomous deployment workflows using cutting-edge AI tools.
Technologies you will work with include:
Kubernetes for large-scale container orchestration and runtime infrastructure
Python and FastAPI for internal services
Terraform for infrastructure as code
GitOps-based deployment workflows (e.g., ArgoCD, Flux, or similar systems)
Buildkite for CI orchestration
Continue is on the lookout for an exceptional Software Engineer with over 5 years of experience to join our dynamic team in San Francisco. As a pivotal member of our engineering department, you will be instrumental in developing cutting-edge autocomplete features and optimizing codebase retrieval systems. We value precision, analytical thinking, and a keen eye for detail in our candidates. In this position, you'll tackle foundational yet complex challenges where methodical assessment, swift experimentation, and a deep understanding of user needs will drive the advancement of our innovative products.
We are seeking a talented and experienced Senior Continuous Integration Engineer to join our dynamic team at usm2 in San Francisco. In this role, you will play a pivotal part in enhancing our continuous integration processes and infrastructure, ensuring seamless deployment and integration of software solutions.
As a Senior Engineer, you will collaborate closely with cross-functional teams, drive best practices in CI/CD, and leverage your expertise to implement innovative solutions that improve our development lifecycle. Your contributions will directly impact the efficiency and quality of our software products.
Full-time|$189K/yr - $303K/yr|On-site|San Francisco, California
About Us
At Aurora, we are on a mission to redefine transportation by delivering self-driving technology that is safe, efficient, and accessible to all.
Explore The Aurora Driver and join us in creating a transformative future in mobility and logistics.
At Aurora, you will engage with challenging problems alongside a team of passionate experts, fostering your growth as you expand your expertise. Stay updated with our latest developments through aurora.tech or connect with us on LinkedIn.
We are looking for a Staff Software Engineer to join our Autonomy Data: Continuous Learning team. This role is ideal for individuals eager to delve into complex models and datasets. You will utilize cutting-edge foundation models and Reinforcement Learning with Human Feedback (RLHF) techniques to enhance our models with high-quality data and to construct the datasets that drive the Aurora Driver.
Your Responsibilities
Enhance dataset quality by developing semi-automated evaluation mechanisms using state-of-the-art models and RLHF techniques.
Broaden our foundation model strategy for identifying noteworthy events over millions of miles.
Take ownership of model training and inference pipelines for all core autonomy models.
Collaborate across various teams (product, program, operations, data science) to manage projects from conception through to delivery.
At Trunk, our vision is to empower teams to produce exceptional software efficiently. We have collaborated with engineering teams at leading companies such as Google X, Zillow, and Brex to identify the root causes of build failures, tackle flaky tests, and accelerate code deployment without compromising on reliability. While AI has revolutionized code writing, the deployment process remains a bottleneck. Issues such as merge conflicts, unreliable tests, and inconsistent code quality hinder productivity and morale. Our goal is to help engineering teams maintain focus on design, implementation, and delivery, fostering a more satisfying work environment.
Founded in 2021 by former engineers from Uber, Google, YouTube, and Microsoft, Trunk has successfully raised $25 million in Series A funding, led by Initialized Capital and a16z, with additional investments from notable industry figures including the founders of GitHub and Apollo GraphQL.
Our CI pipelines often operate as black boxes, causing engineers to spend valuable hours troubleshooting failures that may stem from flaky tests or other infrastructural issues. Trunk aims to demystify this process by making failures visible and actionable.
We are on the frontier of innovation, building a data layer that enables AI agents to analyze CI processes, diagnose failures, propose solutions, and ultimately facilitate autonomous code deployment.
We are seeking a Tech Lead who will take ownership of the data platform driving our flaky test detection and CI analytics solutions. You will be responsible for designing and developing systems capable of processing millions of test runs every hour, unveiling actionable insights, and establishing the groundwork for AI-enhanced CI workflows.
We are at a pivotal moment in our growth. The challenges of scalability are significant and increasing.
The future of development tools is evolving with AI at its core, and we are laying the foundational data infrastructure to support this transformation. If you are passionate about solving complex systems problems that have a direct impact on customers, this role is for you.
Role overview
The Program Manager, Continuous Improvement at Fortune Brands focuses on driving operational excellence across projects in San Francisco. This role centers on finding ways to make processes more efficient and supporting teams as they adopt better practices. The position plays a key part in shaping a culture that values ongoing progress and improvement.
What you will do
Spot opportunities to streamline or enhance existing processes
Work closely with teams from different departments to implement improvements
Promote and reinforce a mindset of continuous improvement within the organization
Ensure that project efforts support Fortune Brands’ overall business objectives
Join Rad AI as the Head of Continuity
At Rad AI, we are dedicated to revolutionizing healthcare through the power of artificial intelligence. Born from the expertise of a radiologist, our innovative AI solutions are transforming the landscape of radiology: streamlining processes, alleviating clinician burnout, and enhancing patient outcomes. Our extensive proprietary radiology report dataset, one of the largest globally, has facilitated the identification of countless new cancer cases and has nearly halved error rates in radiology reports.
With over $140 million secured in funding, including a recent $68 million Series C round led by Transformation Capital, our valuation stands at $528 million. We are proud to have the backing of prestigious investors such as Khosla Ventures, World Innovation Lab, Gradient Ventures, and Cone Health Ventures, all committed to our mission of empowering healthcare professionals with state-of-the-art AI tools.
Our generative AI advancements are utilized by thousands of radiologists every day, supporting nearly half of all medical imaging in the United States through partnerships with esteemed healthcare systems like Cone Health, Jefferson Einstein Health, Geisinger, Guthrie Healthcare System, and Henry Ford Health.
Recognized by CB Insights and AuntMinnie as one of the most promising healthcare AI firms of 2023, and ranked 19th by Deloitte among the fastest-growing companies in North America, we are committed to crafting AI solutions that have a tangible impact.
Recently, we were also featured on CNBC’s Disruptor 50 list, showcasing the innovation and drive behind our mission.
If you are passionate about shaping the future of healthcare, we invite you to join our dynamic team!
Our AI solutions integrate seamlessly into the radiologist’s workflow:
Rad AI Impressions automatically generates the impression section of reports in real-time.
Rad AI Reporting is a comprehensive radiology reporting platform enhanced by Generative AI.
Rad AI Continuity is our innovative platform for patient follow-up coordination, ensuring actionable findings in radiology reports are addressed, thereby closing the loop between radiology and subsequent care.
Be a part of our mission to enhance patient care through intelligent solutions!
At Continue, we're on the lookout for a seasoned software engineer to enhance our IDE extensions, optimize the hub.continue.dev backend, and develop various innovative services (like our codebase sync indexing engine, data ingestion pipelines, and VMs for background agents).
Who You Are
We envision a candidate with a strong background that complements this role. If you do not meet every qualification but believe you can tackle the challenges ahead, we encourage you to reach out!
You possess advanced proficiency in TypeScript, Go, or a similar language.
You have a keen interest in AI engineering or machine learning.
You bring experience in scaling systems for enhanced stability, maintainability, and developer experience.
You are adept at simplifying complex problems and making trade-offs to expedite progress.
Your Responsibilities
As a startup, we value versatility; be prepared to wear multiple hats as we pursue our mission. Key responsibilities will include:
Articulating and implementing robust architectural strategies that facilitate swift delivery of SaaS and on-premise products.
Contributing significantly to the development of our backend services.
Identifying and addressing core challenges to enhance product metrics such as autocomplete acceptance rates and codebase retrieval accuracy in our open-source initiatives.
Creating thoughtful abstractions that form the backbone of our product and may serve as open-source standards.
Join Continue as a passionate and quick-learning Software Engineer, where your contributions will be crucial in enhancing our open-source product. You'll begin by addressing and exploring user challenges on platforms like GitHub and Discord. In this position, you'll have the opportunity to design, develop, and sustain our open-source IDE extensions, creating a product that is cherished by many, including yourself.
Full-time|$194K/yr - $267.3K/yr|Hybrid|San Francisco, California
Okta secures identity for both people and AI, providing trusted infrastructure that helps organizations adapt to change. The company addresses complex, real-world problems with practical solutions and values urgency, excellence, and teamwork.
The Developer Foundations team is looking for a Staff Software Engineer based in San Francisco, California. This engineer will play a key role in scaling Okta’s systems and accelerating software delivery. The position centers on refining the company’s approach to Continuous Delivery and driving improvements in engineering velocity and productivity across multiple teams.
Collaboration is central in this role. The Staff Software Engineer works closely with engineers, architects, operations, program management, and quality assurance. The team values fresh ideas and solutions that have a direct impact on internal developer experience.
What you will do
Create high-quality internal tools and automation that support continuous delivery and boost developer productivity.
Design and implement Continuous Delivery pipelines for a range of projects, using technologies such as Java, Jenkins, AWS, Docker, Python, Node, iOS, Ruby, Bash, and Go.
Develop proof of concepts, guide technology decisions, contribute to internal frameworks, and participate in design and code reviews.
Roll out solutions to internal users in phases, monitor adoption, gather feedback, and refine approaches to fit team needs.
Maintain pre-production infrastructure on AWS, focusing on monitoring, backup and restore, SLA management, cost control, and deployment processes.
Join Netic, the leading AI revenue engine powering essential services that form the backbone of the American economy.
With $43 million in backing from Founders Fund, Greylock, Hanabi, and Dylan Field, who led our Series B, we have enabled our clients to secure hundreds of thousands of jobs across service sectors in North America. Companies are now operating entirely AI-first thanks to Netic.
Become a part of our dynamic team composed of relentless builders from Scale, Databricks, HRT, Meta, MIT, Stanford, and Harvard, as we integrate frontier AI into the physical economy, tackling complex challenges with immediate and significant impact.
As a Deployment Engineer, you will bring Netic’s AI agents to life for clients in essential service industries. You will operate at the crossroads of software, AI, and operations, managing customer deployments and resolving intricate technical and business challenges to drive measurable business results.
About Watney Robotics
Watney Robotics is pioneering the future of autonomous robotics to enhance critical infrastructure development. Following a successful seed funding round of $21 million led by Conviction, Abstract, and A*, we are collaborating with some of the world's leading hyperscalers to expedite data center construction and maintenance.
This is your chance to join us at a pivotal stage of our journey, where you can influence the growth from prototype to full-scale production fleets. You will be instrumental in delivering operational systems, formulating deployment strategies, and making a significant impact in a transformative robotics company.
Your Role
As the engineering-focused operator, you will oversee the deployment processes for our clients, ensuring seamless systems integration, effective go-live execution, and consistent operational performance. If your background includes coding, log debugging, and managing field operations, this position allows you to leverage your diverse skill set.
You will convert pilot programs into large-scale production, showcasing the comprehensive capabilities of Watney's robots and delivering essential tasks that exceed the competition.
Your Responsibilities
Manage end-to-end deployments: Conduct site surveys, security assessments, integration mapping, phased rollouts, and transition processes.
Develop operational frameworks: Create runbooks, SOPs, and dashboards; oversee uptime, task accuracy, MTTR/MTBF, and quality; lead root cause analysis and resolution efforts.
Feedback integration: Translate field insights into precise requirements for autonomy, control systems, and teleoperations; prioritize enhancements that impact key metrics.
Act as the customer liaison: Collaborate with operators, IT teams, and executives regarding service level agreements and acceptance criteria; build trust through effective on-site and virtual communication.
Standardize processes: Package deployment playbooks and reusable integrations to scale from initial sites to global implementations.
Deliver lasting integrations: Connect robots with DCIM/CMMS (ServiceNow), ticketing systems, inventory management, IAM/SSO, network configurations, and observability tools; define and implement APIs, schemas, webhooks, and messaging queues (REST/gRPC, JSON, Kafka/MQTT).
Your Qualifications
Proven operator (5–10 years): Experience leading critical deployment initiatives; evaluate success based on outcomes rather than activities.
Executive presence: Capable of setting clear expectations, delivering succinct updates, and influencing stakeholders.
Exceptional delivery: Skilled in providing high-touch service and ensuring customer satisfaction.
Full-time|On-site|San Francisco, California, United States
Role overview
Checkr is hiring a Staff Software Engineer, Integrations to support and enhance the company’s integration services. This position centers on building and refining solutions that connect the Checkr platform with external systems. The goal is to increase both connectivity and operational efficiency. As a member of the engineering team, this person will influence how the platform works with partners and customers.
What you will do
Design and implement integration features that broaden the platform’s capabilities
Work closely with other engineers to deliver solutions that are reliable and easy to maintain
Troubleshoot and resolve technical issues related to integrations
Spot opportunities for improvement and suggest actionable changes
Requirements
Strong problem-solving skills and attention to detail
Experience developing or maintaining integration services
Ability to collaborate effectively within an engineering team
Location
San Francisco, California, United States
Join Crusoe as a Staff Network Deployment Engineer in our Lab division. In this pivotal role, you will spearhead the deployment and optimization of our cutting-edge networking technologies. Your expertise will be crucial in ensuring that our systems are robust, efficient, and ready to meet the demands of our innovative projects.
Full-time|$170K/yr - $190K/yr|Hybrid|San Francisco
Ironclad is at the forefront of revolutionizing contract management through its advanced AI contracting platform, which transforms agreements into strategic assets. Our platform accelerates the contracting process, delivers immediate insights, and empowers teams to drive progress, all while keeping you in control. Whether facilitating purchases or sales, Ironclad streamlines the entire process on a single intelligent platform, equipping leaders with the visibility they need to stay ahead of the curve. This is why leading organizations, from Rivian to the World Health Organization and the Associated Press, trust Ironclad to enhance their operational efficiency.
Recognized as an industry leader, Ironclad has earned accolades including being a Leader in the Forrester Wave and Gartner Magic Quadrant for Contract Lifecycle Management, a Fortune Great Place to Work, and Fast Company’s Most Innovative Workplaces. Additionally, we have been featured in Forbes’ AI 50 and Business Insider’s list of Companies to Bet Your Career On. Our growth is fueled by prestigious investors such as Accel, Y Combinator, Sequoia, BOND, and Franklin Templeton. For more details, visit www.ironcladapp.com or connect with us on LinkedIn.
This is a hybrid position requiring in-office attendance at least twice a week on Tuesdays and Thursdays for collaboration and team engagement. Additional in-office days may be scheduled for team or company events.
About the Role
As a Staff Systems Engineer, Automations & Integrations, you will take on a pivotal technical leadership role, crafting and implementing enterprise-wide automations, AI-driven workflows, and managed data integrations across Ironclad’s technology ecosystem. You will collaborate with various teams (GTM, EPD, G&A, etc.) to design resilient, observable automation frameworks and integrations that significantly enhance both employee and customer experiences, as well as accelerate business operations.
Your deep technical knowledge in tools such as Workato, Glean, and our core SaaS/identity stack will be crucial in establishing reusable frameworks, comprehensive error handling, and reliable data flows that scale efficiently.
This position is primarily individual contributor-oriented, with a broad impact scope: you will closely engage with stakeholders and engineering colleagues, mentor fellow automation engineers, and steer the technical direction for high-impact projects in automations, integrations, and AI.
Full-time|$193K/yr - $234K/yr|On-site|San Francisco, CA - US
At Crusoe, we're on a mission to revolutionize the way energy and intelligence coexist. Our vision is to develop a robust infrastructure that empowers individuals to innovate ambitiously with AI, all while embracing principles of sustainability and efficiency.
Join us at the forefront of the AI revolution, where you'll leverage sustainable technology to drive groundbreaking advancements, make a significant impact, and collaborate with a team that's pioneering responsible cloud infrastructure.
About the Role
The Crusoe Cloud Network Deployment Engineering team seeks a dynamic and experienced professional to enhance our Network Engineering efforts. This team is integral to designing, constructing, and managing the global edge, backbone, and data center networks for High-Performance Compute (HPC) Clusters utilizing GPUs. The ideal candidate will be self-motivated, technically adept, and passionate about working with cutting-edge environmental technologies. Exceptional analytical and communication skills, along with a collaborative spirit, are essential.
As a Network Engineer, you will play a pivotal role in expanding the Global Crusoe Network, focusing on deploying new data centers, Points of Presence (PoPs), and backbone infrastructure.
This position offers a unique opportunity to gain valuable experience in large-scale network engineering involving edge, backbone, and HPC-based data center networking.
This position is on-site in San Francisco, CA, or Sunnyvale, CA, and requires in-office presence.
Key Responsibilities:
Deploy, construct, and optimize the global Crusoe Energy Cloud network, including edge, backbone, data center, and public cloud connectivity.
Collaborate with cross-functional teams, including Software Infrastructure and Product, to foster innovation and advancement within the Crusoe Energy Cloud network.
Engage with external vendors and ISPs to evaluate and confirm device and carrier selection.
Participate in 24/7 on-call support for the Crusoe Network.
What You Bring:
A minimum of 10 years of experience in building and operating network solutions at scale within a production environment.
Deep understanding of network protocols such as TCP/IP, QoS, BGP, OSPF/IS-IS, EVPN, VXLAN, and MPLS technologies.
About Us
At Twelve Labs, we are at the forefront of developing innovative multimodal foundation models capable of understanding videos in a human-like manner. Our state-of-the-art models are setting new benchmarks in video-language modeling, allowing for more intuitive interactions and transformative media analysis.
With an impressive $107 million in Seed and Series A funding, we have garnered support from leading venture capital firms like NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, as well as esteemed AI pioneers such as Fei-Fei Li, Silvio Savarese, and Alexandr Wang. Our headquarters are in San Francisco, complemented by a strong presence in APAC, particularly in Seoul, highlighting our commitment to global innovation.
We embrace the individuality of each team member’s journey. The diversity of our cultural, educational, and experiential backgrounds fuels our drive to challenge the norm. We seek passionate individuals eager to contribute to our mission and make a significant impact as we redefine the landscape of technology. Join us in revolutionizing video comprehension and multimodal AI.
Role Overview
As a Senior Staff Software Engineer, you will take charge of the infrastructure and integration layer that ensures Twelve Labs models are seamlessly accessible on partner platforms. Your responsibilities will encompass everything beyond the models, including packaging, validating, deploying model containers, designing API surfaces for various platforms, routing requests, and maintaining production reliability across diverse cloud environments.
You will collaborate closely with our Science, Product, and ML Engineering teams to align model and product roadmaps for successful platform integrations. Your expertise in external model orchestration will be pivotal; you will need to comprehend model component functionalities to make informed integration decisions, although model optimization will not be part of your role.
Your contributions will significantly enhance our capacity to reliably deliver new model versions and features to users across all platforms.
Willingness to travel up to 10% annually for conferences, off-site meetings, and other business-related events is expected. Participation in on-site interviews and/or in-person onboarding processes may also be required.
Gimlet Labs is pioneering the creation of the first heterogeneous neocloud specifically designed for AI workloads. As the demand for AI systems continues to grow, the industry is encountering significant challenges related to power, capacity, and cost with current homogeneous, vertically integrated infrastructures. Gimlet tackles these limitations by decoupling AI workloads from their physical hardware, intelligently partitioning tasks into components and orchestrating them to the most suitable hardware, optimizing for performance and efficiency. This innovative approach facilitates the use of heterogeneous systems across multiple vendors and generations of hardware, including the latest cutting-edge accelerators, enabling substantial improvements in performance and cost efficiency at scale.
Furthermore, Gimlet is developing a robust production-grade neocloud tailored for agentic workloads. Our customers benefit from deploying and managing their workloads seamlessly through stable, production-ready APIs, alleviating the need to focus on hardware selection, placement, or intricate performance optimizations.
Collaborating with foundation labs, hyperscalers, and AI-native organizations, Gimlet powers real-world production workloads capable of scaling to gigawatt-class AI datacenters.
Gimlet Labs is on the lookout for a Technical Staff Intern to assist in the development of our platform dedicated to deploying and monitoring AI workloads. In this role, you will leverage the latest AI methodologies to create frameworks that enhance and optimize AI workloads. You will play a vital role in advancing Gimlet’s unique compilation framework, facilitating the partitioning and orchestration of AI workloads across varied hardware environments. Your designs will lead to scalable systems capable of handling production workloads of millions of requests per second.
Join Our Team
At Parallel, we are pioneers in web infrastructure, empowering leading industries (including sales, marketing, insurance, and coding) to create state-of-the-art AI agents with flexible, programmatic access to the web. We have successfully secured $130 million in funding from top-tier investors such as Kleiner Perkins, Index Ventures, and Khosla Ventures, allowing us to expand our mission.
As a Member of Technical Staff specializing in Developer Integrations, you will play a crucial role in designing and building robust API integrations within the fast-paced AI landscape. Your expertise will facilitate seamless connections between our platform and various third-party AI tools, including developing custom nodes, plugins, or connectors that enhance functionality and enable novel workflows.