Experience Level
Entry Level
About the job
About the Role
Cognition is looking for an Infrastructure Software Engineer in the San Francisco Bay Area. This role focuses on designing, building, and maintaining infrastructure that supports high-performance systems. Collaboration with teams across the company is central to the work.
What You Will Do
Develop and maintain scalable infrastructure solutions
Work closely with colleagues from different disciplines to support application needs
Help ensure systems remain reliable and efficient as they grow
Full-time|$180K/yr - $247.5K/yr|Remote|San Francisco or Remote
Join the Revolution at Check
At Check, we are transforming the payroll landscape. Our mission goes beyond just building a successful business; we collaborate with our partners to innovate payroll solutions. As pioneers of embedded payroll, we are reshaping the payment process, enabling payroll businesses to launch, expand, and succeed with ease.
Check is more than an API; we are the catalyst for developing and scaling payroll operations.
Our Team
The payroll system is in dire need of innovation. We invite you to join a passionate team dedicated to making an impactful change! At Check, you will leverage creative problem-solving and critical thinking to influence every business we partner with. We view challenges as opportunities for improvement, valuing the unique contributions of each team member in our collective mission.
If you're ready to dive in and transform payroll, let's collaborate to simplify complexity and enhance the future for businesses of all sizes.
Your Role
At Check, engineering is our foundation. We believe that payroll should resemble modern financial software; achieving this requires a comprehensive understanding of systems and reliable infrastructure that our partners can trust. Every product we deliver relies on scalable and secure systems that ensure timely payments and payroll processing.
We are seeking a Staff Software Engineer who possesses strong software design capabilities coupled with hands-on infrastructure experience. In this position, you will focus on the essential systems that drive payroll operations, enhancing our service scalability, production operations, and empowering engineers with the tools to deliver software confidently and securely.
You will collaborate across product and platform areas to enhance our cloud infrastructure, fortify our deployment and monitoring strategies, and streamline the architecture that supports embedded payroll services. The challenges you will address often intersect infrastructure, product, and operational domains.
This opportunity is perfect for someone who has managed complex systems end-to-end in a dynamic environment and takes pride in developing resilient, comprehensible infrastructure that is vital to our operations.
Full-time|$400K/yr - $450K/yr|On-site|San Francisco Bay Area
Join Discord, a platform that connects over 200 million users every month primarily through gaming. With over 90% of our users engaged in gaming activities, we facilitate over 1.5 billion hours of gaming conversations, enhancing the experience before, during, and after gameplay.
The Infrastructure organization at Discord is fundamental to our user experience. We handle the real-time delivery of over 40 million events per second and manage the storage of trillions of messages, ensuring robust connections among our vast user base. As a Principal Engineer, you will play a pivotal role in guiding our infrastructure teams, shaping our technical vision, and maintaining the reliability of Discord at a massive scale.
This position is ideal for a professional who excels at the intersection of advanced technical skills and organizational leadership. You will contribute to our infrastructure roadmap, address our most challenging technical dilemmas, and ensure our systems can efficiently scale to accommodate the next wave of users.
Full-time|$190K/yr - $280K/yr|Hybrid|San Francisco, California
About Sentry
Sentry is dedicated to eliminating poor software experiences. Our mission is to empower developers to create high-quality software swiftly, allowing everyone to enjoy technology to its fullest.
With over $217 million raised in funding and a community of over 100,000 organizations, including giants like Disney, Microsoft, and Atlassian, we are developing state-of-the-art performance and error monitoring tools. Our solutions help our partners minimize time spent on bug fixes and maximize product development.
In our commitment to collaboration, Sentry follows a hybrid work model across our global offices. We have designated Mondays, Tuesdays, and Thursdays as in-office days to foster effective teamwork. If you are passionate about building tools that enhance the digital experience, join us in creating the next generation of software monitoring solutions.
About the Role
At Sentry.io, we offer vital services for diagnosing application health issues. Our tools are crucial for organizations aiming to respond adeptly in dynamic markets. We ensure a seamless and enjoyable experience in the development and deployment of these tools through a robust continuous integration environment and an insightful deployment pipeline.
As part of the Infrastructure Engineering team, your contributions will be instrumental in supporting Sentry's growth and enabling engineering teams to operate with agility and confidence.
Your responsibilities will include designing, developing, and maintaining internal software and platform capabilities that alleviate the cognitive load associated with infrastructure and developer tooling. You will create dependable, reusable abstractions that facilitate rapid shipping of features while incorporating durability, security, and operational excellence into service development and management.
This role demands strong engineering judgment: selecting reliable technologies, planning for scalability from the outset, and crafting solutions that serve multiple teams. Your focus will be on practical systems that enhance reliability and ownership across the organization, driving adoption through comprehensive documentation, well-designed APIs, and seamless developer experiences that integrate into daily workflows.
Ultimately, you will empower engineering teams to flourish within a culture of ownership, enabling them to deploy, manage, and evolve services confidently while minimizing operational burdens.
Key Responsibilities
Design systems that scale with company growth, ensuring a balance of reliability, performance, and cost-efficiency.
Develop platform services that enhance internal operations and developer productivity.
Join our innovative team at Astranis as a Senior Software Engineer specializing in Infrastructure. In this role, you will be responsible for designing, implementing, and maintaining robust infrastructure solutions that support our cutting-edge satellite technology. Your expertise will play a crucial role in enhancing the reliability and scalability of our systems.
About the Team
At ChatGPT, we are at the forefront of innovation, continuously enhancing our system with new capabilities and adapting to ever-evolving user needs. To sustain our rapid pace of development, we require a robust infrastructure capable of managing real-world production challenges, such as high concurrency and unpredictable traffic patterns.
The mission of the ChatGPT Infrastructure team is to design and maintain the foundational platforms that facilitate swift iterations without compromising on performance or reliability. We create the shared systems, data pathways, rollout procedures, and reliability measures that enable teams to deploy changes to ChatGPT efficiently and at scale.
Our focus is on high-impact infrastructure: we develop fundamental systems and streamlined processes that leverage hard-earned operational insights, ensuring that engineers do not have to repeatedly navigate similar challenges and pitfalls as they innovate.
About the Role
We are seeking experienced Senior and Staff Software Engineers to architect and construct the underlying infrastructure that supports ChatGPT, amplifying the productivity of teams working on user experience.
This role transcends mere maintenance; it is about building platforms: you will define interfaces, develop essential abstractions, and create tools that promote safe and rapid iterations. Your contributions will lead to reduced friction, fewer regressions, enhanced performance, and systems that scale seamlessly as our product grows.
Where You Can Make a Difference
As part of our team, you may engage with one or more of the following areas:
Platform Foundations & Frameworks: Craft core libraries, service frameworks, and shared components that standardize system development and integration.
Scalability & Performance Primitives: Develop patterns and infrastructure aimed at minimizing latency, boosting throughput, and maintaining cost efficiency as demand increases.
Reliability Guardrails: Implement design mechanisms to prevent outages, including rate limiting, load shedding, and safe fallbacks.
Developer Productivity via Golden Paths: Establish streamlined workflows that make common processes fast, safe, and user-friendly.
Observability & Debugging Systems: Create instrumentation and metrics models to enhance debugging capabilities.
About Us
At Koah Labs, we are pioneering an innovative ad network designed to empower the next wave of AI-driven products. Our goal is to enable publishers to maximize their revenue while allowing advertisers to effectively connect with their target audiences, all while prioritizing speed, user experience, and privacy.
We are a dynamic and closely-knit team based in San Francisco, composed of professionals with experience from leading companies such as X, Apple, and Meta, as well as early-stage startups. With backing from top-tier investors, we are rapidly expanding and gaining significant traction on both the publisher and advertiser fronts.
Joining Koah means becoming part of something transformative from the start: you will be actively involved in deploying code that influences the company and the broader ecosystem. We value agility, trust, and a strong commitment to our craft.
Technological Framework
Infrastructure: Terraform, AWS, LGTM (Loki, Grafana, Tempo, Mimir), Tailscale, Cloudflare
Data Management: PostgreSQL, ClickHouse, Redis, Kafka, Python
Core Applications: Ruby on Rails, React, TypeScript
SDKs: Flutter, React Native, Android, iOS
Position Overview
We are on the lookout for talented engineers with a passion for designing, developing, and maintaining the infrastructure that supports our adtech platform.
Ideal Candidates Will Have:
Experience managing or operating robust systems in production at scale.
A meticulous approach to detail with a strong emphasis on performance.
A desire to deeply engage with challenges and take ownership of the foundational systems that drive our products.
An enthusiasm for experimentation, measurement, and crafting dependable systems.
A willingness to assume responsibility across a diverse array of systems.
Join Netic, the cutting-edge AI revenue engine powering essential services that form the backbone of the American economy.
Backed by $43 million in funding from top investors like Founders Fund, Greylock, Hanabi, and Dylan Field, we have empowered our clients to secure hundreds of thousands of jobs across various service industries in North America. As a pioneer in AI-driven solutions, we are witnessing the emergence of companies operating entirely on our AI-first platform.
As an Agent Infrastructure Engineer, you will be at the forefront of architecting and scaling the core framework that underpins our autonomous AI agents, addressing complex real-world challenges with immediate and significant impacts. Collaborate with a passionate team of innovators from renowned companies such as Scale, Databricks, HRT, Meta, MIT, Stanford, and Harvard, as we bring frontier AI to the physical economy where the stakes are high, and the data is intricate.
If you thrive in dynamic, fast-paced environments and are eager to set new benchmarks in the agentic space, seize this opportunity to make your mark!
About Emergent Labs Inc. Emergent Labs builds autonomous coding agents that generate, test, and deploy production-ready applications from natural language instructions. Our systems run globally, supporting millions of users as they create real software with minimal friction. Since our public debut, Emergent has reached $100M ARR in just 8 months and now serves over 6 million users in more than 190 countries. Users have built over 6.5 million applications on our platform. We have secured backing from investors such as Khosla Ventures, SoftBank, Google, Lightspeed, Prosus, Together, and Y Combinator, raising more than $100M to fuel our growth. The team includes repeat founders, Olympiad medalists, IIT & IIM alumni, and engineers with backgrounds at Google, Amazon, and Dropbox. We focus on the toughest problems in AI-driven software creation: correctness, reliability, security, and scaling in live production systems. Role Overview: Infrastructure Software Engineer This role is based in San Francisco. Emergent Labs is looking for engineers who want to help shape the future of software development at a global scale. Expect ownership, autonomy, and the chance to work quickly on impactful systems.
About Middesk
Middesk simplifies collaboration for businesses by transforming identity verification processes. Since our inception in 2018, we have replaced outdated, manual methods with an efficient platform that provides seamless access to comprehensive and current data. Our services enable companies across various sectors to confidently verify business identities, accelerate customer onboarding, and mitigate risks throughout the customer lifecycle.
As a proud Y Combinator graduate, Middesk is supported by notable investors such as Sequoia Capital and Accel Partners. We have recently been featured on Forbes’ Fintech 50 List and recognized as a leader in business verification by the digital identity strategy firm, Liminal.
About the Middesk Engineering Team:
At Middesk, we prioritize delivering value to our customers through a concept we call 'Velocity'. This term embodies our commitment to achieving meaningful outcomes rather than merely focusing on code delivery speed. We believe that exceptional products arise from a blend of technical expertise and a profound understanding of our customers’ needs. Our engineering team is composed of humble, self-driven individuals who are dedicated to addressing even the most complex challenges faced by our clients. At Middesk Engineering, our mission is to put customers first.
Your Role:
We are seeking a talented Infrastructure Engineer to join our DevSecOps team. Your mission will be to empower engineering teams by providing secure, cost-effective, and scalable platform capabilities that enhance software delivery, improve developer experience, and ensure compliance with industry standards. You will be responsible for developing the tools and infrastructure necessary to scale our development and production systems. Your contributions will directly impact the entire Software Development Lifecycle and overall developer experience (DevEx). The systems you will support include Kubernetes, cloud infrastructure, observability, and local development environments.
Our work environment is hybrid, requiring a presence in our San Francisco or New York City offices for 2 days each week. Candidates must reside within a reasonable commuting distance as we value in-person collaboration while also supporting flexible work arrangements.
Key Responsibilities:
Architect, build, and scale cloud infrastructure and orchestration systems (e.g., Kubernetes, Terraform, CI/CD).
Take ownership of and enhance developer experience (DevEx) tools and workflows, spanning from local development to deployment.
Develop observability systems that offer insights into performance, reliability, and usage metrics.
Join Our Innovative Team
At OpenAI, security is the cornerstone of our commitment to ensuring artificial general intelligence serves all of humanity. The Identity Infrastructure Engineering team is pivotal in this mission, crafting robust identity and access management solutions that safeguard our model weights, customer data, and essential systems across diverse cloud environments. Collaborating closely with teams across Applied Engineering, Research, IT, and Security, we deliver a secure and scalable platform that empowers permissioning, orchestration, and groundbreaking AI research.
Your Role
As a Software Engineer on our Identity Infrastructure Engineering team, you will play a crucial role in designing, deploying, and managing foundational security tools and infrastructure. This position involves leveraging a wide array of technologies to support multi-cloud deployments, ensuring our researchers and engineers can securely build, test, and scale transformative AI systems. We seek individuals who are technically adept, collaborative, and passionate about integrating secure-by-default principles throughout our technology stack.
We invite Software Engineers eager to address challenges in:
Identity & Access Management: Develop and sustain systems and interfaces that efficiently manage user and service identities, guaranteeing consistent, fine-grained access controls across various cloud providers and internal services.
Multi-Cloud Security: Architect and implement tools that protect model weights, proprietary data, and sensitive assets, seamlessly operating within AWS, Azure, GCP, and future cloud environments.
Automation & Tooling: Create robust frameworks, APIs, and CLI tools that automate ongoing security tasks (such as credential provisioning and rotation), allowing teams to focus on AI innovation without compromising security.
In this position, you will:
Develop new features for our IAM platform that integrate seamlessly with evolving cloud services, enabling teams to operate efficiently while adhering to security best practices.
Lead security innovations by designing tools, processes, and frameworks that enhance our infrastructure.
This role is primarily based in San Francisco, Seattle, or New York City, with remote work options considered. We embrace a hybrid work model, requiring three days in the office weekly, and provide relocation assistance for new hires.
Senior Software Engineer, Infrastructure & Platform
Role Overview
In the role of Senior Software Engineer, Infrastructure & Platform at AfterQuery, you will take on the exciting challenge of designing and constructing the essential infrastructure that drives our innovative data generation, evaluation, and agentic systems.
Your responsibilities will include developing shared platforms that empower our engineering and research teams to execute large-scale human-in-the-loop workflows, evaluation harnesses, and automated data pipelines essential for training cutting-edge AI models.
This position demands a high level of technical expertise and offers extensive ownership. You will be responsible for architecting and building the foundational infrastructure relied upon by numerous engineers, ensuring that systems are scalable, reliable, and capable of handling high-throughput workloads.
Collaboration with the founding team will be key as you define system architecture, establish best engineering practices, and create the infrastructure that supports the evolution of AI development.
Position Overview
Join OpenEvidence as a Data Infrastructure Software Engineer, where you will engineer comprehensive systems that drive essential product and research operations. Your focus will be on optimizing performance, ensuring scalability, and enhancing accuracy, while enjoying the autonomy to manage the infrastructure that assists healthcare professionals in navigating complex clinical decisions in real-time.
We value exceptional creators who thrive in versatile roles. Our engineers engage across various products and projects, taking ownership wherever they can make the most significant impact.
About OpenEvidence
OpenEvidence is the leading medical AI platform globally, utilized by over 40% of clinicians in the U.S. in just over a year through organic product-led growth. As a $12 billion company, our engineering team comprises 30 talented individuals from MIT, Harvard, and Stanford. We believe that groundbreaking products are born from a small group of exceptional builders, driven by focused goals and empowered to take ownership and act swiftly. We are expanding our team to capitalize on an unparalleled opportunity to set the standard for medical AI platforms.
If you are a top-tier engineer or scientist eager to push the boundaries and achieve tangible outcomes that affect millions of lives, we want to connect with you.
Our Culture
We expect our work to be performed at an elite level. The journey from concept to execution and scaling is akin to a professional sport, where excellence is non-negotiable. We believe that the creation of innovative technologies is only achievable through complete ownership. Significant achievements happen when individuals take the initiative to see them through.
Your Profile
This role is not for those seeking a 9-to-5 job or merely looking to write papers. If you are ready to dive into the trenches, tackle challenges head-on, and create something from scratch that could impact millions and drive substantial revenue, you might be the perfect fit.
We seek brilliant builders who are intelligent, ambitious, resourceful, self-reliant, detail-oriented, driven, hardworking, and humble. Does this sound rare? It is, as we have only found 30 of them so far, and we are eager to discover more.
Full-time|$200K/yr - $200K/yr|On-site|San Francisco
Join Convex in revolutionizing application development!
At Convex, we are on a mission to redefine how software is constructed on the Internet. Our innovative platform enables developers to create swift, dependable, and dynamic applications without the need for a backend team. We offer a comprehensive full-stack application platform, meticulously designed with abstractions for databases, computing, and backend services, allowing both developers and LLMs to innovate rapidly, ensuring products that are scalable and maintain simplicity throughout their lifecycle.
About Our Team:
Our Convex team comprises engineers who have architected and built some of the largest backends globally, managing exabytes of data and millions of transactions per second. We are a friendly, collaborative group of passionate individuals who thrive on in-person collaboration in our San Francisco office.
Position Overview:
As Convex evolves, we are seeking outstanding senior or staff-level engineers to help us architect and sustain the future of our infrastructure at scale. If you have a passion for distributed systems and a robust background in designing and managing web infrastructure, we want to connect with you!
We value robust architecture, effective collaboration, and simplicity. Our team embraces high ownership and places significant emphasis on operational excellence. This role is not solely focused on operations; we seek individuals who are dedicated to designing and constructing systems in the most effective manner possible, especially in a startup environment.
Your Responsibilities:
Architect, construct, and oversee Convex’s global cloud infrastructure.
Analyze and enhance the performance and reliability of our systems.
Independently prioritize projects, collaborating closely with the engineering team and CTO.
Establish best practices and reliability standards as we expand our team and systems.
Develop sophisticated systems and database code.
Engage with feedback from leadership regarding seeking simpler and more elegant solutions.
What We Value:
A strong enthusiasm for distributed systems and backend infrastructure.
A collaborative spirit and a desire to grow with the team.
A commitment to best practices and maintaining high standards in engineering.
Full-time|$160K/yr - $225K/yr|Hybrid|San Francisco, CA (Hybrid)
About Fable Security
In today’s digital landscape, AI-driven threats and human errors represent the most significant risks to enterprise security. Cybercriminals exploit human behavior, contributing to 70% of security breaches. At Fable, we empower individuals to transform from potential targets to active defenders with innovative tools.
Fable is at the forefront of human risk management, offering a platform that effectively influences employee behavior. Our user-friendly, scalable solution analyzes complex employee data, identifies high-risk behaviors, and delivers timely interventions directly to users in their work environment.
Supported by notable investors like Redpoint Ventures and Greylock Partners, and founded by former members of the Abnormal Security team, Fable is tackling one of cybersecurity's greatest challenges in a rapidly expanding market. Our team comprises alumni from esteemed organizations such as Meta, Twitter, and Flexport, as well as top universities including Waterloo, Columbia, and Stanford. This is an exceptional opportunity for you to join us at a time of rapid growth and help shape the future of security.
Why Join Us
Build and scale the foundational data infrastructure that drives a groundbreaking product.
Collaborate closely with engineering, data science, and product teams to operationalize data at scale.
Become part of a small, high-caliber team where your contributions will have a significant impact.
As part of an early-stage company, every engineer plays a crucial role in shaping the evolution of our products and the company's approach to data management.
Your Role
As a Platform and Infrastructure Engineer, you will be instrumental in developing and scaling the core systems that underpin Fable’s product and data operations.
Your responsibilities will span backend systems including real-time services and data pipelines. You will ensure reliability, scalability, and optimal performance across all layers. This highly collaborative role involves working closely with data and ML teams, contributing to systems that effectively manage data ingestion, processing, and delivery.
This role demands cross-functional collaboration with engineering, data, and product teams to create robust, production-grade systems that grow alongside the company.
Responsibilities
Design, develop, and maintain scalable backend and infrastructure systems.
Collaborate with cross-functional teams to deliver high-quality software solutions.
Ensure system reliability, performance, and security through rigorous testing and monitoring.
Compensation: Competitive base salary + substantial equity
Benefits: Health & dental insurance, gym reimbursement, daily team lunches, 401(K)
About Julius
At Julius, we're pioneering advancements in applied AI by developing cutting-edge coding agents. Our platform executes approximately 1 million lines of code every 36 hours, serving over 1 million users and generating 3 million+ visualizations. We manage all code in isolated remote containers. As a revenue-generating entity, we are backed by AI Grant and founders with remarkable backgrounds from companies like Vercel, Notion, Perplexity, Palantir, Replit, Zapier, Intercom, and Dropbox.
The Role
Join us in building and scaling the robust code-execution platform that powers Julius, across both cloud and on-prem environments. We orchestrate over 500,000 containers/month and the demand is growing rapidly. You will take ownership of reliability, performance, and security within our multi-tenant compute environment.
Your Responsibilities
Design and manage a secure, multi-tenant container infrastructure that ensures quick startup and intelligent autoscaling.
Implement on-prem/private cloud deployments using Helm and Terraform, integrating SSO, network controls, and audit logging.
Enhance observability (metrics, traces, logs) with well-defined SLOs and lead incident response initiatives.
Optimize images, scheduling, networking, and costs, while developing fair-use and rate-limiting controls.
Your Qualifications
Strong experience with production Kubernetes and container internals (Docker/containerd); solid understanding of networking principles.
Familiarity with cloud environments (AWS/GCP/Azure) and Infrastructure as Code (Terraform/Helm).
Proficiency in monitoring and logging tools (Prometheus, Grafana, OpenTelemetry, ELK/Vector).
Understanding of security best practices for containerized, multi-tenant systems.
Preferred Qualifications
Experience with gVisor, Kata, Firecracker; Cilium/eBPF; GPU scheduling; serverless autoscaling (KEDA/Knative/Karpenter).
Proven experience delivering on-prem or air-gapped enterprise software solutions.
A passion for AI, with experience building side projects involving LLMs.
Why Join Julius?
Be part of a small, senior team where your contributions will have a massive impact. Tackle challenging infrastructure problems at a meaningful scale.
About the Team
Join the innovative Frontier Systems team at OpenAI, where we design, implement, and maintain the world's largest supercomputers, essential for advancing our most groundbreaking model training initiatives.
We transform data center blueprints into operational systems while crafting the software necessary for executing large-scale frontier model trainings.
Our mission is to establish, stabilize, and ensure the reliability and efficiency of these hyperscale supercomputers throughout the training of our frontier models.
About the Role
We are seeking passionate engineers to manage the next generation of compute clusters that underpin OpenAI’s frontier research.
This position merges distributed systems engineering with practical infrastructure work across our expansive data centers. You will scale Kubernetes clusters to unprecedented levels, automate bare-metal setups, and create the software layer that simplifies the complexity of numerous nodes across various data centers.
Your work will be at the crossroads of hardware and software, where speed and reliability are paramount. Be prepared to oversee dynamic operations, swiftly identify and resolve pressing issues, and constantly elevate the standards for automation and uptime.
In this role, you will:
Provision and scale extensive Kubernetes clusters, including automation for deployment, bootstrapping, and lifecycle management
Create software abstractions that integrate multiple clusters and provide a cohesive interface for training workloads
Oversee node deployment from bare metal to firmware upgrades, ensuring rapid, repeatable setups at scale
Enhance operational metrics by reducing cluster restart times (e.g., from hours to minutes) and expediting firmware and OS upgrade cycles
Integrate networking and hardware health systems to ensure end-to-end reliability across servers, switches, and data center infrastructure
Develop monitoring and observability systems to identify issues early and maintain cluster stability under high loads
You might thrive in this role if you:
Have extensive experience operating or scaling Kubernetes clusters or similar container orchestration systems in high-growth or hyperscale environments
Possess strong programming skills in languages relevant to cloud and infrastructure management
Join the Crew of Ivo!
At Ivo, we are more than just engineers; we are the pioneers of the digital seas! Our crew has set sail with groundbreaking innovations that have reshaped the landscape of legal tech:
• An AI agent that seamlessly integrates with MS Word to enhance your documents [2023]
• Transitioning from traditional embedding models to agentic RAG for superior performance [2023]
• Advancing large-scale LLM-driven legal fact extraction [2024]
• A legal assistant capable of accurately searching vast contract databases [2024]
• Clustering legal documents from the same lineage [2025]
• Implementing automatic deviation analysis to uncover hidden risks in extensive contract databases [2025]
• Merging contracts with amendments to create comprehensive “composite” contracts (one of our clients shed tears of joy upon seeing this) [2025]
The Role of an Infrastructure Engineer
As an Infrastructure Engineer, you will be the architect of Ivo's platform, ensuring its robustness and scalability.
Your mission includes:
• Taking ownership of our environment's future, with ample room for creative system design.
• Managing numerous customer deployments: every client deserves a unique setup, from containers to databases.
• Instrumenting our systems to identify performance bottlenecks and errors.
• Aggregating metrics, logs, and health checks into user-friendly dashboards and alerts.
• Leading the charge during infrastructure incidents.
• Accelerating our CI/CD system (currently a sluggish ~12 minutes; let's speed that up!).
If you share our passion for LLMs and thrive in a dynamic environment, we want you to help us push the boundaries of DevOps:
• Innovating real-time LLM evaluations to ensure the accuracy of our outputs.
• Building upon our existing infrastructure to enhance performance and reliability.
Set sail with us at Ivo, where your technical skills will help chart the course for the future of legal technology!
Full-time|$148K/yr - $200K/yr|On-site|San Francisco, CA
Biotechnology is transforming our world, influencing everything from the medicines we consume to the crops we cultivate and the materials we utilize daily. To keep pace with the rapid advancements in science, we require cutting-edge technology.
At Benchling, our mission is to harness the potential of biotechnology. The most pioneering biotech firms rely on Benchling’s R&D Cloud to facilitate the creation of innovative products and accelerate their journey to milestones and market readiness.
Join us in bringing state-of-the-art software solutions to the forefront of modern science.
ROLE OVERVIEW
We are seeking a talented Backend Software Engineer to join our Infrastructure Engineering team, where you will build and maintain the foundational platform that powers our product offerings. This role spans various infrastructure disciplines, including cloud infrastructure on AWS, services based on Kubernetes, and the operational tools necessary to ensure system reliability. Collaboration is key as you will work closely with product engineering teams to better understand their requirements and enhance the developer experience throughout Benchling.
We are looking for a motivated early-career engineer with strong foundational skills and a desire to grow. The ideal candidate will be enthusiastic about learning, eager to contribute across diverse areas, and ready to take on increasing responsibilities. Given our operation in a regulated environment, your work will focus on building reliable, secure, and auditable systems.
RESPONSIBILITIES
Develop, sustain, and enhance core infrastructure and platform services utilized by our product engineering teams.
Collaborate with product teams to establish requirements, design efficient pathways, and minimize deployment and operational challenges.
Contribute to our Kubernetes-based platform, including service configuration, traffic management, and platform tooling.
Design and manage AWS infrastructure and automation, emphasizing scalability, cost efficiency, and resilience.
Enhance observability and operational readiness through metrics, logging, tracing, dashboards, and alert systems.
Engage in an on-call rotation, manage incident responses, and drive process improvements to avert future occurrences.
Produce clear technical documentation and engage in design discussions and reviews; focus on incremental system improvements for maintainability and reliability.
About the Team
Join OpenAI's Privacy Engineering team, where we operate at the vital crossroads of Security, Privacy, Legal, and Core Infrastructure. Our mission is to develop cutting-edge data infrastructure and systems that empower our privacy, legal, and security teams to operate securely, swiftly, and at scale. We adhere to principles of defensibility by default, enabling impactful research, and fostering a robust security culture in preparation for transformative technologies.
About the Role
We are seeking a talented Software Engineer to design and implement technical systems that facilitate legal compliance workflows, including secure data processing and document review. In this role, you will collaborate closely with Legal, Security, IT, and engineering teams to translate legal processes into actionable technical workflows. This position is perfect for an engineer passionate about large-scale data challenges and who understands the meticulousness required in ensuring compliance.
Located in the vibrant city of San Francisco, we offer relocation assistance for qualified candidates.
Key Responsibilities:
Design and maintain scalable data storage pipelines.
Develop search and discovery services (e.g., Spark/Databricks, index layers, metadata catalogs) tailored to partner team requirements.
Automate secure data transfers, including encryption, checksumming, and auditing exports to reviewers.
Establish secure compute environments that balance usability with stringent security controls.
Implement monitoring and KPIs to ensure accountability of data holds and productions.
Work cross-functionally to document SOPs, threat models, and chain-of-custody documentation that can withstand scrutiny.
Ideal Candidates Will:
Possess practical experience in building or operating large-scale data-lake or backup systems (Azure, AWS, GCP).
Be proficient with Terraform or Pulumi, CI/CD processes, and capable of converting ad-hoc legal requests into repeatable pipelines.
Be comfortable working with discovery workflows (legal holds, enterprise document collections, secure review) or eager to quickly gain expertise.
Effectively communicate technical concepts, from storage governance to block-ID APIs, to interdisciplinary teams such as Legal and Engineering.
About the Team
Join the innovative Frontier Systems team at OpenAI, where we design, implement, and maintain the world's largest supercomputers, essential for advancing our most groundbreaking model training initiatives. We transform data center blueprints into operational systems while crafting the software necessary for executing large-scale frontier model trainings. Our mission is to establish, stabilize, and ensure the reliability and efficiency of these hyperscale supercomputers throughout the training of our frontier models.
About the Role
We are seeking passionate engineers to manage the next generation of compute clusters that underpin OpenAI’s frontier research. This position merges distributed systems engineering with practical infrastructure work across our expansive data centers. You will scale Kubernetes clusters to unprecedented levels, automate bare-metal setups, and create the software layer that simplifies the complexity of numerous nodes across various data centers. Your work will be at the crossroads of hardware and software, where speed and reliability are paramount.
Be prepared to oversee dynamic operations, swiftly identify and resolve pressing issues, and constantly elevate the standards for automation and uptime.
In this role, you will:
Provision and scale extensive Kubernetes clusters, including automation for deployment, bootstrapping, and lifecycle management
Create software abstractions that integrate multiple clusters and provide a cohesive interface for training workloads
Oversee node deployment from bare metal to firmware upgrades, ensuring rapid, repeatable setups at scale
Enhance operational metrics by reducing cluster restart times (e.g., from hours to minutes) and expediting firmware and OS upgrade cycles
Integrate networking and hardware health systems to ensure end-to-end reliability across servers, switches, and data center infrastructure
Develop monitoring and observability systems to identify issues early and maintain cluster stability under high loads
You might thrive in this role if you:
Have extensive experience operating or scaling Kubernetes clusters or similar container orchestration systems in high-growth or hyperscale environments
Possess strong programming skills in languages relevant to cloud and infrastructure management
Join the Crew of Ivo!
At Ivo, we are more than just engineers; we are the pioneers of the digital seas! Our crew has set sail with groundbreaking innovations that have reshaped the landscape of legal tech:
• An AI agent that seamlessly integrates with MS Word to enhance your documents [2023]
• Transitioning from traditional embedding models to agentic RAG for superior performance [2023]
• Advancing large-scale LLM-driven legal fact extraction [2024]
• A legal assistant capable of accurately searching vast contract databases [2024]
• Clustering legal documents from the same lineage [2025]
• Implementing automatic deviation analysis to uncover hidden risks in extensive contract databases [2025]
• Merging contracts with amendments to create comprehensive “composite” contracts (one of our clients shed tears of joy upon seeing this) [2025]
The Role of an Infrastructure Engineer
As an Infrastructure Engineer, you will be the architect of Ivo's platform, ensuring its robustness and scalability.
Your mission includes:
• Taking ownership of our environment's future, with ample room for creative system design.
• Managing numerous customer deployments; every client deserves a unique setup, from containers to databases.
• Instrumenting our systems to identify performance bottlenecks and errors.
• Aggregating metrics, logs, and health checks into user-friendly dashboards and alerts.
• Leading the charge during infrastructure incidents.
• Accelerating our CI/CD system (currently a sluggish ~12 minutes; let's speed that up!).
If you share our passion for LLMs and thrive in a dynamic environment, we want you to help us push the boundaries of DevOps:
• Innovating real-time LLM evaluations to ensure the accuracy of our outputs.
• Building upon our existing infrastructure to enhance performance and reliability.
Set sail with us at Ivo, where your technical skills will help chart the course for the future of legal technology!
Full-time|$148K/yr - $200K/yr|On-site|San Francisco, CA
Biotechnology is transforming our world, influencing everything from the medicines we consume to the crops we cultivate and the materials we utilize daily. To keep pace with the rapid advancements in science, we require cutting-edge technology. At Benchling, our mission is to harness the potential of biotechnology. The most pioneering biotech firms rely on Benchling’s R&D Cloud to facilitate the creation of innovative products and accelerate their journey to milestones and market readiness. Join us in bringing state-of-the-art software solutions to the forefront of modern science.
ROLE OVERVIEW
We are seeking a talented Backend Software Engineer to join our Infrastructure Engineering team, where you will build and maintain the foundational platform that powers our product offerings. This role spans various infrastructure disciplines, including cloud infrastructure on AWS, services based on Kubernetes, and the operational tools necessary to ensure system reliability. Collaboration is key as you will work closely with product engineering teams to better understand their requirements and enhance the developer experience throughout Benchling. We are looking for a motivated early-career engineer with strong foundational skills and a desire to grow. The ideal candidate will be enthusiastic about learning, eager to contribute across diverse areas, and ready to take on increasing responsibilities.
Given our operation in a regulated environment, your work will focus on building reliable, secure, and auditable systems.
RESPONSIBILITIES
Develop, sustain, and enhance core infrastructure and platform services utilized by our product engineering teams.
Collaborate with product teams to establish requirements, design efficient pathways, and minimize deployment and operational challenges.
Contribute to our Kubernetes-based platform, including service configuration, traffic management, and platform tooling.
Design and manage AWS infrastructure and automation, emphasizing scalability, cost efficiency, and resilience.
Enhance observability and operational readiness through metrics, logging, tracing, dashboards, and alert systems.
Engage in an on-call rotation, manage incident responses, and drive process improvements to avert future occurrences.
Produce clear technical documentation and engage in design discussions and reviews; focus on incremental system improvements for maintainability and reliability.
About the Team
Join OpenAI's Privacy Engineering team, where we operate at the vital crossroads of Security, Privacy, Legal, and Core Infrastructure. Our mission is to develop cutting-edge data infrastructure and systems that empower our privacy, legal, and security teams to operate securely, swiftly, and at scale. We adhere to principles of defensibility by default, enabling impactful research, and fostering a robust security culture in preparation for transformative technologies.
About the Role
We are seeking a talented Software Engineer to design and implement technical systems that facilitate legal compliance workflows, including secure data processing and document review. In this role, you will collaborate closely with Legal, Security, IT, and engineering teams to translate legal processes into actionable technical workflows. This position is perfect for an engineer passionate about large-scale data challenges and who understands the meticulousness required in ensuring compliance. Located in the vibrant city of San Francisco, we offer relocation assistance for qualified candidates.
Key Responsibilities:
Design and maintain scalable data storage pipelines.
Develop search and discovery services (e.g., Spark/Databricks, index layers, metadata catalogs) tailored to partner team requirements.
Automate secure data transfers, including encryption, checksumming, and auditing exports to reviewers.
Establish secure compute environments that balance usability with stringent security controls.
Implement monitoring and KPIs to ensure accountability of data holds and productions.
Work cross-functionally to document SOPs, threat models, and chain-of-custody documentation that can withstand scrutiny.
Ideal Candidates Will:
Possess practical experience in building or operating large-scale data-lake or backup systems (Azure, AWS, GCP).
Be proficient with Terraform or Pulumi, CI/CD processes, and capable of converting ad-hoc legal requests into repeatable pipelines.
Be comfortable working with discovery workflows (legal holds, enterprise document collections, secure review) or eager to quickly gain expertise.
Effectively communicate technical concepts, from storage governance to block-ID APIs, to interdisciplinary teams such as Legal and Engineering.