Graphcore logoGraphcore logo

Senior Principal Network Engineer

GraphcoreAustin, Texas, United States
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Experience Level

Senior

Qualifications

QualificationsProven experience in designing and managing hyperscale or HPC data center networks. Strong knowledge of high-speed Ethernet fabrics and RDMA technologies. Experience with advanced automation and telemetry systems. Ability to collaborate effectively across multiple teams. Excellent problem-solving skills and a passion for innovative technology.

About the job

About Us

Graphcore stands at the forefront of innovation in Artificial Intelligence computing, pioneering hardware, software, and systems infrastructure essential for the future of AI advancements. Our mission is to facilitate the widespread adoption of AI across various industries, unlocking unprecedented capabilities in technology.

As a proud member of the SoftBank Group, Graphcore is part of an elite consortium of companies driving transformative technological solutions. We share a visionary goal: to enable Artificial Super Intelligence and ensure its benefits are accessible to all.

Our team comprises individuals from diverse backgrounds, contributing a rich blend of expertise ranging from AI research to silicon design and systems architecture. At Graphcore, we cultivate a vibrant culture of continuous learning and relentless innovation.

Job Summary

We are in search of a Senior Principal Network Engineer to lead the design, deployment, and optimization of next-generation AI data center networks. This role is crucial as AI training and inference workloads demand exceptionally high bandwidth, deterministic low latency, and a zero-packet-loss networking environment.

In this capacity, you will work closely with the Network Architecture Lead to create and scale high-performance computing (HPC) network fabrics that support GPU clusters. Collaborating across hardware, networking, and AI application layers, you will ensure that Graphcore’s large-scale AI infrastructure operates at its optimal performance.

The ideal candidate will possess extensive experience in managing hyperscale or HPC data center networks, with a strong command of high-speed Ethernet fabrics, RDMA technologies, advanced automation, and telemetry systems.

The Team

The Data Center Network Engineering team is responsible for designing and operating high-performance network fabrics that power Graphcore’s AI compute platforms. This team works in close collaboration with hardware engineering, AI researchers, and infrastructure teams to develop scalable networking environments tailored for distributed training and inference workloads.

Engineers on this team are engaged in cutting-edge projects involving high-speed Ethernet fabrics, lossless networking, RDMA transport, and large-scale automation frameworks to support next-generation AI clusters.

Responsibilities and Duties

  • Define ultra-high-bandwidth, non-blocking AI network fabrics (Clos spine-leaf-super-spine architectures) for large-scale distributed AI workloads.
  • Optimize performance of lossless Ethernet fabrics using congestion control mechanisms such as PFC, ECN, and DCQCN to support RDMA/RoCEv2 communication.
  • Lead initiatives to implement NetDevOps practices and develop automation for provisioning and configuration.

About Graphcore

Graphcore is a leading innovator in Artificial Intelligence computing, dedicated to advancing hardware and software technologies that facilitate the next generation of AI breakthroughs. As part of the SoftBank Group, we are committed to ensuring that the benefits of AI are shared by all, fostering a diverse and collaborative team culture that champions continuous learning and innovation.

Similar jobs

Browse all companies, explore by city & role, or SEO search pages.

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.