Cerebras Systems logoCerebras Systems logo

Senior Deployment Engineer - AI Inference

Cerebras SystemsRemote Office; Sunnyvale, CA; Toronto, Ontario, Canada
Remote Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Experience Level

Senior

Qualifications

Qualifications:Proven experience in deploying AI systems, particularly in inference scenarios. Strong understanding of machine learning frameworks and algorithms. Proficiency in programming languages such as Python, C++, or similar. Experience with cloud platforms and deployment pipelines. Excellent problem-solving skills and the ability to work in a fast-paced environment.

About the job

Cerebras Systems is at the forefront of AI technology, creating the world's largest AI chip, which is 56 times larger than traditional GPUs. Our innovative wafer-scale architecture delivers the computational power equivalent to dozens of GPUs, all on a single chip, providing simplicity in programming akin to a single device. This revolutionary design allows Cerebras to achieve unparalleled training and inference speeds, enabling machine learning users to execute large-scale ML applications without the complexities of managing multiple GPUs or TPUs.

Cerebras is proud to serve an impressive array of clients, including leading model laboratories, global corporations, and pioneering AI-driven startups. Recently, OpenAI announced a multi-year partnership with Cerebras, aiming to deploy 750 megawatts of scale and revolutionizing workflows with ultra-fast inference capabilities.

Our wafer-scale architecture has positioned Cerebras Inference as the fastest solution available for Generative AI, boasting speeds over 10 times faster than GPU-based hyperscale cloud inference services. This significant enhancement is redefining the user experience in AI applications, facilitating real-time iterations and amplifying intelligence through additional agentic computation.

About Cerebras Systems

Cerebras Systems is pioneering AI computing with the development of the largest AI chip globally, leveraging a unique wafer-scale architecture that simplifies machine learning deployment. Our technology is transforming industries by enabling faster and more efficient AI applications, and we are committed to empowering our clients with cutting-edge solutions.

Similar jobs

Browse all companies, explore by city & role, or SEO search pages.

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.