Machine Learning Infrastructure Engineer at Character | Redwood City, CA

Character, Redwood City, CA
On-site, Full-time

About Character.AI

Character.AI enables individuals to connect, learn, and tell stories through engaging interactive entertainment. With over 20 million monthly visitors, our platform empowers users to unleash their creativity and imagination by interacting with millions of characters and embarking on limitless adventures. In just two years, we reached unicorn status and were recognized as Google Play's AI App of the Year, highlighting our innovative technology and forward-thinking approach. Join us in redefining entertainment and shaping the future of Consumer AI!

About the Role

We are looking for experienced Machine Learning Infrastructure Engineers who excel at designing, building, and maintaining robust training and serving infrastructure for machine learning research.

Key Responsibilities

  • Deliver comprehensive infrastructure support for our machine learning research and product development.
  • Develop tools for diagnosing cluster issues and addressing hardware failures effectively.
  • Oversee deployments, manage experiments, and provide ongoing support for our research activities.
  • Optimize GPU allocation and utilization for both training and serving environments.

Qualifications

  • A minimum of 4 years of experience in supporting infrastructure within machine learning environments.
  • Proven experience in creating diagnostic tools for ML infrastructure issues.
  • Familiarity with cloud platforms and tooling, such as Compute Engine, Kubernetes, and Cloud Storage.
  • Hands-on experience working with GPUs.

Preferred Qualifications

  • Experience managing large GPU clusters and high-performance computing/networking.
  • Experience supporting large language model training.
  • Familiarity with machine learning frameworks like PyTorch, TensorFlow, or JAX.
  • Experience in GPU kernel development.

About Character

Character.AI is at the forefront of interactive entertainment, allowing millions to engage creatively with AI characters. Our rapid growth and accolades, including being recognized as Google Play's AI App of the Year, reflect our commitment to innovation.
