42dot logo42dot logo

Machine Learning Platform Engineer for Autonomous Driving

42dotPangyo (Software Dream Center), South Korea
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Experience Level

Experience

Qualifications

Bachelor's degree or higher in Computer Science, Engineering, Robotics, or a similar technical field. A minimum of 5 years of experience in Data Engineering or ML Platform roles. Proficient in Python with solid experience in Python SDK development. Solid working experience in databases such as MongoDB, PostgreSQL, etc. Hands-on experience with data pipeline job orchestration using Databricks Workflows or Apache Airflow, along with integrating data pipelines with machine learning models. Extensive experience with data technologies and architectures such as Data Warehouses (e.g., Hive) or Lakehouses (e.g., Delta Lake). Experience with Apache Spark or other big data computing engines.

About the job

Join Our Innovative Team

At 42dot, we are at the forefront of autonomous driving technology. Our Machine Learning (ML) Platform Engineers are instrumental in developing a robust data platform and machine learning training and evaluation systems that support advanced algorithms for autonomous driving. We focus on creating a scalable, distributed system capable of handling vast datasets, comprising millions of scenes, while also delivering high-performance SDKs for ML model training and evaluation. Our platforms significantly enhance the ML model development lifecycle, covering training, evaluation, deployment, and monitoring within cloud environments.

Key Responsibilities

  • Design and develop a highly scalable and reliable data platform to effectively manage, visualize, search, and serve extensive datasets for ML model training, fine-tuning, and validation.

  • Create advanced autonomous driving data SDK functionalities, including scene data search, dataset preparation, and dataset loading.

  • Establish a data lakehouse for autonomous driving scene datasets, integrating sensor data, calibration data, and annotation data.

  • Identify and resolve performance bottlenecks across data processing pipelines, addressing data processing latency, search latency, and Test Procedure (TP) coverage.

  • Set up and maintain infrastructure components for the data platform, including data processing pipelines, databases, data lakehouses, and data serving mechanisms.

  • Work collaboratively with cross-functional teams, including ML algorithm, ML application, and Cloud Infrastructure teams, to ensure alignment of ML platforms with the overall architecture of the autonomous driving system.

Qualifications

  • A Bachelor's degree or higher in Computer Science, Engineering, Robotics, or a related technical discipline.

  • A minimum of 5 years of experience in Data Engineering or ML Platform roles.

  • Proficiency in Python with substantial experience in Python SDK development.

  • Solid experience with databases such as MongoDB and PostgreSQL.

  • Hands-on experience orchestrating data pipeline jobs using Databricks Workflows or Apache Airflow, along with integrating data pipelines with machine learning models.

  • Extensive knowledge of data technologies and architectures, including Data Warehouses (e.g., Hive) and Lakehouses (e.g., Delta Lake).

  • Experience with Apache Spark or other big data computing engines.

About 42dot

42dot is a pioneering company focused on the development of cutting-edge technologies in autonomous driving. We are committed to innovation and excellence, providing a dynamic workplace for professionals eager to make a significant impact in the field of autonomous vehicle technology.

Similar jobs

Browse all companies, explore by city & role, or SEO search pages.

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.