Member of Technical Staff - Pretraining at Hark | San Jose

Hark, San Jose
On-site, Full-time

Responsibilities

Lead research and development efforts in large-scale LLM and multimodal pretraining, concentrating on enhancing model performance through superior data, scaling, and architectural innovations.
Design and refine data pipelines for pretraining, encompassing large-scale data curation, filtering, deduplication, and synthetic data generation.
Create and implement effective training strategies for foundational models, including distributed training, scaling laws, and optimization techniques.
Enhance pretraining infrastructure, focusing on training systems, data pipelines, and computational efficiency.
Develop evaluation frameworks and internal benchmarks to assess pretraining advancements and model capabilities.
Collaborate with research and engineering teams to push the boundaries of foundational model performance and scalability.

Requirements

Demonstrated success in improving large-scale neural network performance through advancements in pretraining data, modeling, or training systems.
Extensive experience with large-scale distributed training frameworks such as Megatron, DeepSpeed, or similar.
...

About the job

About Hark

Hark is a pioneering artificial intelligence company dedicated to creating advanced and personalized intelligence systems. Our focus is on building proactive, multimodal AI capable of engaging with the world through speech, text, vision, and persistent memory.

We are merging this intelligence with cutting-edge hardware to establish a universal interface between humans and machines. While current AI largely relies on chat boxes and outdated devices, Hark is at the forefront of developing the next generation of agentic systems that interact naturally with users and their environment.

To achieve our ambitious goals, we are developing multimodal models alongside next-generation AI hardware, designed from the ground up as a single, integrated interface for a new era of intelligent systems.

About the Role

The Omni team at Hark is revolutionizing AI experiences beyond text, focusing on enabling models to comprehend and generate content across various modalities, including text, audio, and vision. Our mission is to create seamless, real-time multimodal intelligence that enhances intuitive and immersive user experiences.

As a key member of the Omni team, you will be responsible for developing large-scale pretraining systems and foundational models. This entails working across the entire stack, from data curation and large-scale training infrastructure to model architecture and optimization. You will significantly contribute to advancing the core capabilities of our models through extensive pretraining efforts.

About Hark

Hark is committed to pioneering the future of AI by creating intelligent systems that can seamlessly interact with the world. Our innovative approach combines advanced AI technologies with next-generation hardware, ensuring that our solutions are at the forefront of the artificial intelligence revolution.
