Hark logoHark logo

Technical Staff Member - Multimodal AI

HarkSan Jose
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Experience Level

Experience

Qualifications

Responsibilities Advance research and development initiatives to enhance real-time multimodal intelligence, focusing on audio, video, and world modeling. Enhance data quality for large-scale multimodal training through the development of data filtering, curation, and synthetic data generation techniques. Create evaluation frameworks and internal benchmarks to assess model capabilities, reliability, and user experience across various modalities. Design and implement effective algorithms and training strategies to achieve cutting-edge performance in multimodal foundation models. Work closely with product and engineering teams to translate research breakthroughs into impactful AI solutions. Requirements Demonstrated success in leading research that significantly enhances neural network capabilities via advancements in data, modeling, or training methodologies. Extensive experience in data-driven experimentation, systematic analysis, and iterative debugging of models. Proven track record of building or engaging with large-scale multimodal AI systems.

About the job

About Hark

Hark is at the forefront of artificial intelligence, dedicated to creating sophisticated, personalized solutions that are proactive and multimodal. Our technology interacts with the world through speech, text, vision, and persistent memory.

We are integrating this intelligence with cutting-edge hardware to establish a universal interface for human-machine interaction. Unlike existing AI that primarily relies on chat boxes and outdated devices, Hark is pioneering the future with agentic systems that engage naturally with users and their environments.

To realize this vision, we are innovating multimodal models alongside next-generation AI hardware, purposefully designed as a cohesive interface for a new era of intelligent systems.

About the Role

The Omni team at Hark is developing the next generation of AI experiences that extend beyond traditional text-based interactions. Our aim is to enable models that comprehend and generate content across diverse modalities, including text and vision, fostering seamless and immersive user experiences.

As a member of the Omni team, you will play a pivotal role in advancing real-time audio, video, and multimodal world models. This position encompasses working across the full technology stack, from data management and modeling to training, serving, and product integration. You will be instrumental in both pretraining and posttraining initiatives, collaborating closely with product teams to enhance model capabilities and deliver outstanding, end-to-end user experiences.

About Hark

Hark is an innovative artificial intelligence firm dedicated to developing advanced, personalized intelligent systems. By merging cutting-edge technology with next-generation hardware, Hark is paving the way for a new era of seamless human-machine interaction.

Similar jobs

Browse all companies, explore by city & role, or SEO search pages. View directory listings: all jobs, search results, location & role pages.

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.