About the job
Join Us at Twelve Labs
At Twelve Labs, we are at the forefront of developing revolutionary multimodal foundation models that interpret videos with human-like understanding. Our cutting-edge models have set new benchmarks in video-language modeling, enhancing our capabilities in analyzing and interacting with diverse media forms.
Backed by over $110 million in Seed and Series A funding, we are supported by prestigious venture capital firms including NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, along with esteemed AI pioneers such as Fei-Fei Li and Silvio Savarese. While our headquarters is in San Francisco, our significant presence in Seoul highlights our dedication to global innovation.
Our strategic partnerships with NVIDIA and AWS provide us with access to top-tier hardware, including the B300s, which empower our advancement in video AI capabilities.
We embrace the unique journeys of every individual and believe that our diverse backgrounds drive innovation. We seek passionate individuals who resonate with our mission and are eager to make impactful contributions as we redefine technology and transform the world of video understanding and multimodal AI.
About the Video Cognition System Team
Our team is dedicated to creating the first-ever video cognition system capable of processing extensive video libraries into a structured, queryable Video Memory & Cortex for vertical LLM agents.
We are addressing fundamental questions surrounding machine cognition, focusing on perception, memory, reasoning, and attention. Our goal is to design innovative memory structures that exceed traditional context windows and build a reasoning cortex for comprehensive video analysis.
Our research endeavors cover corpus-level reasoning, knowledge extraction, indexing architectures, and multi-video understanding, requiring a synergistic collaboration between research and engineering to create systems that are both scientifically rigorous and impactful.
