About the job
About Twenty
At Twenty, we tackle one of the most significant challenges of our era: safeguarding democracies in a digital world. Our innovative technologies bridge the gap between cyber and electromagnetic domains, where operational speed outpaces human perception and complexity defies traditional boundaries. Our team is not merely about problem-solving; we create transformative outcomes that have a direct influence on national security. We are pragmatic optimists, fully aware that while our mission to protect America and its allies is daunting, success is within reach.
Position Overview
As a Staff Data Engineer, you will be responsible for the data infrastructure that underpins Twenty’s cyber operations applications and capabilities. This role focuses on constructing a robust, high-performance data lake alongside the pipelines, schemas, and query structures required to manage petabyte-scale datasets efficiently. You will collaborate closely with engineers and intelligence analysts to convert disorganized, high-volume operational data into dependable, well-structured systems that drive critical missions. Additionally, you will spearhead technical projects and mentor fellow engineers as we expand our capabilities and deliverables.
Who You Are
- You possess a systems-oriented mindset, understanding how data modeling, storage formats, compute engines, and access patterns interconnect.
- You hold strong opinions on schema and index design and can articulate trade-offs effectively.
- You prioritize measurable reliability, focusing on data quality, lineage, repeatability, and operational excellence.
- You thrive in ambiguous situations, adeptly handling evolving datasets and requirements without compromising standards.
- You collaborate effectively across roles, especially with engineers and analysts who require prompt, accurate insights.
- You take pride in leadership, mentoring others, elevating standards, and driving initiatives to completion.
- You are driven by outcomes that enhance national security and desire your work to have a meaningful impact.
Key Responsibilities
- Lead the design and management of a data lake for cyber operations and intelligence data.
- Develop schemas, partitions, and indexes that optimize performance and cost-effectiveness for complex datasets.
- Work alongside engineers and intelligence analysts to establish query patterns and data products tailored for mission applications.
- Create and enhance ETL pipelines that are observable, recoverable, and resilient to changes in upstream data.
