About the job
POS-4092
HubSpot’s Data Hub serves as a core platform for RevOps, marketing, sales, and customer teams. This system collects data from multiple sources, improves its quality, and ensures it is accessible throughout HubSpot. As a result, teams can run AI-powered demand generation, strategic campaigns, automation, and accurate reporting, without needing deep data expertise.
The Data Hub engineering group sits at the intersection of data engineering, machine learning, applied AI, and market strategy. The team emphasizes reliability, cost efficiency, scalability, and a strong user experience for marketers and sales professionals.
Role overview
HubSpot is seeking a Lead Principal Software Engineer to shape the next generation of the Data Hub. This platform is essential to supporting data-driven demand generation across the business.
What you will do
- Lead the development of data lake and analytics infrastructure, working with technologies such as Iceberg and Spark, and building both batch and streaming data pipelines to support large-scale segmentation, scoring, and demand generation.
- Design and improve data systems with a focus on cost, latency, data freshness, and reliability, making informed trade-offs using principles like the CAP theorem, efficient partitioning, and optimized storage layouts.
- Collaborate with product managers, analytics teams, and go-to-market leaders to deliver solutions that drive business impact, such as enhanced lead scoring, funnel visibility, audience creation, and campaign attribution for marketing and sales.
- Advance the Data Hub as an AI-centric platform by automating delivery of curated, evergreen datasets to AI agents and reporting tools, minimizing manual data work.
What success looks like
This role is suited to a hands-on engineer who works closely with product and design partners. The position also expects a strong influence on the broader engineering organization, helping others to deliver at scale.
Location
Remote - USA
