About the job
Join the innovative engineering team at Chainalysis, where we tackle the most challenging technical problems and create products that foster trust in the cryptocurrency space. With a global presence across Denmark, the UK, Canada, and the USA, we embrace the complexity of our work and collaborate with exceptionally talented colleagues. Our industry is constantly evolving, and our mission is to develop user-centric products backed by a flexible and scalable data platform that responds to rapid changes and delivers value to our customers.
Chainalysis is recognized as the leading provider of blockchain investigation and compliance software, playing a pivotal role in dismantling terrorist financing operations, thwarting major ransomware campaigns, and identifying the Twitter hackers, among other significant achievements.
We are in the process of developing the data platform for blockchain, cryptocurrency, and web3 technologies. We are seeking experienced Senior Data Engineers who are eager to undertake impactful projects and thrive in a large-scale development environment!
Key Responsibilities:
Design and implement cloud-native data ingestion and aggregation processes that handle terabytes of data daily.
Optimize high-performance Spark jobs in Python for detecting market manipulation, fraud, and behavioral patterns.
Enhance and maintain both batch and real-time streaming applications processing billions of records each day.
Architect and sustain scalable data lakehouse environments using technologies such as Parquet, Iceberg, and Delta Lake.
Collaborate on building robust API services on AWS that efficiently interface with our data layer and serve thousands of requests per second.
Assist in modernizing our data stack to achieve a tenfold increase in operational capacity, advancing towards automated, serverless architectures.
Identify and resolve data quality issues and performance bottlenecks within production environments.
