About the job
Thank you for considering a career at IT Concepts, also known as Kentro, where innovation fuels opportunity and collaboration paves the way to success. We take pride in our vibrant community of professionals who are dedicated to advancing our clients' missions, promoting professional development, and positively impacting the communities we serve.
Joining our team means becoming part of a supportive environment that prioritizes your growth. At Kentro, we strive to drive meaningful change, ignite innovation, and reach exceptional milestones together.
We are currently seeking a Senior Data Engineer to lead a significant data discovery and classification project as part of the Zero Trust initiative for the U.S. Special Operations Command (USSOCOM). In this role, you will uncover and understand 'dark data' within the Command's intricate information landscape, which includes hyperscale cloud data lakes, legacy file shares, and isolated storage across SIPR and Top-Secret networks.
Your responsibilities will include designing and managing the deployment of advanced discovery platforms such as BigID and NetApp BlueXP. You will configure these tools to analyze petabytes of structured (SQL/Oracle), semi-structured (logs/NoSQL), and unstructured (SharePoint/File Shares) data. Your main objective will be to create the 'Global Data Inventory', a real-time map of where sensitive Controlled Unclassified Information (CUI) and classified intelligence reside, so that security teams can apply protections precisely where they are needed. You will leverage your expertise in data pipelines and storage infrastructure to ensure scanning operations achieve full visibility without compromising network performance.
Key Responsibilities:
- Architect and manage data discovery systems, deploying BigID and NetApp BlueXP scanners across hybrid environments, including configuring Dockerized collectors for air-gapped discovery on Top-Secret networks.
- Connect discovery tools to enterprise databases (SQL Server, Oracle, PostgreSQL) to scan for PII, DoD ID numbers, and other sensitive data indicators while maintaining database performance.
- Optimize scans for large file repositories (NetApp NAS, QNAP, SharePoint On-Premises), balancing scan windows and throttling to avoid latency issues for mission users.
- Use Microsoft Purview Data Map and custom connectors to inventory data located in AWS S3 buckets, Azure Blob Storage, and data lakes.
- Work with mission owners to refine Machine Learning (ML) classifiers to identify specific USSOCOM data types (e.g., mission names, operational codes) and minimize false positives in the data inventory.

