About the job
DeepSource is seeking a Monitoring & Observability Engineer in New Delhi to help maintain the reliability and performance of our private cloud platform. The role supports 24x7 operations as part of a team that monitors over 100 applications and more than 1,000 infrastructure assets. The position is essential for meeting strict SLA requirements and ensuring high availability across our systems.
What You Will Do
- Full-Stack Observability: Use Dynatrace APM to monitor application performance, trace transactions, and analyze user experience for more than 100 applications and 12 databases.
- Infrastructure Management: Use ManageEngine to monitor 100 network devices and over 1,000 hosts, tracking key health metrics such as CPU usage, latency, and packet loss.
- Dependency Mapping: Use Dynatrace Smartscape to map relationships between digital assets and support impact analysis during incidents.
- 24x7 NOC Operations: Provide continuous alert triage and manage escalations as part of a rotating shift schedule.
- Performance Governance: Build real-time dashboards and write detailed Root Cause Analysis (RCA) reports to drive ongoing service improvements.
Location
This position is based in New Delhi, Delhi, India.
