About the job
About Exegy
Exegy stands at the forefront of intelligent market data solutions, advanced trading systems, and resilient technology. As a trusted partner to a diverse ecosystem encompassing buy-side and sell-side firms, exchanges, and financial services technology companies globally, Exegy offers world-class support and managed services from its headquarters in St. Louis, with regional offices across North America, Europe, and the Asia Pacific.
Job Overview
We are on the lookout for a driven Production Support & Monitoring Engineer who will play a pivotal role in ensuring the reliability, efficiency, and availability of our production systems. This critical position involves real-time monitoring of essential systems, swift incident resolution, and proactive implementation of measures to uphold operational excellence. You will work closely with various internal teams to tackle technical challenges and enhance production environments. The ideal candidate will possess robust technical prowess, exemplary problem-solving abilities, and a proactive approach to minimize downtime while boosting system performance.
Join us in this exciting opportunity to make a significant impact in a leading global market data solutions provider.
Key Responsibilities
Oversee production systems and infrastructure to guarantee uptime and meet performance targets.
Identify, diagnose, and resolve production issues in real time to mitigate service disruptions.
Manage incident responses, including escalation, root cause analysis, and post-incident reporting.
Collaborate with engineering teams to create and deploy monitoring tools, alert systems, and automated recovery methods.
Analyze system logs, metrics, and trends to proactively detect potential risks or issues.
Execute software deployments, configuration adjustments, and system upgrades with minimal service interruption.
Maintain and enhance operational runbooks, escalation protocols, and industry best practices.
Foster continuous improvement by identifying opportunities for process optimization and operational efficiency.
Participate in an on-call rotation to provide 24/7 support for production systems.

