Job Summary
This role focuses on maintaining 99.99% uptime for high-frequency trading systems that process millions of transactions daily.
The successful candidate will develop real-time monitoring and alerting systems to ensure early detection of issues in a 24x7 global environment.
Candidates must support critical operations across multiple international data centers while collaborating with engineering teams to integrate reliability best practices.
Matching Summary
Match Score: 85
Skills & Requirements
Must-have
5+ years SRE or DevOps experience
Linux and Windows system administration
Scripting in Python, Shell, Perl, and JavaScript
Docker and Kubernetes container technologies
Monitoring with Prometheus, Grafana, and the ELK Stack
CI/CD pipeline maintenance with Jenkins
TCP/IP networking and load balancing
Nice-to-have
Financial services or trading industry experience
AWS, GCP, or Azure cloud platform knowledge
Experience with high-frequency trading systems
Performance optimization for latency-sensitive apps
Incident management frameworks like ITIL
Strong debugging skills across database layers
Key Requirements
Bachelor's degree in Computer Science or Engineering
5+ years of SRE, DevOps, or similar experience
Strong expertise in Linux and Windows administration