This role is responsible for ensuring the availability, reliability, and operational integrity of mission-critical IT infrastructure including compute, storage, and network systems
Job Summary
This role is responsible for ensuring the availability, reliability, and operational integrity of mission-critical IT infrastructure including compute, storage, and network systems.
The ideal candidate will bring strong operational leadership, hands-on Linux and networking experience, and the ability to manage infrastructure deployments while maintaining strict SLA compliance.
Key responsibilities include leading daily operations of enterprise and colocation data center environments supporting business-critical applications with a focus on 99.99%+ availability.
Matching Summary
This role is responsible for ensuring the availability, reliability, and operational integrity of mission-critical IT infrastructure including compute, storage, and network systems.
Skills & Requirements
Must-have
10-15+ years data center operations experience
Hands-on Linux/Unix production environment skills
Data center networking fundamentals knowledge
99.99%+ availability maintenance and monitoring
High-density rack deployment support (25kW-75kW)
Server provisioning and firmware update management
Nice-to-have
Strong operational leadership capabilities
Root cause analysis and incident management
Experience with colocation service coordination
Ability to manage vendor contracts effectively
Collaboration with facilities and network teams
Key Requirements
Bachelor's degree in computer science or related field
10-15+ years of experience in IT data center operations
Strong hands-on experience with Linux/Unix systems