Provide technical leadership to HPC engineering staff and guide architectural and operational decisions for the HPC environment
Job Summary
Provide technical leadership to HPC engineering staff and guide architectural and operational decisions for the HPC environment.
Design, deploy, and maintain the university’s high-performance computing cluster, including configuring the workload scheduler and architecting quality-of-service policies.
Administer Linux systems, deploy new GPUs for research and teaching, troubleshoot complex issues, and support AI workloads.
Matching Summary
Provide technical leadership to HPC engineering staff and guide architectural and operational decisions for the HPC environment.
Skills & Requirements
Must-have
Linux systems administration
HPC environment management
Workload scheduler configuration
GPU deployment and support
AI workload deployment
Nice-to-have
Technical leadership
Vendor evaluation
Advanced technical support
Key Requirements
Bachelor's degree in Computer Science
3 years of experience in Linux systems administration
Experience with RHEL/CentOS Linux
Experience with parallel computing and batch/scheduling systems