Nebius is seeking a Senior HPC Cluster Engineer to enhance its hyperscaler platform, with a focus on GPU computing and InfiniBand networks. The ideal candidate has extensive experience in system-level software development, Linux systems, and performance optimization, and will play a central role in improving the infrastructure for high-performance computing.
### Job Summary
Nebius is leading a new era in cloud computing to serve the global AI economy, creating tools and resources for customers to solve real-world challenges.
The Senior HPC Cluster Engineer will be responsible for tuning the performance of GPU clusters and InfiniBand networks, analyzing and troubleshooting issues, and integrating new hardware.
The role involves enhancing automation systems for proactive monitoring and resolving issues in GPU and InfiniBand environments, ensuring high performance and security in multi-GPU, HPC environments.