Senior High Performance Computing System Administrator

weke.yale.edu

New Haven, Connecticut, US
$90,000.00 - $165,750.00; not specified; not speci...
Hybrid (minimum of 2 days onsite)
Gpu infrastructure enhancements
Hpc linux cluster administration
High-speed networking (infiniband/ethernet)
Yale University is seeking a Senior High Performance Computing System Administrator to enhance its AI-focused HPC infrastructure, supporting faculty and students in research initiatives. The role requires expertise in GPU systems, Linux cluster management, and high-speed networking, with a preference for candidates with relevant experience in a research environment

Job Summary

  • The Yale Center for Research Computing (YCRC) is seeking a versatile system administrator/engineer to enhance the AI HPC infrastructure for faculty and students.
  • This role involves leading the system design, deployment, and support of YCRC’s AI-focused research cluster and storage infrastructure, with a focus on GPU enhancements.
  • The position is hybrid, requiring a minimum of two days per week on site, with infrastructure hosted at Yale data centers and the MGHPCC.

Matching Summary

Match Score: 85

Yale University is seeking a Senior High Performance Computing System Administrator to enhance its AI-focused HPC infrastructure, supporting faculty and students in research initiatives. The role requires expertise in GPU systems, Linux cluster management, and high-speed networking, with a preference for candidates with relevant experience in a research environment.

Salary

$90,000.00 - $165,750.00; Not specified; Not specified

Skills & Requirements

Must-have

  • GPU infrastructure enhancements
  • HPC Linux cluster administration
  • High-speed networking (InfiniBand/Ethernet)
  • Large storage systems and parallel file systems
  • Linux system administration expertise
  • Automation and scripting

Nice-to-have

  • Multi-node GPU system support
  • HPC system customization
  • Research environment support
  • Data-center hardware troubleshooting
  • Computer security expertise

Key Requirements

  • Minimum six years of related work experience
  • Bachelor's Degree in a related field or equivalent experience
  • Experience with accelerators such as GPUs for AI
  • Expertise in HPC Linux cluster administration
  • Expertise in Linux system administration

Work Rights

Not specified

Tailored Resume

Cover Letter