HPC Cloud Engineer

Trzdev28

Barcelona, Spain
Develop, deploy, and operate HPC clusters on AWS
Site reliability engineering for HPC services
Monitor cloud spend for efficiency

Trzdev28 is seeking an HPC Cloud Engineer to work in Barcelona, focusing on developing and managing AWS-based high-performance computing services to support scientific applications. The role requires strong technical skills in cloud infrastructure and a collaborative approach to problem-solving, with an emphasis on communication and efficiency.

Job Summary

  • Develop, deliver and operate high-performance computing clusters and applications on AWS, taking a Site Reliability Engineering approach.
  • Continuously monitor cloud spend to keep it efficient and cost-effective, and keep the cloud HPC infrastructure up to date and aligned with security standards.
  • Join a supportive team of equals that values ownership, curiosity and practical problem solving, building robust platforms and turning complexity into clarity.

Matching Summary

Match Score: 75

Skills & Requirements

Must-have

  • Develop, deploy, and operate HPC clusters on AWS
  • Site Reliability Engineering for HPC services
  • Monitor cloud spend for efficiency
  • Linux environment administration and programming
  • Terraform infrastructure-as-code
  • Python programming and bash scripting
  • DevOps team experience and agile methodologies

Nice-to-have

  • Scientific degree or experience
  • Computational analysis of scientific data
  • GPU, AI/ML tools and frameworks
  • Parallel programming techniques
  • Expertise with workflow engines
  • Familiarity with container runtimes
  • Regression tests and benchmarks for HPC

Key Requirements

  • Experience with large scale AWS infrastructure
  • Knowledge of the AWS Well-Architected Framework
  • Experience migrating HPC workloads to cloud
  • AWS Certified Solutions Architect (desirable)
  • Experience with SLURM administration (desirable)
  • Experience with EasyBuild or Spack (desirable)
  • Experience with ReFrame HPC (desirable)

Work Rights

Not specified
