Systems Engineer, Hpc

Mistral AI

Paris, France
On-site
Strong linux systems administration
Large-scale environments (hpc/cloud)
Job schedulers (e.g. slurm)
Operate and maintain large-scale Linux environments, monitor system health, and support production and research workloads

Job Summary

  • Operate and maintain large-scale Linux environments, monitor system health, and support production and research workloads.
  • Scale clusters toward thousands of nodes, work on petabyte-scale storage, and improve performance and reliability.
  • Automate operational tasks using Python, Bash, Ansible, or Terraform, and contribute to system design and architecture decisions.

Matching Summary

Operate and maintain large-scale Linux environments, monitor system health, and support production and research workloads.

Skills & Requirements

Must-have

  • Strong Linux systems administration
  • Large-scale environments (HPC/cloud)
  • Job schedulers (e.g. Slurm)
  • Troubleshooting systems, hardware, networks

Nice-to-have

  • Containers/orchestration (Kubernetes)
  • Storage systems (Ceph, Lustre, NFS)
  • Networking fundamentals (InfiniBand)
  • Infrastructure as Code/automation
  • GPU or AI/ML experience

Key Requirements

  • Strong Linux systems administration experience
  • Experience in large-scale environments
  • Experience with Job schedulers
  • Solid troubleshooting skills

Work Rights

Not specified

Tailored Resume

Cover Letter