AI Egineer_GPU Performance

MICHAEL PAGE (PERSONNEL) PTE. LTD.

Islandwide, Singapore
Competitive compensation; performance-linked incen...
5 years gpu computing experience
Deep knowledge of gpu architectures
Pytorch distributed model training
The role involves architecting and executing large-scale model training on multi-node, multi-GPU clusters

Job Summary

  • The role involves architecting and executing large-scale model training on multi-node, multi-GPU clusters.
  • Candidates will optimize training and inference performance using advanced distributed strategies like DDP and FSDP.
  • The company offers competitive compensation, performance-linked incentives, and exposure to industry-leading AI technologies.

Matching Summary

Match Score: 85

The role involves architecting and executing large-scale model training on multi-node, multi-GPU clusters.

Salary

Competitive compensation; Performance-linked incentives; Not specified

Skills & Requirements

Must-have

  • 5 years GPU computing experience
  • Deep knowledge of GPU architectures
  • PyTorch distributed model training
  • LLM fine-tuning and inference optimization
  • Advanced C++ and CUDA proficiency

Nice-to-have

  • Experience with DDP and FSDP strategies
  • Knowledge of DeepSpeed and Megatron-LM
  • Background in autonomous AI agent development
  • Understanding of CI/CD pipelines
  • Collaboration with hardware architects

Key Requirements

  • At least 5 years experience in GPU computing
  • Expertise in LLM fine-tuning and GenAI
  • Solid understanding of end-to-end ML systems

Work Rights

Not specified

Tailored Resume

Cover Letter