LLM Optimization Engineer

HPC AI TECHNOLOGY PTE. LTD.

Singapore, Singapore
Not specified
Proficiency in Python and C++
Strong foundations in data structures
Experience with PyTorch frameworks
HPC AI TECHNOLOGY PTE. LTD. is seeking an LLM Optimization Engineer to enhance parallel computing strategies and optimize training and inference frameworks. Candidates should possess strong programming skills in Python and C++, with significant experience in high-performance computing concepts and frameworks.

Job Summary

  • The role involves designing and implementing efficient parallel computing strategies to improve end-to-end throughput and latency.
  • Candidates must have solid experience with PyTorch, including a deep understanding of model execution workflows and computation graph mechanisms.
  • The position requires familiarity with high-performance computing concepts such as operator fusion and memory hierarchy.

Skills & Requirements

Must-have

  • Proficiency in Python and C++
  • Strong foundations in data structures
  • Experience with PyTorch frameworks
  • Knowledge of HPC concepts
  • Understanding of accelerator architectures

Nice-to-have

  • Experience with vLLM and SGLang
  • KV cache optimization techniques
  • Low-precision computation experience
  • Productionizing large model systems
  • Attention optimization skills

Key Requirements

  • Proficiency in Python and C++
  • Solid experience with PyTorch
  • Familiarity with GPU/NPU architectures

Work Rights

Not specified