HPC AI TECHNOLOGY PTE. LTD. is seeking an LLM Optimization Engineer to enhance parallel computing strategies and optimize training and inference frameworks. Candidates should possess strong programming skills in Python and C++, with significant experience in high-performance computing concepts and frameworks.
Job Summary
The role involves designing and implementing efficient parallel computing strategies to increase end-to-end throughput and reduce latency.
Candidates must have solid experience with PyTorch, including a deep understanding of model execution workflows and computation graph mechanisms.
The position requires familiarity with high-performance computing concepts such as operator fusion and memory hierarchy.