AI Inference Engineer

HPC AI TECHNOLOGY PTE. LTD.

D01 Cecil, Marina, People’s Park, Raffles Place, 16 COLLYER QUAY COLLYER QUAY CENTRE 049318
Sgd 6,500 - 10,000 / monthly pm
On-site
Cuda
Ai accelerator
HPC AI Technology Pte. Ltd. is seeking an AI Inference Engineer to develop, optimize, and maintain high-performance inference services for large language models and multimodal models. The role requires expertise in GPU optimization, AI infrastructure development, and distributed training systems, with a focus on delivering low-latency AI services for a large user base

Job Summary

  • Job Description We are looking for a highly skilled engineer to build, optimize, and maintain high-performance inference services for large language models (LLMs) and multimodal models

Matching Summary

Match Score: 85

HPC AI Technology Pte. Ltd. is seeking an AI Inference Engineer to develop, optimize, and maintain high-performance inference services for large language models and multimodal models. The role requires expertise in GPU optimization, AI infrastructure development, and distributed training systems, with a focus on delivering low-latency AI services for a large user base.

Salary

SGD 6,500 - 10,000 / Monthly

Skills & Requirements

Must-have

  • CUDA
  • AI Accelerator

Nice-to-have

  • Operations Performance
  • Kubernetes
  • Data Structures
  • Full Stack Development
  • PyTorch
  • High Performance Computing
  • Performance Tuning
  • Systems Programming
  • Training
  • Electrical Engineering

Key Requirements

  • Minimum 1 years experience

Work Rights

Tailored Resume

Cover Letter