Kaishi Partners Pte. Ltd. is seeking an Infrastructure Engineer with expertise in GPU, Kubernetes, and distributed systems to support their AI infrastructure initiatives in Singapore. The role involves designing and scaling systems that enable high-performance computing for AI applications, offering an opportunity to work in a collaborative, technical environment
Job Summary
The role involves building and scaling Kubernetes orchestration for a rapidly growing multi-million-dollar H200 GPU cluster.
Candidates will design distributed infrastructure powering large-scale AI workloads including training, inference, and crawling.
The team offers high ownership to shape foundational systems within a highly technical, low-ego engineering culture.
Matching Summary
Match Score: 85
Kaishi Partners Pte. Ltd. is seeking an Infrastructure Engineer with expertise in GPU, Kubernetes, and distributed systems to support their AI infrastructure initiatives in Singapore. The role involves designing and scaling systems that enable high-performance computing for AI applications, offering an opportunity to work in a collaborative, technical environment.
Salary
Competitive compensation; Meaningful equity upside; Not specified
Skills & Requirements
Must-have
Large-scale Kubernetes production experience
GPU cluster design and operation
Distributed compute system architecture
Cloud batch processing systems
Observability and reliability engineering
Nice-to-have
Experience with Ray or distributed batch systems
Optimizing GPU utilization and scheduling
AWS infrastructure at scale exposure
AI/ML infrastructure environment familiarity
Key Requirements
Strong hands-on experience with Kubernetes in production environments
Background in high-performance engineering environments
Experience designing large-scale infrastructure systems