Not specified (assumed to be hybrid based on job nature).
Job Summary
The ByteDance Large Model Team is committed to developing the most advanced AI large model technology in the industry.
You will build massively distributed ML training and inference systems integrating GPU/NPU/RDMA/Storage components.
The role offers a positive team atmosphere, competitive compensation, meal allowances, and flexible hours.
Matching Summary
Match Score: 85
ByteDance is seeking a Site Reliability Engineer for its Machine Learning Systems team in Singapore. The role focuses on developing and maintaining high-performance, reliable ML systems, requiring strong technical skills in programming, Kubernetes, and distributed systems.
Salary
Competitive compensation; meal allowance provided; paid leave included
Skills & Requirements
Must-have
Bachelor's degree in Computer Science or related field
Proficiency in Go, Python, or Shell programming
Kubernetes and container operation experience
Linux environment operations and maintenance
Nice-to-have
Experience with large-scale ML distributed systems