Anyscale is democratizing distributed computing by commercializing the open-source Ray project to make scalable machine learning accessible to all developers
Job Summary
Anyscale is democratizing distributed computing by commercializing the open-source Ray project to make scalable machine learning accessible to all developers.
The role involves designing and optimizing critical infrastructure components that power both the control plane and data plane for large-scale AI workloads.
Candidates will collaborate with leading experts to build seamless integrations between open-source tools and proprietary products while ensuring high reliability and performance.
Matching Summary
Anyscale is democratizing distributed computing by commercializing the open-source Ray project to make scalable machine learning accessible to all developers.
Skills & Requirements
Must-have
3+ years production code experience
Kubernetes and container orchestration expertise
Cloud-native infrastructure on AWS Azure GCP
Proficiency in Go and Python programming
Deep understanding of Linux kernel and networking
Nice-to-have
Experience with Prometheus and Grafana observability
Knowledge of GPU and TPU accelerator integration
Familiarity with open-source Ray ecosystem
Background in distributed systems architecture
Ability to participate in on-call support rotations
Key Requirements
Bachelor's degree in Computer Science or equivalent practical experience
Hands-on experience building highly available distributed systems
Expertise in cloud-native technologies and Kubernetes deployments