Nebius is seeking a Technical Product Manager for their Soperator platform, which focuses on managing GPU clusters for ML workloads. This remote position involves owning product direction, conducting customer discovery, and leading cross-team execution within a fast-paced AI infrastructure environment
Job Summary
Nebius is leading a new era in cloud computing to serve the global AI economy, creating tools for customers to solve real-world challenges without massive infrastructure costs.
The role involves owning the full user journey across Soperator clusters, defining product direction end-to-end, and leading deep customer discovery.
We expect strong technical depth in distributed systems, cloud infrastructure, or ML platforms, with hands-on familiarity with large-scale ML training and orchestration tools.
Matching Summary
Match Score: 85
Nebius is seeking a Technical Product Manager for their Soperator platform, which focuses on managing GPU clusters for ML workloads. This remote position involves owning product direction, conducting customer discovery, and leading cross-team execution within a fast-paced AI infrastructure environment.
Skills & Requirements
Must-have
product direction end-to-end
customer discovery and analysis
drive execution across platform teams
open-source strategy and execution
Slurm, Kubernetes, Ray
technically complex products
Nice-to-have
GPU platforms and HPC primitives
modern ML training stacks
efficiency and reliability metrics
large-scale LLM training/inference
customer-facing technical experience
Key Requirements
3-5+ years in Product Management, ML infrastructure/MLOps, distributed systems, or cloud platform engineering
Strong technical depth in distributed systems, cloud infrastructure, or ML platforms
Hands-on familiarity with large-scale ML training and orchestration tools
Track record of shipping technically complex products
Strong communication and stakeholder management
Experience with product analytics, data-informed prioritization, and experimentation