Kubernetes Platform Engineer

Hewlett Packard Enterprise

Colorado, United States
Base: $111,500 - $243,000 usd depending on locatio...
Hybrid
Kubernetes platform engineering
Rdma-class networking design
Hpe slingshot/cxi fabric integration
This role leads the technical design for Kubernetes-native RDMA networking to support distributed AI inference workloads on HPC clusters

Job Summary

  • This role leads the technical design for Kubernetes-native RDMA networking to support distributed AI inference workloads on HPC clusters.
  • The engineer will own the end-to-end implementation of exposing RDMA-capable NIC resources using operators and custom APIs without requiring privileged pods or container rebuilds.
  • Hewlett Packard Enterprise offers a hybrid work model with flexibility, comprehensive health benefits, and investment in professional development.

Matching Summary

This role leads the technical design for Kubernetes-native RDMA networking to support distributed AI inference workloads on HPC clusters.

Salary

Base: $111,500 - $243,000 USD depending on location; Bonus/Equity: Variable incentives may be offered; Benefits: Comprehensive suite including health, wellbeing, and professional development programs

Skills & Requirements

Must-have

  • Kubernetes platform engineering
  • RDMA-class networking design
  • HPE Slingshot/CXI fabric integration
  • Dynamic Resource Allocation (DRA)
  • Container Device Interface (CDI)
  • Multus and secondary CNI configuration
  • Custom CRD and API design

Nice-to-have

  • NVIDIA NIM and vLLM experience
  • TensorRT-LLM deployment knowledge
  • KV-cache movement optimization
  • Prefill/decode separation patterns
  • Security-first mindset
  • Cross-domain architectural thinking

Key Requirements

  • Experience with Kubernetes Operators
  • Knowledge of NVIDIA NIM and GPU Operator
  • Understanding of HPC fabric technologies

Work Rights

Not specified

Tailored Resume

Cover Letter