Lead Software Engineer

Armada

Bangalore, India
On-site
Gpu-as-a-service platform architecture
Kubernetes internals and custom operators
Distributed systems design
Lead the design of a globally scalable AI control plane for GPU, storage, and network orchestration

Job Summary

  • Lead the design of a globally scalable AI control plane for GPU, storage, and network orchestration.
  • Architect hard isolation strategies across kernel, hypervisor, and hardware layers.
  • Define platform SLOs, capacity planning models, and GPU availability targets.

Matching Summary

Lead the design of a globally scalable AI control plane for GPU, storage, and network orchestration.

Skills & Requirements

Must-have

  • GPU-as-a-Service platform architecture
  • Kubernetes internals and custom operators
  • Distributed systems design
  • Hard isolation strategies (IOMMU, SR-IOV)
  • Zero-trust networking principles
  • RDMA, GPUDirect, RoCE v2 optimization
  • VXLAN and BGP-EVPN networking

Nice-to-have

  • Leading AI platform development
  • Mentoring engineering teams
  • Cross-functional collaboration
  • Building infrastructure at scale

Key Requirements

  • Highly experienced Lead Software Engineer
  • Experience with distributed systems
  • Experience with Kubernetes internals
  • Experience with GPU infrastructure

Work Rights

Not specified

Tailored Resume

Cover Letter