Senior Staff AI Platform Engineer

NVIDIA

Job Summary

  • Define and lead AI-native infrastructure roadmaps and cross-organizational initiatives.
  • Architect and scale LLM/ML infrastructure across cloud-native clusters and on-premises hardware.
  • Drive AI-assisted engineering practices and mentor engineers to foster an AI-first culture.

Salary

Base: 168,000 USD - 322,000 USD; Bonus/Equity: Not specified; Benefits: Not specified

Skills & Requirements

Must-have

  • LLM/ML infrastructure scaling
  • Kubernetes and bare-metal infrastructure
  • Observability for AI workloads
  • Python and systems language expertise
  • Distributed systems debugging

Nice-to-have

  • AI-assisted development tools
  • AI supply chain security
  • AI-specific threat models
  • Structured, automation-first approach
  • AI-first engineering practices

Key Requirements

  • 10+ years in cloud, platform, or SRE roles
  • Bachelor's degree or equivalent experience
  • Strong Python and at least one systems language
  • Deep experience building and scaling distributed systems
  • Strong observability design
  • Hands-on experience operating AI/ML platforms
  • Experience with infrastructure and application security practices

Work Rights

Not specified
