Senior Engineering Manager, Ai Runtime

Databricks

Mountain View, California, US
$228,600—$314,250 usd; not specified; not specifie...
On-site
Ai runtime product ownership
Distributed training orchestration
Managed gpu training at scale
Lead, mentor, and grow a high-performing engineering team responsible for the Custom Training product and its foundational infrastructure

Job Summary

  • Lead, mentor, and grow a high-performing engineering team responsible for the Custom Training product and its foundational infrastructure.
  • Define and own the product and technical roadmap for AIR, balancing customer experience, functionality, and foundational investments.
  • Collaborate closely with product, research, platform, infrastructure teams, and customers to drive end-to-end delivery, from ideation and prioritization to launch and operation.

Matching Summary

Lead, mentor, and grow a high-performing engineering team responsible for the Custom Training product and its foundational infrastructure.

Salary

$228,600—$314,250 USD; Not specified; Not specified

Skills & Requirements

Must-have

  • AI Runtime product ownership
  • distributed training orchestration
  • managed GPU training at scale
  • customer-facing capabilities
  • observability and reliability practices

Nice-to-have

  • accelerating medical breakthroughs
  • solving tough problems
  • driving end-to-end delivery
  • leading through ambiguity

Key Requirements

  • 8+ years software engineering
  • 3+ years engineering management
  • Track record building/operating managed GPU training
  • Deep familiarity with distributed training frameworks
  • Experience with training resilience patterns
  • Understanding of GPU performance fundamentals
  • Experience building platform products with SLAs
  • BS/MS in Computer Science or related field

Work Rights

Not specified

Tailored Resume

Cover Letter