Solutions Architect, Inference Deployments

NVIDIA

Base: 152,000 usd - 241,500 usd; bonus/equity: not...
Experience with nvidia dynamo
Kubernetes orchestration
Deploying distributed systems
You will collaborate closely with engineering, DevOps, and customers to develop enterprise AI solutions

Job Summary

  • You will collaborate closely with engineering, DevOps, and customers to develop enterprise AI solutions.
  • The role involves building inference pipelines and mentoring teams in deploying disaggregated inference systems.
  • NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer.

Matching Summary

You will collaborate closely with engineering, DevOps, and customers to develop enterprise AI solutions.

Salary

Base: 152,000 USD - 241,500 USD; Bonus/Equity: Not specified; Benefits: Not specified

Skills & Requirements

Must-have

  • Experience with NVIDIA Dynamo
  • Kubernetes orchestration
  • Deploying distributed systems

Nice-to-have

  • Understanding of transformer neural networks
  • Contributions to open-source projects
  • NVIDIA Certified AI Engineer

Key Requirements

  • 5+ years in Solutions Architecture
  • Experience with TensorRT-LLM
  • BS in CS/Engineering or equivalent

Work Rights

Not specified

Tailored Resume

Cover Letter