Solutions Architect, Inference Deployments

Invidia

CA, United States
Base: 152,000 usd - 241,500 usd; bonus/equity: eli...
Ai inference workloads on kubernetes
Nvidia gpu orchestration
Distributed systems deployment
We’re forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA’s GPU technology and Kubernetes

Job Summary

  • We’re forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA’s GPU technology and Kubernetes.
  • As a Solutions Architect focused on inference, you’ll collaborate closely with our engineering, DevOps, and customers to develop enterprise AI solutions.
  • Your base salary will be determined based on your location, experience, and the pay of employees in similar positions, with eligibility for equity and benefits.

Matching Summary

We’re forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA’s GPU technology and Kubernetes.

Salary

Base: 152,000 USD - 241,500 USD; Bonus/Equity: Eligible for equity; Benefits: Eligible for benefits

Skills & Requirements

Must-have

  • AI inference workloads on Kubernetes
  • NVIDIA GPU orchestration
  • Distributed systems deployment
  • Model optimization and serving
  • Low-latency inference tuning

Nice-to-have

  • NVIDIA inference technology deployment
  • Transformer neural network understanding
  • Inference acceleration technologies
  • Open-source project contributions
  • Technical mentorship and leadership

Key Requirements

  • 5+ years in Solutions Architecture
  • Experience with NVIDIA Dynamo, Triton, or TensorRT-LLM
  • BS in CS/Engineering or equivalent experience

Work Rights

Not specified

Tailored Resume

Cover Letter