We’re forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA’s GPU technology and Kubernetes
Job Summary
We’re forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA’s GPU technology and Kubernetes.
As a Solutions Architect focused on inference, you’ll collaborate closely with our engineering, DevOps, and customers to develop enterprise AI solutions.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions, with eligibility for equity and benefits.
Matching Summary
We’re forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA’s GPU technology and Kubernetes.
Salary
Base: 152,000 USD - 241,500 USD; Bonus/Equity: Eligible for equity; Benefits: Eligible for benefits
Skills & Requirements
Must-have
AI inference workloads on Kubernetes
NVIDIA GPU orchestration
Distributed systems deployment
Model optimization and serving
Low-latency inference tuning
Nice-to-have
NVIDIA inference technology deployment
Transformer neural network understanding
Inference acceleration technologies
Open-source project contributions
Technical mentorship and leadership
Key Requirements
5+ years in Solutions Architecture
Experience with NVIDIA Dynamo, Triton, or TensorRT-LLM