Solutions Architect, Llm Model Builder

Nvidia Corporation

CA, United States
Base: 152,000 usd - 241,500 usd; bonus/equity: equ...
**
Foundation model solutions
Reasoning, multimodal, fine-tuning, serving
Compute planning and sizing
** NVIDIA is seeking a Solutions Architect specializing in foundation models to support partners in building and deploying large-scale AI solutions. The candidate will act as a technical advisor, guiding partners through various stages of model implementation and optimization. **

Job Summary

  • Act as a strategic technical expert and hands-on advisor, helping partners build, benchmark, fine-tune, optimize, and deploy foundation model solutions.
  • Guide partners on compute planning, including cluster sizing, GPU and network selection, storage, memory tradeoffs, and latency/throughput targets.
  • Develop reference architectures, playbooks, benchmark recipes, TCO calculators, and sizing models across NVIDIA's accelerated computing platform.

Matching Summary

Match Score: 75

** NVIDIA is seeking a Solutions Architect specializing in foundation models to support partners in building and deploying large-scale AI solutions. The candidate will act as a technical advisor, guiding partners through various stages of model implementation and optimization. **

Salary

Base: 152,000 USD - 241,500 USD; Bonus/Equity: equity; Benefits: comprehensive benefits package

Skills & Requirements

Must-have

  • Foundation model solutions
  • Reasoning, multimodal, fine-tuning, serving
  • Compute planning and sizing
  • Inference architecture guidance
  • Python, PyTorch, JAX, or TensorFlow
  • CUDA, NeMo, TensorRT-LLM, Triton, NIMs

Nice-to-have

  • Large-scale AI systems deployment
  • GPU infrastructure knowledge
  • Open-source contributions
  • Cross-functional technical communication

Key Requirements

  • MSc, PhD in relevant fields or equivalent experience
  • 5+ years of experience with LLMs and large-scale inference systems
  • Hands-on expertise in fine-tuning, benchmarking, evaluation, optimization, and production deployment
  • Familiarity with reasoning models, RL, and synthetic data generation
  • Familiarity with Nemotron, NeMo, Dynamo, TensorRT-LLM, Triton, vLLM

Work Rights

Not specified

Tailored Resume

Cover Letter