Senior Solutions Architect - Kv Cache And Ai Storage

Nvidia Corporation

Not specified
Llm inference platforms
Nvidia gpus
Kv cache solutions
Nvidia Corporation is seeking a Senior Solutions Architect focused on KV Cache and AI Storage to lead technical explorations with customers and develop next-generation LLM inference platforms. The ideal candidate will have extensive experience in distributed storage, caching, and a solid understanding of modern storage technologies, particularly in relation to AI applications

Job Summary

  • Collaborate closely with our largest customers to build next-generation LLM inference platforms powered by NVIDIA GPUs, Dynamo/KVBM, and CMX.
  • Build end-to-end KV cache solutions using tiered memory and NVIDIA modern networking technologies.
  • Translate customer difficulties into clear feature requests and roadmap input for NVIDIA products.

Matching Summary

Match Score: 85

Nvidia Corporation is seeking a Senior Solutions Architect focused on KV Cache and AI Storage to lead technical explorations with customers and develop next-generation LLM inference platforms. The ideal candidate will have extensive experience in distributed storage, caching, and a solid understanding of modern storage technologies, particularly in relation to AI applications.

Skills & Requirements

Must-have

  • LLM inference platforms
  • NVIDIA GPUs
  • KV cache solutions
  • distributed storage
  • Transformer / LLM inference
  • NVMe SSDs
  • tiered memory

Nice-to-have

  • LLM inference platforms in cloud
  • custom KV stores/cache layers
  • NVIDIA technologies exposure
  • public talks or papers

Key Requirements

  • Bachelor's degree or higher
  • 5+ years of relevant experience
  • 2+ years passionate about KV stores/caches
  • Hands-on experience with distributed storage
  • Experience with at least one LLM serving stack

Work Rights

Not specified

Tailored Resume

Cover Letter