You will collaborate across the organization to bring flagship models from our community and partners to life as optimized NVIDIA Inference Microservices
Job Summary
You will collaborate across the organization to bring flagship models from our community and partners to life as optimized NVIDIA Inference Microservices.
This role offers an outstanding opportunity to craft the future of AI at a fast-growing company at the forefront of the AI revolution.
You'll work on enterprise-grade GPU clusters capable of hundreds of PetaFLOPS and gain early access to unreleased hardware, impacting NVIDIA's roadmap and the broader AI landscape.
Matching Summary
You will collaborate across the organization to bring flagship models from our community and partners to life as optimized NVIDIA Inference Microservices.
Skills & Requirements
Must-have
Deep learning model evaluation
NVIDIA Inference Microservices (NIM)
Large language models (LLMs)
Performance analysis and debugging
AI/DL algorithms expertise
Collaboration with open-source community
Nice-to-have
Experience with TensorRT, ONNX, or Triton
DevOps/MLOps practices
High-performance computing (HPC) clusters
Linux and Docker containerization
Accuracy evaluation of LLMs
Key Requirements
BS, MS, or PhD in Computer Science or related field