**
Nvidia Corporation is seeking a new college graduate for the role of AI Inference Performance Engineer, focusing on optimizing and benchmarking GenAI inference on its latest accelerators. The position involves a combination of technical leadership, collaboration with various teams, and contributions to open-source projects, offering a competitive salary and benefits.
**
Job Summary
We optimize and benchmark GenAI inference on NVIDIA's latest accelerators, defining the industry’s performance standards across language models, video generation, and speech workloads.
Drive industry benchmark results: own the end-to-end optimization pipeline, implement and integrate optimizations in quantization, scheduling, memory management, and distributed inference across TensorRT-LLM, SGLang, and vLLM.
Partner with architecture, kernel, and compiler teams to shape GPU roadmaps based on real workload data.
Matching Summary
Match Score: 75
**
Nvidia Corporation is seeking a new college graduate for the role of AI Inference Performance Engineer, focusing on optimizing and benchmarking GenAI inference on its latest accelerators. The position involves a combination of technical leadership, collaboration with various teams, and contributions to open-source projects, offering a competitive salary and benefits.
**