NVIDIA is seeking a Senior Software Engineer for AI Inference to enhance open-source LLM serving by optimizing upstream inference engines like vLLM and SGLang. The role involves hands-on contributions to performance improvements and collaboration with various teams to ensure high-throughput, low-latency inference on NVIDIA's platforms.
Matching Summary
Match Score: 85