Ai Computing Development Engineer, Tensorrt-llm

Nvidia Corporation

Not specified; not specified; not specified
C/c++ or python programming skills
Experience with pytorch and tensorrt-llm
Deep learning framework expertise
The role involves crafting robust inferencing software scalable across multiple platforms for NVIDIA's product lines

Job Summary

  • The role involves crafting robust inferencing software scalable across multiple platforms for NVIDIA's product lines.
  • Candidates must possess excellent C/C++ or Python programming skills along with experience in deep learning frameworks like PyTorch and TensorRT-LLM.
  • The team is responsible for guiding the direction of machine learning inferencing while collaborating with software, research, and product teams.

Matching Summary

The role involves crafting robust inferencing software scalable across multiple platforms for NVIDIA's product lines.

Salary

Not specified; Not specified; Not specified

Skills & Requirements

Must-have

  • C/C++ or Python programming skills
  • Experience with PyTorch and TensorRT-LLM
  • Deep learning framework expertise
  • Performance analysis and optimization
  • Master's degree in Computer Science

Nice-to-have

  • Strong curiosity about AI developments
  • Excellent interpersonal communication skills
  • Ability to work without supervision
  • Publishing results in scientific conferences
  • Collaboration across research teams

Key Requirements

  • Masters or higher degree in Computer Engineering or related field
  • 2+ years of relevant software development experience
  • Proficiency in debugging and test design

Work Rights

Not specified

Tailored Resume

Cover Letter