Dl Algorithms Engineer - Cosmos - New College Graduate 2026

Invidia

Us, CA, United States
Base: 124,000 usd - 241,500 usd depending on level...
Not specified
Deep learning model optimization
Large language models (llms)
Vision-language models (vlms)
NVIDIA is seeking a Deep Learning Algorithms Engineer, ideally a new college graduate by 2026, with expertise in optimizing and deploying Large Language Models and other advanced AI models. The role involves collaboration across teams to enhance model performance on NVIDIA GPU platforms for physical and generative AI applications

Job Summary

  • You will collaborate with research scientists, software engineers, and hardware specialists to bring cutting-edge AI models from prototype to production.
  • The role focuses on optimizing and deploying deep learning models for low-latency, high-throughput inference across diverse GPU platforms, especially for physical AI and generative AI applications.
  • You will contribute to automation and tooling development for NVIDIA Inference Microservices and create automated benchmarks to track performance regressions.

Matching Summary

Match Score: 85

NVIDIA is seeking a Deep Learning Algorithms Engineer, ideally a new college graduate by 2026, with expertise in optimizing and deploying Large Language Models and other advanced AI models. The role involves collaboration across teams to enhance model performance on NVIDIA GPU platforms for physical and generative AI applications.

Salary

Base: 124,000 USD - 241,500 USD depending on level; Bonus/Equity: Eligible; Benefits: Eligible

Skills & Requirements

Must-have

  • Deep learning model optimization
  • Large Language Models (LLMs)
  • Vision-Language Models (VLMs)
  • GPU performance profiling
  • PyTorch or TensorFlow deployment
  • Inference optimization techniques
  • Serving models with Triton Inference Server

Nice-to-have

  • Experience with NVIDIA Cosmos and Omniverse
  • Model quantization and parallelization
  • Distributed systems for large-scale inference
  • Automation and benchmarking tooling
  • Data curation pipelines
  • Collaboration with multidisciplinary teams

Key Requirements

  • Master’s or PhD in Computer Science or related field
  • Experience in deep learning or physical AI development
  • Strong foundation in transformer architectures and attention mechanisms
  • Proficient programming in Python and C++
  • Experience with GPU profiling tools like Nsight and nsys
  • Familiarity with Docker-based model serving
  • Not specified work authorization

Work Rights

Not specified

Tailored Resume

Cover Letter