Deep Learning Engineer - LLM and VLM Model Compression

NVIDIA Corporation

Poland
Base: 292,500 PLN - 507,000 PLN or 375,000 PLN - 6...
On-site
Deep learning frameworks
LLM and VLM model compression
Pruning, distillation, and NAS

Job Summary

  • Design and implement a deep learning framework for compressing large language and vision-language models to deliver highly optimized, high-performance AI systems used worldwide.
  • Develop and integrate new algorithms for pruning, NAS, and distillation in collaboration with NVIDIA researchers and engineers.
  • Lead best practices for building, testing, and releasing DL software.

Salary

Base: 292,500 PLN - 507,000 PLN or 375,000 PLN - 650,000 PLN; Bonus/Equity: Not specified; Benefits: Not specified

Skills & Requirements

Must-have

  • Deep learning frameworks
  • LLM and VLM model compression
  • Pruning, distillation, and NAS
  • Python programming skills
  • PyTorch experience

Nice-to-have

  • Pushing the boundaries of AI efficiency
  • Collaboration with world-class teams
  • Experience with enterprise-grade GPU clusters
  • Experience with unreleased hardware
  • First-author publication

Key Requirements

  • 8+ years of experience
  • BSc, MS, or PhD degree
  • Hands-on LLM or VLM experience
  • Extensive DL framework knowledge
  • Strong problem-solving skills

Work Rights

Not specified
