Ml Engineer L4, Consumer Inference

Netflix

Los Gatos, CA, USA
Base: $100,000 - $464,000; bonus/equity: stock opt...
Machine learning model deployment
Large language models (llms) optimization
Python and java programming
Netflix is a leading entertainment service with a mission to innovate through machine learning and AI across its global platform

Job Summary

  • Netflix is a leading entertainment service with a mission to innovate through machine learning and AI across its global platform.
  • The role involves building customer-facing libraries and services to productize ML models for efficient, scalable, and low-latency inference in production.
  • Netflix offers a unique culture emphasizing transparency, autonomy, career growth, and comprehensive benefits including health plans and stock options.

Matching Summary

Netflix is a leading entertainment service with a mission to innovate through machine learning and AI across its global platform.

Salary

Base: $100,000 - $464,000; Bonus/Equity: Stock options available; Benefits: Comprehensive health, retirement, and paid leave programs

Skills & Requirements

Must-have

  • Machine learning model deployment
  • Large language models (LLMs) optimization
  • Python and Java programming
  • GPU inference optimization
  • Containerization with Docker
  • Kubernetes orchestration
  • ML model registry management

Nice-to-have

  • Customer-driven development mindset
  • Collaboration and communication skills
  • Autonomy in project decision-making
  • Experience with CI/CD and observability
  • Eagerness to learn and adapt

Key Requirements

  • Experience with TensorFlow and PyTorch
  • Proven skills in scalable model serving solutions
  • Strong programming expertise in Python and Java
  • Experience with Triton Inference Server and TensorRT
  • Knowledge of multi-language CI/CD pipelines
  • Ability to manage ML platform incident workflows

Work Rights

Not specified

Tailored Resume

Cover Letter