Senior Ai Ops Engineer| Kubernetes/docker

KLA

Ann Arbor, MI, United States
Base: $134,800 - $229,200 annually; bonus/equity: ...
Experiment tracking and reproducibility
Ci/cd for machine learning
Workflow orchestration with airflow/kubeflow/argo
KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem investing heavily in innovation and R&D

Job Summary

  • KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem investing heavily in innovation and R&D.
  • The Senior AI Ops Engineer will build and own automated pipelines for model training, fine-tuning, evaluation, and deployment across environments.
  • Employees benefit from a comprehensive rewards package including medical, dental, vision, 401(K) matching, stock purchase programs, and career development opportunities.

Matching Summary

KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem investing heavily in innovation and R&D.

Salary

Base: $134,800 - $229,200 annually; Bonus/Equity: Not specified; Benefits: Medical, dental, vision, 401(K) matching, stock purchase, tuition reimbursement, and more

Skills & Requirements

Must-have

  • Experiment tracking and reproducibility
  • CI/CD for machine learning
  • Workflow orchestration with Airflow/Kubeflow/Argo
  • Containerization with Docker
  • Kubernetes orchestration
  • Distributed GPU training optimization
  • Python automation frameworks

Nice-to-have

  • Mentoring and setting organizational standards
  • Experience with RLHF pipelines
  • Parameter-efficient fine-tuning methods
  • Strong communication skills
  • Problem-solving in distributed systems
  • Experience with Infrastructure-as-Code

Key Requirements

  • Bachelor’s degree in Computer Science or related field
  • 5+ years experience in MLOps or related engineering roles
  • Master’s degree with 6 years experience or Bachelor’s with 8 years experience
  • Experience operating production ML systems
  • Experience with version control and CI/CD
  • Experience with GPU workloads at scale

Work Rights

Not specified

Tailored Resume

Cover Letter