Machine Learning Infrastructure Engineer- Model Inference

Abridge

San Francisco, CA, US
Remote
Kubernetes cluster administration
Ml model serving infrastructure
Api orchestration system
Your work will be instrumental in enhancing the scalability, efficiency, and performance of our AI-driven solutions

Job Summary

  • Your work will be instrumental in enhancing the scalability, efficiency, and performance of our AI-driven solutions.
  • Design, deploy and maintain scalable Kubernetes clusters for AI model inference and training.
  • At Abridge, we’re transforming healthcare delivery experiences with generative AI, enabling clinicians and patients to connect in deeper, more meaningful ways.

Matching Summary

Your work will be instrumental in enhancing the scalability, efficiency, and performance of our AI-driven solutions.

Skills & Requirements

Must-have

  • Kubernetes cluster administration
  • ML model serving infrastructure
  • API orchestration system
  • GPU utilization optimization
  • High-performance, low-latency serving

Nice-to-have

  • Responsible AI deployment
  • Generative AI for healthcare
  • EMR integrations
  • Linked Evidence technology

Key Requirements

  • Production ML model deployment experience
  • Container orchestration expertise
  • Distributed systems architecture knowledge
  • Kubernetes administration experience
  • API development for distributed systems

Work Rights

Not specified

Tailored Resume

Cover Letter