Ml Ops Infrastructure Engineer

Centific

Remote
$150k annually; not specified; not specified
Hybrid
On-premises gpu infrastructure management
Kubernetes cluster administration
Nvidia gpu hardware and software
Our Vision AI platform runs where the data is generated — on-premises, inside government facilities, and at the network edge — not in a hyperscaler cloud

Job Summary

  • Our Vision AI platform runs where the data is generated — on-premises, inside government facilities, and at the network edge — not in a hyperscaler cloud.
  • As our MLOps / AI Infrastructure Engineer, you will own all of it.
  • Hands-on ownership of some of the most demanding AI infrastructure in the public sector — H200 GPU clusters, high-bandwidth interconnects, and purpose-built on-premises deployments.

Matching Summary

Our Vision AI platform runs where the data is generated — on-premises, inside government facilities, and at the network edge — not in a hyperscaler cloud.

Salary

$150K Annually; Not specified; Not specified

Skills & Requirements

Must-have

  • On-premises GPU infrastructure management
  • Kubernetes cluster administration
  • NVIDIA GPU hardware and software
  • High-bandwidth networking fabric
  • Software-defined storage solutions
  • MLOps pipelines and model serving
  • NIST SP 800-171 compliance

Nice-to-have

  • Air-gapped network environments
  • CMMC Level 2/3 assessment
  • NVIDIA DGX Systems experience
  • Edge Kubernetes deployment
  • Observability stacks

Key Requirements

  • 6+ years infrastructure engineering
  • 3+ years GPU compute clusters
  • Production Kubernetes on bare-metal
  • Strong networking fundamentals
  • Software-defined storage experience
  • Practical MLOps experience
  • Working knowledge of NIST SP 800-171
  • Proficiency with IaC tooling
  • Strong Linux systems administration

Work Rights

US Person status or active security clearance advantageous

Tailored Resume

Cover Letter