Senior AI and HPC Observability Engineer

NVIDIA

Job Summary

  • NVIDIA is a pioneer in accelerated computing, known for inventing the GPU and driving breakthroughs in AI and HPC.
  • The role involves designing and scaling observability platforms for high-volume metrics, logs, and traces.
  • You will collaborate with various teams to deliver production-grade observability solutions.

Salary

Base: 152,000 USD - 241,500 USD for Level 3; Bonus/Equity: Not specified; Benefits: Not specified

Skills & Requirements

Must-have

  • Strong programming skills in Python, Go, or Java
  • Experience building distributed data pipelines
  • Solid experience with PromQL and time-series data systems

Nice-to-have

  • Proven experience designing observability platforms
  • Hands-on expertise with OpenTelemetry and Prometheus
  • Experience integrating observability with AI/ML pipelines

Key Requirements

  • Bachelor’s degree in Computer Science or related field
  • 5+ years of experience in production environments
  • Strong understanding of distributed systems and fault-tolerant design

Work Rights

Not specified
