Plan and implement test automation for apis and services
Build and maintain regression suites
As part of the NMX QA group, you will own the quality of a distributed, cloud-scale management and telemetry platform that sits at the heart of NVIDIA’s next-generation AI data centers
Job Summary
As part of the NMX QA group, you will own the quality of a distributed, cloud-scale management and telemetry platform that sits at the heart of NVIDIA’s next-generation AI data centers.
Design, develop, and execute end-to-end tests for new NMX features as part of GA and maintenance releases.
Investigate complex issues across multiple services: reproduce bugs, analyze logs and telemetry, collaborate closely with development and architecture teams to isolate root causes, and verify fixes.
Matching Summary
As part of the NMX QA group, you will own the quality of a distributed, cloud-scale management and telemetry platform that sits at the heart of NVIDIA’s next-generation AI data centers.
Skills & Requirements
Must-have
Design, develop, and execute end-to-end tests
Plan and implement test automation for APIs and services
Build and maintain regression suites
Integrate and validate with 3rd-party components
Investigate complex issues across multiple services
Contribute to product observability
Nice-to-have
Experience with telemetry/monitoring platforms
Experience in HPC or AI data center environments
Experience designing automation infrastructure
Hands-on experience with containers and orchestration
Familiarity with NVLink / InfiniBand
Key Requirements
5+ years of hands-on QA / test automation experience
5+ years with Python, Bash, or similar scripting
3+ years networking and system background (TCP/IP, L2/L3)
Strong Linux fundamentals
Proven ability to work independently and end-to-end