Agent Rl Infra Engineer

Invidia

Us, CA, United States
Base: 224,000 usd - 356,500 usd; bonus/equity: eli...
Reinforcement learning techniques operationalization
Distributed training frameworks experience
Ml pipeline automation and gpu cluster management
This role offers a rare chance to shape how autonomous, self-improving agents learn and evolve across the enterprise

Job Summary

  • This role offers a rare chance to shape how autonomous, self-improving agents learn and evolve across the enterprise.
  • The work involves creating enterprise-ready RL capabilities and partnering with agent teams to implement them in production.
  • The position includes eligibility for equity and benefits, with a competitive base salary range based on location and experience.

Matching Summary

This role offers a rare chance to shape how autonomous, self-improving agents learn and evolve across the enterprise.

Salary

Base: 224,000 USD - 356,500 USD; Bonus/Equity: Eligible for equity; Benefits: Eligible for benefits

Skills & Requirements

Must-have

  • Reinforcement learning techniques operationalization
  • Distributed training frameworks experience
  • ML pipeline automation and GPU cluster management
  • Python, Go, or Rust programming proficiency
  • Enterprise-ready RL environment design
  • Integration with NeMo Microservices

Nice-to-have

  • Building RL environments as self-service capabilities
  • Familiarity with NVIDIA infrastructure and AI Factory
  • Experience with data curation and active learning
  • Knowledge of continuous learning loops and data flywheel architectures

Key Requirements

  • MS in CS, ML, or related field or equivalent experience
  • 10+ years of relevant experience
  • Experience operationalizing fine-tuning and RL methods
  • Familiarity with distributed training frameworks
  • ML ops skills including pipeline automation and job orchestration

Work Rights

Not specified

Tailored Resume

Cover Letter