This role sits at the intersection of reinforcement learning, large language models, and real-world autonomous systems to build learning systems that power continuously improving agents.
Base: $244,140 - $413,160; Bonus/Equity: Not specified; Benefits: Snacks, lunches, fun activities, competitive compensation package
Must-have
Nice-to-have
Not specified