Research Intern — Applied Reinforcement Learning

Centific Global

Palo Alto, CA, US
$35-$45 hourly; not specified; not specified ph
Fully remote
Reinforcement learning
Agentic ai workflows
Llm-based agents
Centific AI Research seeks a PhD Research Intern to design and evaluate reinforcement learning (RL) systems for agentic AI workflows

Job Summary

  • Centific AI Research seeks a PhD Research Intern to design and evaluate reinforcement learning (RL) systems for agentic AI workflows.
  • You will develop RL environments, reward models, and post-training pipelines for LLM-based agents, translating research into practical enterprise solutions.
  • Competitive stipend and real-world impactful projects with mentorship from researchers and engineers are offered.

Matching Summary

Centific AI Research seeks a PhD Research Intern to design and evaluate reinforcement learning (RL) systems for agentic AI workflows.

Salary

$35-$45 Hourly; Not specified; Not specified

Skills & Requirements

Must-have

  • Reinforcement Learning
  • Agentic AI workflows
  • LLM-based agents
  • PyTorch
  • GPU-based training
  • RL fundamentals

Nice-to-have

  • Top ML conference publications
  • Offline RL research
  • Model-based RL research
  • Hierarchical RL research
  • Multi-agent systems experience

Key Requirements

  • PhD candidate in CS, ML, or related field
  • Strong Python and PyTorch skills
  • Solid understanding of RL fundamentals
  • Experience with LLMs and post-training techniques
  • Strong experimentation practices

Work Rights

Not specified

Tailored Resume

Cover Letter