Staff/senior Staff Ai Engineer, Model Post-training And Alignment

OKX

San Jose, United States
Base: $313,055.00 to $450,000.00; bonus/equity: pe...
On-site
Large model post-training
Preference learning and alignment
Reinforcement learning
This role focuses on designing, executing, and optimizing post-training pipelines to improve model performance, controllability, domain adaptation, and reasoning capabilities

Job Summary

  • This role focuses on designing, executing, and optimizing post-training pipelines to improve model performance, controllability, domain adaptation, and reasoning capabilities.
  • You will work across the full lifecycle of post-training—from data strategy and reward modeling to reinforcement learning–based optimization and production-grade inference deployment.
  • Competitive total compensation package, L&D programs, Education subsidy, Various team building programs, company events, Wellness and meal allowances, Comprehensive healthcare schemes.

Matching Summary

This role focuses on designing, executing, and optimizing post-training pipelines to improve model performance, controllability, domain adaptation, and reasoning capabilities.

Salary

Base: $313,055.00 to $450,000.00; Bonus/Equity: performance bonus and long-term incentives may be provided; Benefits: full range of medical, financial, and/or other benefits

Skills & Requirements

Must-have

  • large model post-training
  • preference learning and alignment
  • reinforcement learning
  • domain-specific data strategies
  • low-latency serving frameworks

Nice-to-have

  • crypto exchange
  • decentralized applications
  • proof of reserves
  • We Before Me
  • Do the Right Thing
  • Get Things Done

Key Requirements

  • 8 years of industry experience
  • Bachelor's in Computer Science, AI, Machine Learning, or related fields
  • Experience deploying models in low-latency production environments

Work Rights

Not specified

Tailored Resume

Cover Letter