Staff AI Engineer, Model Post-training and Alignment

OKX

Singapore, Singapore
On-site
OKX is seeking a highly skilled Machine Learning Engineer specializing in large model post-training and alignment for its Singapore office. The ideal candidate has significant experience executing post-training pipelines and optimizing model performance, with a strong focus on reinforcement learning techniques.

Job Summary

  • Lead and execute the full post-training pipeline for large language models (LLMs), including supervised fine-tuning, preference optimization, and reinforcement learning–based methods.
  • Design and implement advanced training paradigms such as DPO (Direct Preference Optimization) and GRPO (Group Relative Policy Optimization).
  • Optimize inference efficiency and deploy models using low-latency serving frameworks such as vLLM and SGLang.

Skills & Requirements

Must-have

  • large model post-training pipeline
  • preference learning and alignment techniques
  • reinforcement learning-based optimization
  • low-latency inference deployment
  • domain-specific data strategies

Nice-to-have

  • collaboration with research teams
  • productionizing training and deployment workflows
  • AI feedback loops

Key Requirements

  • 8 years of industry experience
  • Bachelor's in Computer Science, AI, Machine Learning, or related fields
  • Deep familiarity with DPO, GRPO, and RL-based post-training
  • Experience training specialized small models from scratch
  • Experience deploying models with vLLM, SGLang, or similar

Work Rights

Must have current right to work in Singapore
