Research and develop novel post-training techniques, including SFT, RLHF, and reward modeling, to enhance LLM core capabilities in both text and multimodal modalities.
Base: $252,000 - $315,000 USD; Equity: Subject to Board of Director approval; Benefits: Comprehensive health, dental and vision coverage, retirement benefits, learning and development stipend, generous PTO, commuter stipend
Must-have
Nice-to-have
Not specified