Core technology development for large language models
Experience with rlhf and reward modeling
Proficient in python and deep learning frameworks
Tencent is seeking a Large Model Algorithm Researcher to advance the development of core technologies related to large language models, specifically in the post-training phase. The ideal candidate will possess advanced knowledge in AI, particularly in reinforcement learning and model alignment, and will contribute to optimizing algorithms and evaluation metrics
Job Summary
The role focuses on enhancing model capabilities through advanced algorithms.
You will manage data feedback loops to improve model training outcomes.
The company values diverse voices and fosters an innovative environment.
Matching Summary
Match Score: 85
Tencent is seeking a Large Model Algorithm Researcher to advance the development of core technologies related to large language models, specifically in the post-training phase. The ideal candidate will possess advanced knowledge in AI, particularly in reinforcement learning and model alignment, and will contribute to optimizing algorithms and evaluation metrics.
Skills & Requirements
Must-have
Core technology development for large language models
Experience with RLHF and Reward Modeling
Proficient in Python and deep learning frameworks
Nice-to-have
Strong technical enthusiasm and self-motivation
Good teamwork and communication skills
Experience with open-source contributions
Key Requirements
Master's degree or higher in relevant fields
Substantial research experience in post-training areas
Practical experience with large-scale training frameworks