Responsible for the fine-tuning of large language models, including supervised fine-tuning (SFT), reinforcement learning from human feedback (RLHF), and domain-specific model adaptation, to improve model performance and adaptability in vertical scenarios.
Must-have
Nice-to-have
Not specified