Research Scientist, Multimodal Interaction & World Model
BYTEDANCE PTE. LTD.
Singapore, Singapore
Not specified (assumed hybrid or onsite based on location)
Multimodal understanding research
Large-scale model optimization
Computer vision and AIGC expertise
ByteDance is seeking a Research Scientist for its Multimodal Interaction & World Model team in Singapore to lead research in multimodal intelligence and AI applications. The role focuses on developing advanced models for multimodal understanding and interaction, requiring strong expertise in machine learning and computer vision.
Job Summary
The team focuses on solving challenges in multimodal intelligence and virtual reality world interaction through cutting-edge AI research.
Responsibilities include exploring large-scale multimodal understanding models, optimizing system performance, and building universal agents for GUIs and games.
Candidates are expected to have deep research experience in fields such as computer vision, AIGC, and machine learning to drive advances in foundation technology.
Matching Summary
Match Score: 85
Skills & Requirements
Must-have
Multimodal understanding research
Large-scale model optimization
Computer vision and AIGC expertise
Machine learning and reinforcement learning
Data construction and instruction fine-tuning
Nice-to-have
Top conference publication record
ACM/ICPC or Kaggle competition experience
Strong C/C++ or Python coding skills
Experience with multimodal RAG and visual CoT (chain-of-thought)
Leadership in influential multimodal projects
Key Requirements
Bachelor's degree or above in a related major
In-depth research experience in the relevant AI fields (e.g., computer vision, AIGC, machine learning)