Senior/Staff Research Scientist - Frontier Benchmarks
Snorkel AI
Remote, United States
Snorkel AI is seeking a Senior/Staff Research Scientist to lead the development of frontier benchmarks and datasets, focusing on the intersection of research and commercial strategy. The ideal candidate should have a strong background in AI/ML evaluation, exceptional communication skills, and a genuine interest in the commercial aspects of AI data services.
Job Summary
The role involves designing state-of-the-art datasets, informed by current model performance, that drive frontier model training and evaluation.
You will translate benchmark insights into compelling narratives to articulate the ROI of expert-curated data for customers and stakeholders.
This position offers a unique opportunity to shape priorities and influence strategic decisions at a company scaling rapidly with market-proven solutions.
Salary
Base: $220,000 - $320,000 USD; Bonus/Equity: Not specified; Benefits: Not specified
Skills & Requirements
Must-have
Strong research background in AI/ML evaluation
Track record of rigorous experimental design
Experience measuring impact of training data on models
Nice-to-have
Exceptional communication skills for technical and non-technical audiences
Genuine interest in GTM strategy and startup dynamics
Comfort operating in fast-moving ambiguous problem spaces
Key Requirements
Ph.D. in machine learning, NLP, or related field preferred
Equivalent industry or research lab experience considered