Staff Data Scientist, Reasoning

Biohub

New York, NY, US
Base: $214,000 - $268,000; bonus/equity: eligible ...
On-site

Job Summary

  • This role is part of the Data team, responsible for maximizing the speed, agility, and capability of biological AI research by connecting public data resources and Biohub's experimental platforms to AI systems.
  • You will define the data approach to train our reasoning system, operating with broad scope and high autonomy, influencing roadmap decisions across teams while mentoring individual contributors.
  • We offer a wide range of benefits to support the people who make all we do possible, including a generous employer match on employee 401(k) contributions and paid time off to volunteer.

Salary

  • Base: $214,000 - $268,000
  • Bonus/Equity: Eligible for discretionary annual performance bonus program
  • Benefits: Generous 401(k) match, paid time off for volunteering, family-forming benefits, relocation support

Skills & Requirements

Must-have

  • design reasoning tasks for models
  • build training datasets for biology experiments
  • design training strategies for reasoning
  • create evaluation and benchmarking frameworks
  • partner with scientific data strategy
  • set technical direction for reasoning data

Nice-to-have

  • understand scientific experimentation deeply
  • think creatively about data representations
  • have experience building reasoning systems
  • invent methods for biological frontier models

Key Requirements

  • PhD in machine learning, computational biology, or another quantitative field
  • Hands-on understanding of scientific reasoning in biology
  • Experience curating training data for reasoning models
  • Track record of novel methodological contributions
  • Familiarity with evaluation methodology for ambiguous tasks
  • Strong computational skills (Python, data processing at scale)

Work Rights

Not specified
