Machine Learning Scientist I/II, Multi-modal Scientific Reasoning

Lila Sciences

Cambridge, United States
$176,000 – $304,000 USD; bonus potential + generou...
On-site
Lila Sciences is seeking a Machine Learning Scientist to advance multi-modal reasoning with vision-language models using real-world scientific data. The ideal candidate will have an advanced degree, experience in multi-modal ML, and strong engineering skills in modern ML frameworks.

Job Summary

  • We’re hiring a Machine Learning Scientist to advance multi‑modal reasoning with vision‑language models (VLMs) on real‑world scientific data.
  • You’ll design and build state‑of‑the‑art methods to advance the state of Scientific Superintelligence.
  • We offer competitive compensation including bonus potential and generous early equity.

Salary

$176,000 – $304,000 USD, plus bonus potential and generous early equity

Skills & Requirements

Must-have

  • multi-modal reasoning with vision-language models
  • interpreting scientific data (images, plots, text)
  • designing training, adaptation, and test-time methods
  • building datasets and benchmarks from scientific artifacts
  • developing perception modules (OCR, table/structure recognition, plot parsing)
  • modern machine learning frameworks (PyTorch, Hugging Face)

Nice-to-have

  • scientific data modalities in real-world laboratories
  • publications in top ML/CV/NLP venues
  • contributions to open-source multi-modal tooling

Key Requirements

  • Advanced degree in CS/AI, Applied Math/Stats, EE, or physical sciences with ML focus, or equivalent experience
  • Track record in multi-modal ML or VLMs
  • Understanding of scientific QA/benchmarks and custom evaluation design
  • Experience with multi-modal fine-tuning, document parsing & understanding, dataset curation and benchmarking
  • Strong engineering skills

Work Rights

Not specified
