Senior / Principal Research Engineer, Llm Synthetic Data

Lilasciences

Cambridge, MA, USA
Base: $224,000—$336,000 usd; bonus/equity: bonus p...
On-site
Synthetic data strategy
Evaluation frameworks
Asset quality standards
Lila Sciences is seeking a Senior/Principal Research Engineer specializing in synthetic data to help drive the development and implementation of their synthetic data program. The ideal candidate will have extensive experience in machine learning and synthetic data generation, as well as a proven track record in leading initiatives in this field

Job Summary

  • Contribute to the vision, roadmap, and delivery of our synthetic data program, from asset generation and simulation to ML training integration and measurable model gains.
  • Design, generate, and implement artificial datasets to train, test, and improve Lila’s platform and help us reach our goals.
  • Develop evaluation frameworks that tie synthetic interventions to real model performance and establish standards for asset quality, diversity, documentation, and reproducibility.

Matching Summary

Match Score: 85

Lila Sciences is seeking a Senior/Principal Research Engineer specializing in synthetic data to help drive the development and implementation of their synthetic data program. The ideal candidate will have extensive experience in machine learning and synthetic data generation, as well as a proven track record in leading initiatives in this field.

Salary

Base: $224,000—$336,000 USD; Bonus/Equity: bonus potential and generous early equity; Benefits: Not specified

Skills & Requirements

Must-have

  • synthetic data strategy
  • evaluation frameworks
  • asset quality standards
  • ML workflows
  • Python
  • PyTorch

Nice-to-have

  • instruction fine tuning
  • hillclimbing
  • quantization/distillation
  • cost optimization
  • compliance-heavy environments

Key Requirements

  • 6+ years in applied ML/ML systems
  • 3+ years leading industry initiatives
  • 8+ years working with modern ML workflows
  • comfortable profiling and optimizing GPU-heavy pipelines

Work Rights

Not specified

Tailored Resume

Cover Letter