Senior Machine Learning Engineer - Model Evaluations, Public Sector

Scale

San Francisco, CA, US
Base: $240,450 - $300,300 usd (sf/ny/seattle); bas...
On-site
Python programming skills
Tensorflow or pytorch experience
Automated ml evaluation pipeline development
The role involves deploying advanced AI systems including LLMs and agentic models into mission-critical government environments

Job Summary

  • The role involves deploying advanced AI systems including LLMs and agentic models into mission-critical government environments.
  • Candidates will design test datasets and benchmarks to measure generalization, bias, explainability, and failure modes.
  • This position requires an active security clearance or the ability to obtain one within a short timeframe.

Matching Summary

The role involves deploying advanced AI systems including LLMs and agentic models into mission-critical government environments.

Salary

Base: $240,450 - $300,300 USD (SF/NY/Seattle); Base: $216,300 - $269,850 USD (DC/TX/CO/HI); Equity: Included based on Board approval; Benefits: Health, dental, vision, retirement, PTO, stipend

Skills & Requirements

Must-have

  • Python programming skills
  • TensorFlow or PyTorch experience
  • Automated ML evaluation pipeline development
  • LLM agent testing infrastructure
  • Active security clearance eligibility

Nice-to-have

  • Graduate degree in CS or AI
  • Cloud deployment experience AWS GCP
  • Knowledge of adversarial robustness
  • Experience with regulated ML domains
  • Familiarity with interpretability frameworks

Key Requirements

  • Active security clearance or ability to obtain
  • Strong background in algorithms and data structures
  • Production experience in deep learning or NLP

Work Rights

Must have active security clearance or ability to obtain

Tailored Resume

Cover Letter