Senior Engineer II, AI Inference Engine Systems

DigitalOcean

San Francisco, United States
On-site

Job Summary

  • The role involves acting as a technical leader responsible for designing critical data plane components that host large generative AI models.
  • Candidates must possess hands-on experience hosting large language models using inference engines like vLLM, SGLang, or Modular.
  • DigitalOcean offers competitive compensation including base salary, potential bonuses, equity grants, and access to LinkedIn Learning for career development.

Salary

  • Base: $167,200 - $209,000
  • Bonus/Equity: Eligible for bonus based on performance and equity grants upon hire
  • Benefits: Competitive array including flexible time off and EAP

Skills & Requirements

Must-have

  • Distributed systems expertise
  • AI/ML domain knowledge with vLLM or SGLang
  • Go or Python proficiency
  • GPU-level optimization experience
  • High-scale cloud operations

Nice-to-have

  • Experience with open-source software integration
  • Mentorship of junior engineers
  • Knowledge of continuous batching techniques
  • Familiarity with NVIDIA Dynamo or Ray Serve
  • Bias for action and scrappy mindset

Key Requirements

  • Expert-level proficiency in Go or Python
  • Proven experience shipping customer-facing software products
  • Strong background in microservices and infrastructure as code

Work Rights

Not specified
