Senior Software Engineer, Inference Platform

MongoDB

Palo Alto, California, US
Base: $126,000—$248,000 usd; bonus/equity: not spe...
On-site
Backend or infrastructure systems at scale
Cloud-native architectures
Distributed systems
Build the next-generation inference platform supporting embedding models for semantic search, retrieval, and AI-native experiences in MongoDB Atlas

Job Summary

  • Build the next-generation inference platform supporting embedding models for semantic search, retrieval, and AI-native experiences in MongoDB Atlas.
  • Design and build components of a multi-tenant inference platform integrated directly with MongoDB Atlas, supporting semantic search and hybrid retrieval.
  • Improve performance, autoscaling, GPU utilization, and resource efficiency in a cloud-native environment.

Matching Summary

Build the next-generation inference platform supporting embedding models for semantic search, retrieval, and AI-native experiences in MongoDB Atlas.

Salary

Base: $126,000—$248,000 USD; Bonus/Equity: Not specified; Benefits: Not specified

Skills & Requirements

Must-have

  • Backend or infrastructure systems at scale
  • Cloud-native architectures
  • Distributed systems
  • Multi-tenant service design
  • ML model serving and inference runtimes
  • Go, Rust, Python, or C++

Nice-to-have

  • Vector search systems knowledge
  • Hybrid retrieval understanding
  • Retrieval-augmented generation (RAG)
  • Open-source ML serving contributions

Key Requirements

  • 5+ years of experience
  • Strong software engineering skills
  • Cloud-native architectures
  • Distributed systems
  • Multi-tenant service design

Work Rights

Not specified

Tailored Resume

Cover Letter