Software Engineer, Machine Learning Infrastructure

Stripe

US
On-site
Stripe is seeking a Software Engineer for its Machine Learning Infrastructure team, focusing on building scalable and reliable services that enhance ML development and operations. The ideal candidate should have a strong background in software development, particularly in service-oriented architecture and large-scale distributed systems, along with experience in ML platforms and MLOps.

Job Summary

  • The ML Infra team builds services and tools that power every step in the ML lifecycle, including data exploration, feature generation, experimentation, training, deploying and serving ML models, and building LLM applications.
  • You will design and build scalable, reliable, and secure services for notebooks, ML model training, experimentation, serving, and LLM applications across multiple regions.
  • You will work closely with machine learning engineers, data scientists, and product engineering teams to enable a seamless end-to-end experience in building solutions across data, analytics, and AI/ML platforms.

Skills & Requirements

Must-have

  • design and build scalable services
  • ML model training and serving
  • LLM applications development
  • full software development lifecycle
  • high-availability, low-latency systems

Nice-to-have

  • building production AI agents
  • familiarity with LLMs and frameworks
  • solving business problems
  • learning new technologies

Key Requirements

  • 2+ years of professional software development experience
  • service-oriented architecture and distributed systems
  • production ML platforms or MLOps solutions
  • running operations for high availability
  • partnering with other teams

Work Rights

Not specified
