Lead Ai Engineer (fm Hosting, Llm Inference)

Capital One

McLean, VA, US
Mclean, va: $197,300 - $225,100; new york, ny: $21...
Llm inference and optimization
Foundation model training
Ai software components
We are creating responsible and reliable AI systems, changing banking for good

Job Summary

  • We are creating responsible and reliable AI systems, changing banking for good.
  • Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc.
  • Invent and introduce state-of-the-art LLM optimization techniques to improve the performance — scalability, cost, latency, throughput — of large scale production AI systems.

Matching Summary

We are creating responsible and reliable AI systems, changing banking for good.

Salary

McLean, VA: $197,300 - $225,100; New York, NY: $215,200 - $245,600; San Francisco, CA: $215,200 - $245,600; San Jose, CA: $215,200 - $245,600; Bonus/Equity: Performance based incentive compensation; Benefits: Comprehensive, competitive, and inclusive set of health, financial and other benefits

Skills & Requirements

Must-have

  • LLM inference and optimization
  • Foundation model training
  • AI software components
  • AWS Ultraclusters, Huggingface, VectorDBs
  • PyTorch, Nemo Guardrails

Nice-to-have

  • Transformative power of emerging AI
  • Clarity to big, undefined problems
  • Courage to share new ideas
  • Resilient trail blazer

Key Requirements

  • Bachelor's degree or Master's degree
  • 4+ years AI/ML algorithm experience
  • 2+ years AI/ML algorithm experience (Master's)
  • 4+ years Python, Go, Scala, or Java
  • 6 years cloud AI solution deployment

Work Rights

Not specified

Sponsorship: available

Tailored Resume

Cover Letter