Senior Ai/ml Platform Engineer (llm/slm Inference)

Cisco UK

San Jose, US
Base: $199,700.00 - $292,800.00; bonus/equity: not...
On-site
Llm/slm inference optimization
Productionize ai/ml features
On-prem inference packaging
Join Cisco’s CX AI Incubation Team to build and operate scalable AI systems that move from prototype to production

Job Summary

  • Join Cisco’s CX AI Incubation Team to build and operate scalable AI systems that move from prototype to production.
  • You will focus on end-to-end AI DevOps for LLMs/SLMs, including on-prem inference packaging, runtime optimization, deployment automation, and model/service observability.
  • This role requires strong software engineering, hands-on GPU inference experience, and a track record of operationalizing models at scale.

Matching Summary

Join Cisco’s CX AI Incubation Team to build and operate scalable AI systems that move from prototype to production.

Salary

Base: $199,700.00 - $292,800.00; Bonus/Equity: Not specified; Benefits: Medical, dental and vision insurance, 401(k) plan with matching contribution, paid parental leave, disability coverage, basic life insurance, restricted stock units, paid time away (holidays, vacation, sick time, personal wellness, floating holidays, birthday, year-end shutdown)

Skills & Requirements

Must-have

  • LLM/SLM inference optimization
  • Productionize AI/ML features
  • On-prem inference packaging
  • Scalable serving architectures
  • CI/CD for models and prompts
  • Model and service observability

Nice-to-have

  • Cross-functional team collaboration
  • Fast-paced environment delivery
  • Clear technical communication

Key Requirements

  • 7+ years of related experience with Bachelor's degree
  • 4+ years of related experience with Master's degree
  • Experience in Python, Java or C++
  • Experience with PyTorch/TensorFlow
  • Experience deploying and operating NLP/Generative AI systems

Work Rights

Not specified

Tailored Resume

Cover Letter