Join Cisco’s CX AI Incubation Team to build and operate scalable AI systems that move from prototype to production
Job Summary
Join Cisco’s CX AI Incubation Team to build and operate scalable AI systems that move from prototype to production.
You will focus on end-to-end AI DevOps for LLMs/SLMs, including on-prem inference packaging, runtime optimization, deployment automation, and model/service observability.
This role requires strong software engineering, hands-on GPU inference experience, and a track record of operationalizing models at scale.
Matching Summary
Join Cisco’s CX AI Incubation Team to build and operate scalable AI systems that move from prototype to production.
Salary
Base: $199,700.00 - $292,800.00; Bonus/Equity: Not specified; Benefits: Medical, dental and vision insurance, 401(k) plan with matching contribution, paid parental leave, disability coverage, basic life insurance, restricted stock units, paid time away (holidays, vacation, sick time, personal wellness, floating holidays, birthday, year-end shutdown)
Skills & Requirements
Must-have
LLM/SLM inference optimization
Productionize AI/ML features
On-prem inference packaging
Scalable serving architectures
CI/CD for models and prompts
Model and service observability
Nice-to-have
Cross-functional team collaboration
Fast-paced environment delivery
Clear technical communication
Key Requirements
7+ years of related experience with Bachelor's degree
4+ years of related experience with Master's degree
Experience in Python, Java or C++
Experience with PyTorch/TensorFlow
Experience deploying and operating NLP/Generative AI systems