Software Engineer Iii -gen Ai Inferencing

Bank of America (GHR)

Not specified; not specified; not specified
**
5+ years oop python/scala/java experience
Gen ai rag process implementation
Vllm/triton inference server deployment
** Bank of America is seeking a Software Engineer III specializing in Gen AI inferencing to join their innovative AI team. The role involves designing, building, and operating reusable toolkits for Gen AI capabilities, requiring strong programming skills in Python, Scala, or Java, alongside experience in deploying AI models. **

Job Summary

  • This position focuses on designing and building the next generation of Gen AI platform to empower initiatives across Consumer, Small Business, Global Banking, and Wealth organizations.
  • The role requires developing complex requirements while ensuring software meets functional, non-functional, and compliance standards with maintainability built-in from the outset.
  • Candidates will mentor other engineers, automate release activities, and collaborate with product teams to deliver secure, scalable, and high-performance AI capabilities.

Matching Summary

Match Score: 75

** Bank of America is seeking a Software Engineer III specializing in Gen AI inferencing to join their innovative AI team. The role involves designing, building, and operating reusable toolkits for Gen AI capabilities, requiring strong programming skills in Python, Scala, or Java, alongside experience in deploying AI models. **

Salary

Not specified; Not specified; Not specified

Skills & Requirements

Must-have

  • 5+ years OOP Python/Scala/Java experience
  • Gen AI RAG process implementation
  • vLLM/Triton Inference Server deployment
  • MLOps and fine-tuning techniques
  • CI-CD automation practices
  • MongoDB Redis API development
  • Containerization and DevOps tools

Nice-to-have

  • Experience with open-source Gen AI toolsets
  • Strong mentorship and coaching skills
  • Culture of quality and innovation
  • Research on new UI/UX analytics tools
  • Collaboration with data scientists

Key Requirements

  • 5+ years OOP programming experience
  • Expert level development skills in Python/Scala/Java
  • Hands-on experience with MLOps and inference frameworks
  • Production deployment experience with vLLM or Triton
  • Knowledge of generative AI RAG processes
  • Experience with MongoDB, Redis, and FastAPI
  • Proficiency with Git, Jenkins, SonarQube, and Ansible

Work Rights

Not specified

Tailored Resume

Cover Letter