Senior Machine Learning Engineer- Llms & Self-hosted Ai

Navan (TripActions)

Tel Aviv, Israel
On-site
Python and bash scripting
Pytorch and hugging face
Vllm inference deployment
Lead the transition to a self-hosted, scalable LLM ecosystem for an advanced agentic support chatbot

Job Summary

  • Lead the transition to a self-hosted, scalable LLM ecosystem for an advanced agentic support chatbot.
  • Architect, deploy, and optimize infrastructure for 50-100 distinct models ranging from 100M to 70B parameters.
  • Develop and refine agentic capabilities, optimize inference, and build rigorous evaluation frameworks for model performance.

Matching Summary

Lead the transition to a self-hosted, scalable LLM ecosystem for an advanced agentic support chatbot.

Skills & Requirements

Must-have

  • Python and Bash scripting
  • PyTorch and Hugging Face
  • vLLM inference deployment
  • Agentic systems development
  • LLM fine-tuning strategies
  • Offline and online model evaluation
  • A/B testing statistical analysis

Nice-to-have

  • Ray for distributed orchestration
  • Model quantization techniques
  • FastAPI microservices
  • Data engineering principles
  • MLOps practices and tools

Key Requirements

  • Experience with AI coding assistants
  • Experience with high-performance inference servers
  • Understanding of LLM architectures
  • Experience with PEFT/LoRA
  • Knowledge of classification/retrieval metrics
  • Experience with statistical tests for A/B testing

Work Rights

Not specified

Tailored Resume

Cover Letter