Staff Ai Engineer - Grafana Ops, Ai/ml | Usa | Remote

Grafana Labs

USA
Base: usd 174,986 - usd 209,983; bonus/equity: equ...
Remote
Develop ai features for incident management
Rapid experimentation and iteration
Collaborate with cross-functional teams
Build and deliver AI solutions to help users detect, triage, and resolve incidents using observability data and tools

Job Summary

  • Build and deliver AI solutions to help users detect, triage, and resolve incidents using observability data and tools.
  • Implement a highly iterative process where you quickly prototype, test, and validate with real users, including shipping and evolving LLM- or agent-powered workflows.
  • Take full ownership of the AI solutions you develop, ensuring they are not only innovative but also scalable, maintainable, and aligned with real user workflows.

Matching Summary

Build and deliver AI solutions to help users detect, triage, and resolve incidents using observability data and tools.

Salary

Base: USD 174,986 - USD 209,983; Bonus/Equity: Equity, Bonus (if applicable); Benefits: Listed here

Skills & Requirements

Must-have

  • Develop AI features for incident management
  • Rapid experimentation and iteration
  • Collaborate with cross-functional teams
  • Utilize AI and automation tools
  • Experience with LLMs and GenAI applications
  • Deliver production software used by users
  • Cloud-native environments (AWS, GCP, Azure)
  • Experience with observability tools

Nice-to-have

  • Agent frameworks or multi-agent workflows
  • Infrastructure/DevOps tooling
  • Model fine-tuning techniques
  • Building observability tooling

Key Requirements

  • Experience with LLMs, prompt engineering, GenAI
  • Delivered production software
  • Cloud-native environments
  • Observability tools experience

Work Rights

Not specified

Tailored Resume

Cover Letter