Senior Ai Engineer - Grafana Ops, Ai/ml | Usa | Remote

Grafana Labs

Canada
Base: cad 164,490 - cad 197,389; equity: restricte...
Remote
Ai solutions for incident detection
Llm and agent-powered workflows
Observability data and tools
Build and deliver AI solutions to help users detect, triage, and resolve incidents using observability data and tools

Job Summary

  • Build and deliver AI solutions to help users detect, triage, and resolve incidents using observability data and tools.
  • Implement a highly iterative process where you quickly prototype, test, and validate with real users, including shipping and evolving LLM- or agent-powered workflows.
  • Work with data analysts, product managers, and designers to shape AI-driven product features, including integration of agentic components with internal tools, alerting systems, runbooks, and developer workflows.

Matching Summary

Build and deliver AI solutions to help users detect, triage, and resolve incidents using observability data and tools.

Salary

Base: CAD 164,490 - CAD 197,389; Equity: Restricted Stock Units (RSUs); Benefits: Not specified

Skills & Requirements

Must-have

  • AI solutions for incident detection
  • LLM and agent-powered workflows
  • Observability data and tools
  • Cloud-native environments (AWS, GCP, Azure)
  • Observability tools for troubleshooting

Nice-to-have

  • Agent frameworks or multi-agent workflows
  • Infrastructure/DevOps tooling
  • Model fine-tuning techniques
  • Building observability tooling

Key Requirements

  • Experience with LLMs, prompt engineering
  • Building applications powered by GenAI
  • Delivering production software
  • Cloud-native environments exposure
  • Experience using observability tools

Work Rights

Not specified

Tailored Resume

Cover Letter