Senior Software Engineer, Observability Insights

CoreWeave

New York, NY, US
Base: $165,000 to $242,000; bonus/equity: discreti...
On-site
Backend systems and distributed apis
Developer-facing infrastructure
Reliability engineering and fault-tolerant design
CoreWeave is building the next-generation insights layer for AI systems, empowering users to understand, troubleshoot, and optimize complex AI workloads by transforming telemetry into actionable insights

Job Summary

  • CoreWeave is building the next-generation insights layer for AI systems, empowering users to understand, troubleshoot, and optimize complex AI workloads by transforming telemetry into actionable insights.
  • As a Senior Software Engineer, you will lead the development of agentic interfaces and product experiences, design multi-tenant APIs, managed Grafana experiences, and MCP-based tool servers.
  • CoreWeave offers a comprehensive benefits program including 100% paid medical, dental, and vision insurance, a 401(k) with employer match, and flexible PTO.

Matching Summary

CoreWeave is building the next-generation insights layer for AI systems, empowering users to understand, troubleshoot, and optimize complex AI workloads by transforming telemetry into actionable insights.

Salary

Base: $165,000 to $242,000; Bonus/Equity: discretionary bonus, equity awards; Benefits: comprehensive benefits program

Skills & Requirements

Must-have

  • backend systems and distributed APIs
  • developer-facing infrastructure
  • reliability engineering and fault-tolerant design
  • observability systems like ClickHouse, Loki, Prometheus
  • agentic applications or LLM-based features
  • production code in Go and Python integration

Nice-to-have

  • customer-obsessed approach to SDKs, CLIs
  • SLOs, error budgets, multi-tenant system resilience
  • grounding, tool calling, operational safety
  • end-to-end telemetry-to-insights pipelines
  • Kubernetes clusters at scale for AI workloads
  • logging, tracing, and metrics platforms
  • distributed systems or API services at cloud scale
  • LLM frameworks, MCP, and agentic tooling

Key Requirements

  • 6+ years of experience in software or infrastructure engineering
  • Experience with observability systems
  • Experience in agentic applications or LLM-based features
  • Comfortable writing production code primarily in Go
  • Collaborative experience in agile teams

Work Rights

Not specified

Tailored Resume

Cover Letter