Software Engineering Manager 1 – Streaming & Cloud Platform Reliability

HPE (Hewlett Packard Enterprise)

Cupertino, California, United States
Hybrid

Job Summary

  • This role focuses on driving concrete postmortem action items to improve the reliability of Mist’s cloud platform and streaming data pipelines.
  • The position offers a hybrid work environment with onsite collaboration multiple days per week in Cupertino, California, and involves close collaboration with senior engineers and SRE teams.
  • HPE provides comprehensive benefits supporting physical, financial, and emotional wellbeing, along with programs for personal and professional development in an inclusive culture.

Salary

Base: USD 155,500 - 315,000; Bonus/Equity: Not specified; Benefits: Comprehensive health and wellbeing benefits

Skills & Requirements

Must-have

  • Incident management and postmortem follow-ups
  • Streaming data pipelines with Kafka, Flink, Storm
  • Backend distributed systems development
  • RESTful API design and operation
  • Cloud-native infrastructure with Kubernetes
  • Production incident response and remediation

Nice-to-have

  • Big-data and ETL systems experience
  • Webhook and event-delivery systems knowledge
  • Multi-region and disaster recovery design
  • DevOps practices and CI/CD automation
  • Observability stacks like Prometheus and Grafana
  • Collaborative team culture and mentoring

Key Requirements

  • 7+ years professional software engineering experience
  • 2+ years team lead experience with hands-on coding
  • 5+ years backend or distributed systems in Python, Go, or Java
  • 3+ years designing and operating distributed event-driven systems
  • 3+ years building and operating RESTful APIs
  • 3+ years with cloud-native infrastructure and CI/CD
  • 3+ years with production datastores like Redis and Postgres
  • 2+ years production incident response experience
  • US Citizen or Green Card holder required

Work Rights

Must have US citizenship or Green Card
