Job Summary
Lead and grow a team of highly independent engineers across Reliability & Resilience and Developer Productivity teams; set org structure, hiring plan, and delivery goals.
Build an industry-leading reliability practice: manage SLOs and error budgets, run incident response and postmortems, and prioritize resilience work across critical services.
Competitive compensation package, including equity, inclusive healthcare, and flexible time off.
Salary
Base: $208,000-$300,000 (NYC); Bonus/Equity: Equity included; Benefits: Inclusive Healthcare Package, Learn and Grow, Flexible Time Off, WFH budget
Skills & Requirements
Must-have
Kubernetes production operations
CI/CD systems
API gateways
Storage and caching infrastructure
Observability
Secrets management
Cost and capacity management
SaaS vendor relationships
Nice-to-have
Experience using the Vercel platform
Experience managing managers
Fostering psychological safety
Technical conflict resolution
Key Requirements
3+ years managing engineers
8+ years building and operating large-scale distributed systems
Hands-on technical depth
Track record owning key platform dependencies
Demonstrated ownership of reliability programs
Proven ability to translate business goals into technical strategy