Site Reliability Engineering Lead

Workforcity

Base: $120,800.00 - $170,800.00; bonus/equity: not...
Ai and devops platform support
Incident and problem resolution
Platform roadmap definition
This role is responsible for contributing to the stability, reliability, and performance of our critical AI and DevOps platforms

Job Summary

  • This role is responsible for contributing to the stability, reliability, and performance of our critical AI and DevOps platforms.
  • The ideal candidate will help lead a team of SRE and Support engineers, facilitate incident and problem resolution, and collaborate with engineering and development teams to enhance platform services and supportability.
  • Assist in executing resilience activities such as wargaming scenarios, chaos engineering tests, and disaster recovery drills.

Matching Summary

This role is responsible for contributing to the stability, reliability, and performance of our critical AI and DevOps platforms.

Salary

Base: $120,800.00 - $170,800.00; Bonus/Equity: Not specified; Benefits: Not specified

Skills & Requirements

Must-have

  • AI and DevOps platform support
  • incident and problem resolution
  • platform roadmap definition
  • automation initiatives
  • enterprise-wide observability strategy
  • operational health of production platforms

Nice-to-have

  • fostering collaborative environment
  • skill development encouragement
  • cost-reduction efforts participation
  • business review meetings participation
  • hands-on familiarity with platform architecture

Key Requirements

  • 6+ years of relevant experience
  • experience contributing to architecture discussions
  • experience working with senior stakeholders
  • demonstrated experience supporting IT service improvements
  • strong communication and presentation skills
  • experience supporting technical roadmaps
  • experience participating in resilience-related activities
  • ability to collaborate with cross-functional teams
  • strong organizational and workload-planning skills
  • working knowledge of Generative AI concepts
  • experience with CI/CD and configuration management tools
  • experience with Red Hat OpenShift or similar Kubernetes technologies
  • experience with databases such as Postgres, Oracle, MongoDB, or Redis
  • experience writing or maintaining code in Java, Python, Go
  • hands-on experience with modern observability and monitoring tools
  • Bachelor’s/University degree required
  • Master’s degree preferred

Work Rights

Not specified

Tailored Resume

Cover Letter