Sr Associate Site Reliability Engineer (SRE)

Fully remote
Minimum two years SRE or DevOps experience
Bachelor's degree in Computer Science
Strong Linux operating system knowledge

Job Summary

  • The role involves preparing, automating, and executing weekly maintenance activities and tenant management operations for a leading AI platform.
  • Candidates will create comprehensive runbooks for maintenance orchestration and issue remediation while focusing on security and self-healing capabilities.
  • Workday offers a flexible work approach that combines remote and in-person time, requiring at least half of each quarter to be spent in the office or in the field.

Skills & Requirements

Must-have

  • Minimum two years SRE or DevOps experience
  • Bachelor's degree in Computer Science
  • Strong Linux operating system knowledge
  • Detailed runbook documentation skills

Nice-to-have

  • Experience with Ansible, Python, or Bash automation
  • Knowledge of Kubernetes and Public Cloud providers
  • Passion for reducing toil and improving resiliency
  • Collaborative mindset with curiosity and optimism

Key Requirements

  • 2+ years of experience in SRE, DevOps, or Operations
  • Bachelor's Degree in Computer Science or related field

Work Rights

Not specified