Shell and scripting languages (python, go, bash, ruby)
Configuration management tools (puppet, ansible)
**
The Wikimedia Foundation is seeking a Senior Site Reliability Engineer to enhance the reliability and delivery of Wikipedia and its underlying infrastructure. The role emphasizes operational tasks, automation, and collaboration in a remote work environment, while embodying the organization's values focused on free knowledge sharing.
**
Job Summary
The SRE team is responsible for ensuring the global top-10 website and its underlying infrastructure are healthy and developing further.
Perform day-to-day operational/DevOps tasks, implement configuration management, and lead continuous improvement through automation.
Work closely with product teams on architectural design for scalable functionality and participate in a 24/7 on-call rotation.
Matching Summary
Match Score: 75
**
The Wikimedia Foundation is seeking a Senior Site Reliability Engineer to enhance the reliability and delivery of Wikipedia and its underlying infrastructure. The role emphasizes operational tasks, automation, and collaboration in a remote work environment, while embodying the organization's values focused on free knowledge sharing.
**
Salary
Base: US$ 113,082 - US$ 175,725 (for US-based applicants); Bonus/Equity: Not specified; Benefits: Not specified
Skills & Requirements
Must-have
6+ years SRE/Operations/DevOps experience
Shell and scripting languages (Python, Go, Bash, Ruby)
Configuration management tools (Puppet, Ansible)
Distributed caching systems
TCP/IP, HTTP, TLS, DNS understanding
Linux system-level troubleshooting
Automating tasks and processes
Incident response and root cause analysis
Nice-to-have
High-performance HTTP(S) caching-proxy software
Linux kernel tuning
Monitoring, metrics, logging infrastructure
Free and Open Source software contribution
LAMP stack technologies
Defining cross-team SLOs
Key Requirements
6+ years of experience in an SRE/Operations/DevOps role
Experience with shell and scripting languages (Python, Go, Bash, Ruby)
Experience with configuration management tools (Puppet, Ansible)
Experience with distributed caching systems
Thorough understanding of TCP/IP, HTTP, TLS, and DNS
Experience with package management on Linux systems (Debian)
Strong Linux system-level troubleshooting skills
History of automating tasks and processes
Experience leading and participating in incident response