Apply software engineering techniques, automation, and best practices in incident response to ensure the reliability, availability, and scalability of systems, platforms, and technology
Job Summary
Apply software engineering techniques, automation, and best practices in incident response to ensure the reliability, availability, and scalability of systems, platforms, and technology.
Collaborate with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle.
Stay informed of industry technology trends and innovations, and actively contribute to the organization's technology communities to foster a culture of technical excellence and growth.
Skills & Requirements
Must-have
Kubernetes (OpenShift, EKS, AKS)
Python or Golang development
Docker and Containerization
Automation of operational processes
System reliability and scalability
Nice-to-have
Cloud technologies (AWS, Azure)
Ansible Playbooks or Chef Cookbooks
Complex system integrations
REST APIs and microservices
Observability and Telemetry
Key Requirements
Experience configuring, using, or maintaining Kubernetes
Experience developing software in Python or Golang
Experience with Docker, containers, and cloud-native utilities