Ensure the reliability, availability, and resiliency of Resmed’s digital products by designing and operating fault-tolerant systems
Job Summary
Ensure the reliability, availability, and resiliency of Resmed’s digital products by designing and operating fault-tolerant systems.
Design, implement, and maintain monitoring, alerting, logging, and tracing solutions that provide real-time visibility into system behavior and customer experience.
We focus on creating a diverse and inclusive culture, encouraging individual expression in the workplace and thrive on the innovative ideas this generates.
Matching Summary
Ensure the reliability, availability, and resiliency of Resmed’s digital products by designing and operating fault-tolerant systems.
Skills & Requirements
Must-have
Site Reliability Engineering
Kubernetes production systems
AWS and infrastructure-as-code
CI/CD pipelines and automated deployments
Python for automation
distributed systems and networking
Nice-to-have
developer experience improvement
operational maturity
customer experience focus
diverse and inclusive culture
Key Requirements
Experience in SRE, DevOps, or Infrastructure Engineering