Sr Site Reliability Engineer

Thetradedesk

Sydney, Australia
On-site
Network automation
Large-scale ip networking
Bgp and ospf protocols
Design, build, and scale a global network platform spanning physical datacenters and multi-cloud environments across AWS, Azure, and Alibaba Cloud

Job Summary

  • Design, build, and scale a global network platform spanning physical datacenters and multi-cloud environments across AWS, Azure, and Alibaba Cloud.
  • Own troubleshooting and resolution of complex network issues, upholding high availability and performance across the entire infrastructure footprint.
  • Eliminate toil by building tools, automating workflows, and continuously improving the processes your team depends on every day.

Matching Summary

Design, build, and scale a global network platform spanning physical datacenters and multi-cloud environments across AWS, Azure, and Alibaba Cloud.

Skills & Requirements

Must-have

  • Network automation
  • Large-scale IP networking
  • BGP and OSPF protocols
  • Kubernetes networking (Cilium, Calico)
  • Software load balancers (NGINX, Envoy, HAProxy)
  • Troubleshooting Kubernetes/Docker networking
  • Python or Go for automation

Nice-to-have

  • AI-assisted development tools
  • Platform engineering background
  • Self-directed high-impact contributions
  • Empathetic and collaborative mindset
  • Interest in building infrastructure

Key Requirements

  • 6-8 years of network automation and operational experience
  • Strong development and networking experience
  • Deep expertise in TCP/IP and OSI model
  • Experience with Kubernetes networking technologies
  • Managed software load balancers
  • Proficient in advanced networking technologies (IPv6, SDN, QoS)
  • Operated network devices at scale
  • Skilled with monitoring and alerting systems (Prometheus, Grafana)
  • Infrastructure-as-code and DevOps/SRE principles
  • Experience integrating AI tools into engineering processes

Work Rights

Not specified

Tailored Resume

Cover Letter