Principal Software Engineer - Ai Platform

ZeroAOG

Toronto, ON, CA
Base: $168,000 - $252,000 cad; bonus/equity: eligi...
Fully remote
Infrastructure automation and site reliability engineering
Managing kubernetes multi-cluster environments
Terraform for infrastructure as code
The role involves leading high-impact infrastructure initiatives for a leading AI platform, working with public clouds and capacity management principles to optimize developer interactions

Job Summary

  • The role involves leading high-impact infrastructure initiatives for a leading AI platform, working with public clouds and capacity management principles to optimize developer interactions.
  • You will architect cluster consolidation, design global continuous deployment systems, and lead ArgoCD rollout across multiple clusters ensuring security compliance.
  • Workday offers a culture rooted in integrity, empathy, and collaboration, providing trust, tools, and support for long-term growth and meaningful work.

Matching Summary

The role involves leading high-impact infrastructure initiatives for a leading AI platform, working with public clouds and capacity management principles to optimize developer interactions.

Salary

Base: $168,000 - $252,000 CAD; Bonus/Equity: Eligible for bonus and stock grants; Benefits: Comprehensive benefits package

Skills & Requirements

Must-have

  • Infrastructure Automation and Site Reliability Engineering
  • Managing Kubernetes multi-cluster environments
  • Terraform for Infrastructure as Code
  • GitOps workflows using ArgoCD
  • Python programming for automation and microservices
  • Designing distributed systems architecture
  • CI/CD pipeline orchestration with security compliance

Nice-to-have

  • Collaborative team player and mentor
  • Experience with monitoring tools like Grafana
  • Passion for innovative problem solving
  • Ability to work in FedRAMP compliant environments
  • Flexible work schedule with remote and in-person balance

Key Requirements

  • 10+ years software engineering or DevOps experience
  • 8+ years in Infrastructure Automation or SRE
  • 5+ years managing Kubernetes at scale
  • 5+ years using Terraform with AWS/GCP
  • 3+ years designing GitOps with ArgoCD
  • 5+ years professional Python programming
  • Lead architect for major infrastructure initiatives
  • Experience with CI/CD pipelines enforcing SOC2 and FedRAMP
  • BS/MS in Computer Science or related field
  • Availability for on-call rotational support

Work Rights

Not specified

Tailored Resume

Cover Letter