Staff Site Reliability Engineer

Figureai

San Jose, CA, United States
Base: $175,000 - $250,000 annually; bonus/equity: ...
On-site
Linux/unix systems administration
Cloud and on-prem infrastructure
High-availability systems design
This role is responsible for setting up and managing cloud and on-prem infrastructure to deliver highly available, reliable, and automated systems

Job Summary

  • This role is responsible for setting up and managing cloud and on-prem infrastructure to deliver highly available, reliable, and automated systems.
  • Be the go to person for mission critical infrastructure enabling critical operations such as Source Configuration Management, CI/CD systems, software distribution, supplier portals, manufacturing and more.
  • Reduce human workload through automation to automate deployment and scaling.

Matching Summary

This role is responsible for setting up and managing cloud and on-prem infrastructure to deliver highly available, reliable, and automated systems.

Salary

Base: $175,000 - $250,000 annually; Bonus/Equity: Not specified; Benefits: Not specified

Skills & Requirements

Must-have

  • Linux/Unix systems administration
  • Cloud and on-prem infrastructure
  • High-availability systems design
  • Infrastructure as Code (IaC)
  • Monitoring and alerting tools
  • Networking fundamentals

Nice-to-have

  • Migrate SaaS to self-hosted
  • Data-driven optimization
  • Cross-functional collaboration

Key Requirements

  • Strong Linux/Unix systems administration experience
  • Proficiency in programming/scripting
  • Extensive cloud platform experience (Azure, AWS, GCP)
  • Experience with IaC tools (Terraform, CloudFormation, Ansible)
  • Familiarity with monitoring tools (Prometheus, Grafana, Datadog)
  • Experience defining SLOs and runbooks

Work Rights

Not specified

Tailored Resume

Cover Letter