Software Engineer, Infrastructure

FAL

San Francisco, CA, United States
Base: $180,000-250,000; bonus/equity: + equity; be...
On-site
Python production tooling development
Bare-metal and cloud server fleet management
Deep linux kernel tuning and networking
The role involves building a Python fleet tracking system to manage the full lifecycle of thousands of servers including procurement and health monitoring

Job Summary

  • The role involves building a Python fleet tracking system to manage the full lifecycle of thousands of servers including procurement and health monitoring.
  • Engineers will leverage AI to automate alerting and recovery processes while implementing strict OS-level security baselines.
  • The position offers competitive compensation ranging from $180,000 to $250,000 plus equity and relocation assistance to San Francisco.

Matching Summary

The role involves building a Python fleet tracking system to manage the full lifecycle of thousands of servers including procurement and health monitoring.

Salary

Base: $180,000-250,000; Bonus/Equity: Plus equity; Benefits: Health, dental, vision insurance

Skills & Requirements

Must-have

  • Python production tooling development
  • Bare-metal and cloud server fleet management
  • Deep Linux kernel tuning and networking
  • Configuration management with Ansible and Terraform
  • Distributed storage system optimization

Nice-to-have

  • NVIDIA GPU infrastructure and diagnostics
  • Network configuration and BGP protocols
  • AMD GPU experience
  • PXE/iPXE bare metal provisioning
  • SOC 2 and ISO 27001 compliance frameworks

Key Requirements

  • 3+ years managing server fleets at scale
  • Strong software engineering skills in Python
  • Deep knowledge of Linux boot process and kernel tuning

Work Rights

Not specified

Tailored Resume

Cover Letter