Job Summary
Own incident management end-to-end, including triage, mitigation, root-cause analysis, and blameless postmortems with action items.
Proactively identify performance bottlenecks and lead tuning efforts across PHP, MySQL, caching, and queues to ensure availability of critical e-commerce flows.
Reduce toil through scripting and automation using Bash, Python, or PHP CLI, and support stakeholders with ad-hoc investigations.
Skills & Requirements
Must-have
Production support for PHP web applications
PHP debugging and development
Linux shell and systemd
MySQL slow query analysis
Drupal and Acquia ecosystem
Atlassian suite expertise
Caching and queues support
Nice-to-have
Calm, structured communicator under pressure
Ownership mindset with a bias for action
Simplifying complex systems
Basic SQL analytics
Office tools knowledge
Key Requirements
3-6+ years of experience in production support, operations, or development
Familiarity with pure PHP development
Solid Linux system administration
MySQL expertise
CMS expertise in Drupal and Acquia
Expertise with project management tools
Experience with observability tools such as CloudWatch
Experience supporting payment gateways and order/fulfillment flows