Staff Software Engineer - Resiliency And Platform Engineering
SkyTouch Technology
Scottsdale, AZ, USA
Hybrid
Java-based services
Spring boot applications
Aws public cloud environments
You will help strengthen the resiliency, safety, and operability of a large-scale, multi-tenant SaaS platform by improving foundational platform capabilities, runtime behavior, and the developer experience
Job Summary
You will help strengthen the resiliency, safety, and operability of a large-scale, multi-tenant SaaS platform by improving foundational platform capabilities, runtime behavior, and the developer experience.
This role writes production code, but not for feature delivery; coding is focused on platform-level resiliency, developer enablement, observability, and systemic improvements rather than customer-facing feature enhancements.
Choice prioritizes our associate wellbeing by offering a comprehensive benefits program that is both competitive and flexible to help you achieve your wellbeing goals.
Matching Summary
You will help strengthen the resiliency, safety, and operability of a large-scale, multi-tenant SaaS platform by improving foundational platform capabilities, runtime behavior, and the developer experience.
Skills & Requirements
Must-have
Java-based services
Spring Boot applications
AWS public cloud environments
application monitoring and observability platforms
designing and delivering platform-level capabilities
AI-assisted development tools
Nice-to-have
improving how software is built and operated
strengthening platforms
improving developer experience
preventing failures
improving system behavior under stress
influencing engineering practices
Key Requirements
8-10+ years of hands-on experience
Bachelor’s degree in computer science or equivalent practical experience
Hands-on experience designing, building, and operating Java-based services
Experience developing and supporting cloud-native and serverless workloads
Strong practical experience working in AWS public cloud environments
Working knowledge of relational and non-relational data stores
Experience using application monitoring and observability platforms
Solid understanding of Site Reliability Engineering (SRE) principles