Cloud monitoring tools (new relic, dynatrace, splunk)
System metrics, logs, and traces analysis
Configure monitoring dashboards and alerts
Lead the design, implementation, and optimization of observability solutions using modern tools and techniques to monitor the health, performance, and behavior of our cloud infrastructure
Job Summary
Lead the design, implementation, and optimization of observability solutions using modern tools and techniques to monitor the health, performance, and behavior of our cloud infrastructure.
Build observability solutions utilizing advanced cloud monitoring tools such as New Relic, Dynatrace, Splunk or equivalent, to provide comprehensive insights into system metrics, logs, and traces.
Stay up to date with latest trends and innovations in observability and cloud monitoring technologies, evaluating and integrating new tools and methodologies to improve our observability capabilities.
Matching Summary
Lead the design, implementation, and optimization of observability solutions using modern tools and techniques to monitor the health, performance, and behavior of our cloud infrastructure.
Skills & Requirements
Must-have
Cloud monitoring tools (New Relic, Dynatrace, Splunk)
System metrics, logs, and traces analysis
Configure monitoring dashboards and alerts
Define observability best practices
Analyze system performance and behavior
Develop and automate observability processes
On-call for incidents and outages
Nice-to-have
Passionate about observability
Strong technical skills
Excellent communication and collaboration
Chaos engineering principles
Key Requirements
5+ years in cloud operations, SRE, or similar
3-5 years with Python or Go
Experience with AWS
Experience with Terraform
3-5 years designing observability solutions
Bachelor's degree or equivalent practical experience