Proficiency in python, go, or javascript programming
Experience with prometheus, grafana, splunk, or elk stack
This role combines expertise in full-stack development, network engineering, and site reliability engineering to enhance critical infrastructure observability
Job Summary
This role combines expertise in full-stack development, network engineering, and site reliability engineering to enhance critical infrastructure observability.
You will design and implement monitoring, alerting, and visualization tools using platforms like Prometheus, Grafana, and Splunk.
The position requires creating automated alerting systems and scripts to detect network anomalies and ensure minimal downtime.
Matching Summary
This role combines expertise in full-stack development, network engineering, and site reliability engineering to enhance critical infrastructure observability.
Skills & Requirements
Must-have
5+ years experience in network engineering or SRE
Proficiency in Python, Go, or JavaScript programming
Experience with Prometheus, Grafana, Splunk, or ELK Stack
Strong understanding of TCP/IP, BGP, OSPF, DNS protocols
Knowledge of Infrastructure as Code tools like Terraform
Nice-to-have
Experience with AI/ML-based network monitoring tools
Familiarity with chaos engineering practices
Certifications such as CCNA, CCNP, or AWS Advanced Networking
Experience with Kubernetes and service meshes
Background in cloud networking across AWS, Azure, GCP
Key Requirements
Bachelor's degree in computer science or related field
5+ years of experience in network engineering, SRE, or full-stack development
Basic familiarity with container orchestration tools like Kubernetes