Network Sre Full Stack Observability Engineer

Citi

Hybrid
5+ years experience in network engineering or sre
Proficiency in python, go, or javascript programming
Experience with prometheus, grafana, splunk, or elk stack
This role combines expertise in full-stack development, network engineering, and site reliability engineering to enhance critical infrastructure observability

Job Summary

  • This role combines expertise in full-stack development, network engineering, and site reliability engineering to enhance critical infrastructure observability.
  • You will design and implement monitoring, alerting, and visualization tools using platforms like Prometheus, Grafana, and Splunk.
  • The position requires creating automated alerting systems and scripts to detect network anomalies and ensure minimal downtime.

Matching Summary

This role combines expertise in full-stack development, network engineering, and site reliability engineering to enhance critical infrastructure observability.

Skills & Requirements

Must-have

  • 5+ years experience in network engineering or SRE
  • Proficiency in Python, Go, or JavaScript programming
  • Experience with Prometheus, Grafana, Splunk, or ELK Stack
  • Strong understanding of TCP/IP, BGP, OSPF, DNS protocols
  • Knowledge of Infrastructure as Code tools like Terraform

Nice-to-have

  • Experience with AI/ML-based network monitoring tools
  • Familiarity with chaos engineering practices
  • Certifications such as CCNA, CCNP, or AWS Advanced Networking
  • Experience with Kubernetes and service meshes
  • Background in cloud networking across AWS, Azure, GCP

Key Requirements

  • Bachelor's degree in computer science or related field
  • 5+ years of experience in network engineering, SRE, or full-stack development
  • Basic familiarity with container orchestration tools like Kubernetes

Work Rights

Not specified

Tailored Resume

Cover Letter