DevOps SRE
Job Description:
The Monitoring and Observability team is responsible for managing
Operating with a global footprint
Collaborating across various organizations to understand and develop observability solutions for enterprise-wide deployment at scale
Managing the legacy monitoring stack across the Production Management organization
Driving the strategic delivery of end-to-end Observability solutions
Providing in-depth analysis with interpretive thinking to define problems and develop innovative solutions
Directly impacting the business by influencing strategic functional decisions through advice, counsel, or provided services
Persuading and influencing others through strong and comprehensive communication and diplomacy skills
Performing other duties and functions as assigned
Essential Skills
OpenShift/Kubernetes Administration: Experience deploying, managing, and troubleshooting containerized applications on OpenShift/Kubernetes, including resource management and networking
Grafana Observability Stack
o Proficiency in administering ITRS Geneos at scale
o Proficiency in administering Grafana (user management, data sources, dashboards)
o Working knowledge of Grafana backend components: Mimir (metrics), Loki (logs), and Tempo (traces)
o Experience with Prometheus for metric collection and PromQL for querying
Helm Chart Management: Experience with Helm for deploying applications, including creating, modifying, and managing Helm charts, library charts, and dependencies
Technical Documentation: Ability to create clear and concise documentation for systems and processes
Desired Skills
Application Deployment: Ability to deploy applications using Lightspeed Enterprise
Google Cloud Operations: Experience with Google Cloud operations
Scripting/Automation: Experience with Bash or Python scripting for automating operational tasks
Qualifications
6-10 years of experience
Practical problem solving and strategic thinking skills
Demonstrated leadership, interpersonal, and relationship-building skills
Service-oriented attitude
Ability to work in a fast-paced environment
Experience working on or leading requirements-gathering efforts for multiple large development projects at one time
Proficient using basic technical tools and systems
Good interpersonal and communication skills
Skills
Mandatory Skills: Git, Observability
Good to Have Skills: Ansible, Docker, Kubernetes