DevOps SRE

Job Description:

The Monitoring and Observability team is responsible for managing

Operating with a global footprint

Collaborating across various organizations to understand and develop observability solutions for enterprise wide deployment at scale

Managing the legacy monitoring stack across the Production Management organization

Driving the strategic delivery of end to end Observability solutions

Providing indepth analysis with interpretive thinking to define problems and develop innovative solutions

Directly impacting the business by influencing strategic functional decisions through advice counsel or provided services

Persuading and influencing others through strong and comprehensive communication and diplomacy skills

Performing other duties and functions as assigned

Essential Skills

OpenShiftKubernetes Administration Experience deploying managing and troubleshooting containerized applications on OpenShiftKubernetes including resource management and networking

Grafana Observability Stack

o Proficiency in administering Geneos ITRS at scale

o Proficiency in administering Grafana user management data sources dashboards s

o Working knowledge of Grafana backend components Mimir metrics Loki logs and Tempo traces

o Experience with Prometheus for metric collection and PromQL for querying

Helm Chart Management Experience with Helm for deploying applications including creating modifying and managing Helm charts library charts and dependencies

Technical Documentation Ability to create clear and concise documentation for systems and processes

Desired Skills

Application Deployment Ability to deploy applications using Lightspeed Enterprise

Google Cloud Operations Experience with Google Cloud operations

Scripting Automation Experience with Bash or Python scripting for automating operational tasks

Qualifications

6-10 years experience

Practical problem solving and strategic thinking skills

Demonstrated leadership interpersonal skills and relationship building skills

Service oriented attitude

Ability to work in a fastpaced environment

Experience working or leading requirement gathering efforts for multiple large development projects at onetime

Proficient using basic technical tools and systems

Good interpersonal and communication skills

Skills

Mandatory Skills : Git, Observability

Good to Have Skills : Ansible, Docker, Kubernetes

Job Details

Company
iXceed Solutions
Location
London Area, United Kingdom
Posted