and activity. Knowledge of distributed computing and cloud-native applications, including proficiency in AWS, Terraform, ELK stack (including monitoring tools as mentioned), PagerDuty/OpsGenie or similar, and Jenkins. NON-TECHNICAL REQUIREMENTS: Awareness of Site Reliability Engineering (SRE) principles, including Service Level Objectives (SLOs), Service Level Indicators (SLIs), and more »
across web, mobile and API channels; Provide 1st and 2nd line support for the trading platforms; System monitoring with real time monitoring tools. (Nagios, OpsGenie, Splunk, AppDynamics, Geneos and Bespoke tools); Provide proactive and reactive support to application and operational issues across both production and non-production environments; Proactively more »
high service availability, able to define, implement and improve business performance SLO’s. Production operations including 24x7 on-call support, escalation/paging with OpsGenie, incident management, RCA (Root Cause Analysis) Maintain existing compliance and governance standards established in the business Key Experience: Deep understanding of Google Cloud (GCP more »