the security operations team. They will also be proficient in using multiple ticketing systems to manage incidents effectively, ensuring service level objectives are adhered to. Experience utilising Kusto Query Language (KQL) for log analysis will also be beneficial. This is a …
levels. Collaborate with team members to identify service level indicators and establish service level objectives and error budgets with stakeholders. Maintain high technical expertise in one or more domains, proactively resolving technology bottlenecks. Serve … of software applications and technical processes, with emerging expertise in specific disciplines. Experience with observability tools like Grafana, Dynatrace, Prometheus, Datadog, Splunk, including monitoring, SLO alerting, and telemetry collection. Knowledge of CI/CD tools such as Jenkins, GitLab, Terraform. Experience with containers and orchestration tools like Docker, Kubernetes, ECS.
experience with Reliability concepts to ensure high performance and high service availability, able to define, implement, and improve business performance SLOs. 2+ years of experience with Production operations including 24x7 on-call support, escalation/paging with OpsGenie, incident management, RCA (Root Cause Analysis) and … of applications and services running on IaaS and PaaS in Microsoft Azure. AWS and GCP are nice to have. Service Level Objectives and indicators focused on improving business workflow performance and availability. Technical and business dashboards, metrics, and actionable alerting. Processes …