1 to 25 of 3,216 Observability Jobs

Senior Service Quality Manager

Hiring Organisation
Undisclosed
Location
Sheffield, England, United Kingdom
overall processes. Focus Areas: Mission-critical production support (L1/L2 with L3 coordination) Incident & problem management (RCA, reduction of repeat incidents) Monitoring/Observability (Splunk, AppD, Grafana) SRE/DevOps collaboration (automation, CI/CD, resilience) Regulatory compliance & security in banking environments Team leadership across regions/time zones … across time zones to coordinate with global and regional teams. Strong understanding of banking systems and compliance with local and global regulations. Monitoring and Observability: Deploy and maintain monitoring tools (e.g., AppD, Grafana, Splunk) to track system performance, security events, and compliance metrics. Develop dashboards and alerts to proactively detect ...

Expert Solution Architect

Hiring Organisation
Finastra
Location
City of London, London, United Kingdom
/CD: Jenkins/GitHub Actions/Azure DevOps Containerization: Docker, Kubernetes, Helm IaC: Terraform OS: RedHat Enterprise Linux Familiarity with monitoring and observability tools: Prometheus , Grafana . Experience with testing frameworks and tools: JUnit , Postman . Knowledge of cloud-native architectures and migration strategies (preferably Azure). Strong documentation ...

Expert Solution Architect

Hiring Organisation
Finastra
Location
London Area, United Kingdom
/CD: Jenkins/GitHub Actions/Azure DevOps Containerization: Docker, Kubernetes, Helm IaC: Terraform OS: RedHat Enterprise Linux Familiarity with monitoring and observability tools: Prometheus , Grafana . Experience with testing frameworks and tools: JUnit , Postman . Knowledge of cloud-native architectures and migration strategies (preferably Azure). Strong documentation ...

Expert Solution Architect

Hiring Organisation
Finastra
Location
Slough, Berkshire, UK
Employment Type
Full-time
/CD: Jenkins/GitHub Actions/Azure DevOps Containerization: Docker, Kubernetes, Helm IaC: Terraform OS: RedHat Enterprise Linux Familiarity with monitoring and observability tools: Prometheus, Grafana. Experience with testing frameworks and tools: JUnit, Postman. Knowledge of cloud-native architectures and migration strategies (preferably Azure). Strong documentation skills using ...

DevOps / Platform Engineer

Hiring Organisation
Locai Labs
Location
Slough, Berkshire, UK
Employment Type
Full-time
contribute to backend development when needed. Our systems are already live and in active use, so your focus will be on reliability, performance, automation, observability, and clean, maintainable Infrastructure-as-Code. You'll also support Python and Node.js backend services. This role is centred on operational excellence, continuous improvement … Azure acceptable) Experience working with relational databases in production environments (e.g., Postgres, MySQL), including basic performance troubleshooting, migrations, backups, and access control. Familiarity with observability tools such as Prometheus, Grafana, ELK stack, or OpenTelemetry Experience with container orchestration platforms, particularly Kubernetes Ability to systematically troubleshoot and debug distributed systems Comfortable ...

DevOps / Platform Engineer

Hiring Organisation
Locai Labs
Location
City of London, London, United Kingdom
contribute to backend development when needed. Our systems are already live and in active use, so your focus will be on reliability, performance, automation, observability, and clean, maintainable Infrastructure-as-Code. You’ll also support Python and Node.js backend services. This role is centred on operational excellence, continuous improvement … Azure acceptable) Experience working with relational databases in production environments (e.g., Postgres, MySQL), including basic performance troubleshooting, migrations, backups, and access control. Familiarity with observability tools such as Prometheus, Grafana, ELK stack, or OpenTelemetry Experience with container orchestration platforms, particularly Kubernetes Ability to systematically troubleshoot and debug distributed systems Comfortable ...

DevOps / Platform Engineer

Hiring Organisation
Locai Labs
Location
London Area, United Kingdom
contribute to backend development when needed. Our systems are already live and in active use, so your focus will be on reliability, performance, automation, observability, and clean, maintainable Infrastructure-as-Code. You’ll also support Python and Node.js backend services. This role is centred on operational excellence, continuous improvement … Azure acceptable) Experience working with relational databases in production environments (e.g., Postgres, MySQL), including basic performance troubleshooting, migrations, backups, and access control. Familiarity with observability tools such as Prometheus, Grafana, ELK stack, or OpenTelemetry Experience with container orchestration platforms, particularly Kubernetes Ability to systematically troubleshoot and debug distributed systems Comfortable ...

AWS Cloud DevOps Engineer

Hiring Organisation
Apexon
Location
Birmingham, England, United Kingdom
DevOps Engineer to support the design, automation, and operation of our cloud runtime platforms. You’ll contribute to infrastructure-as-code, CI/CD, observability, and reliability efforts while learning from senior engineers and collaborating with application teams, Tech Risk, and vendor platforms. What you will do Support the design … CDK. Assist in CI/CD pipeline development and maintenance (GitLab CI/Jenkins/GitHub Actions/AWS Code Pipeline). Contribute to observability efforts including metrics, logs, traces, and alerting systems. Participate in incident response and post-incident reviews; help reduce MTTR through automation. Automate operational tasks using ...

Site Reliability Engineer

Hiring Organisation
Profile 29
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£55,000 - £65,000 per annum
performant infrastructure that underpins critical public-sector services. You’ll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. You’ll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure … PostgreSQL, Elasticsearch, and MongoDB Configure and support identity and access management (IdAM) solutions such as Keycloak Monitor system health, performance, and capacity using modern observability stacks (Prometheus, Grafana, ELK, OpenTelemetry) Champion DevSecOps practices, embedding security and compliance into every stage of delivery Automate deployment, scaling, and recovery processes to improve ...

Site Reliability Engineer - SC Cleared

Hiring Organisation
Cognizant
Location
City of London, London, United Kingdom
incl. EKS, AKS, GKE), ECS, and Azure Container Instances (ACI) — including cluster lifecycle, node pools, autoscaling, ingress, service mesh, secrets, and backup/restore. Observability : Instrument services and infra with New Relic, Grafana (incl. Loki/Tempo where applicable) and cloud‐native telemetry. Define SLIs/SLOs, build actionable dashboards … Containers & orchestration: Kubernetes administration knowledge (controllers, scheduling, ingress, autoscaling, troubleshooting) and experience with EKS/AKS/GKE and/or ECS/ACI. Observability: Practical use of New Relic and Grafana to define metrics/traces/logs, tune alerts, and drive SLOs. Scripting & automation: Proficiency in Python ...

Site Reliability Engineer - SC Cleared

Hiring Organisation
Cognizant
Location
London Area, United Kingdom
incl. EKS, AKS, GKE), ECS, and Azure Container Instances (ACI) — including cluster lifecycle, node pools, autoscaling, ingress, service mesh, secrets, and backup/restore. Observability : Instrument services and infra with New Relic, Grafana (incl. Loki/Tempo where applicable) and cloud‐native telemetry. Define SLIs/SLOs, build actionable dashboards … Containers & orchestration: Kubernetes administration knowledge (controllers, scheduling, ingress, autoscaling, troubleshooting) and experience with EKS/AKS/GKE and/or ECS/ACI. Observability: Practical use of New Relic and Grafana to define metrics/traces/logs, tune alerts, and drive SLOs. Scripting & automation: Proficiency in Python ...

Senior Specialist Engineer (SRE)

Hiring Organisation
UK Health Security Agency
Location
Birmingham, Leeds, Liverpool, London (Canary Wharf), United Kingdom
Employment Type
Permanent
Salary
£41983.00 - £52113.00 a year
identify bottlenecks with an engineering mindset. Ensure systems can handle current and future workloads through automation and capacity planning. Continuously improve services through observability, and identify ways to improve observability practices. Follow SRE principles. Guide and educate stakeholders to adopt implemented principles. Provide technical documentation for engineers. Providing training, where … production incidents, ensuring minimal downtime and quick restoration of services. Perform root cause analysis and postmortems, implementing lessons learned to prevent recurrence. Monitoring, Alerting & Observability Contribute to the design and implementation of effective monitoring and alerting systems using tools and dashboards. Improve observability of services, ensuring issues are identified ...

AWS Cloud DevOps Engineer

Hiring Organisation
Apexon
Location
United Kingdom
DevOps Engineer to support the design, automation, and operation of our cloud runtime platforms. You’ll contribute to infrastructure-as-code, CI/CD, observability, and reliability efforts while learning from senior engineers and collaborating with application teams, Tech Risk, and vendor platforms. What you will do Support the design … CDK. Assist in CI/CD pipeline development and maintenance (GitLab CI/Jenkins/GitHub Actions/AWS Code Pipeline). Contribute to observability efforts including metrics, logs, traces, and alerting systems. Participate in incident response and post-incident reviews; help reduce MTTR through automation. Automate operational tasks using ...

AWS Cloud DevOps Engineer

Hiring Organisation
Apexon
Location
Liverpool, UK
Employment Type
Full-time
DevOps Engineer to support the design, automation, and operation of our cloud runtime platforms. You'll contribute to infrastructure-as-code, CI/CD, observability, and reliability efforts while learning from senior engineers and collaborating with application teams, Tech Risk, and vendor platforms. What you will do Support the design … CDK. Assist in CI/CD pipeline development and maintenance (GitLab CI/Jenkins/GitHub Actions/AWS Code Pipeline). Contribute to observability efforts including metrics, logs, traces, and alerting systems. Participate in incident response and post-incident reviews; help reduce MTTR through automation. Automate operational tasks using ...

AWS Cloud DevOps Engineer

Hiring Organisation
Apexon
Location
Swindon, UK
Employment Type
Full-time
DevOps Engineer to support the design, automation, and operation of our cloud runtime platforms. You'll contribute to infrastructure-as-code, CI/CD, observability, and reliability efforts while learning from senior engineers and collaborating with application teams, Tech Risk, and vendor platforms. What you will do Support the design … CDK. Assist in CI/CD pipeline development and maintenance (GitLab CI/Jenkins/GitHub Actions/AWS Code Pipeline). Contribute to observability efforts including metrics, logs, traces, and alerting systems. Participate in incident response and post-incident reviews; help reduce MTTR through automation. Automate operational tasks using ...

AWS Cloud DevOps Engineer

Hiring Organisation
Apexon
Location
Edinburgh, UK
Employment Type
Full-time
DevOps Engineer to support the design, automation, and operation of our cloud runtime platforms. You'll contribute to infrastructure-as-code, CI/CD, observability, and reliability efforts while learning from senior engineers and collaborating with application teams, Tech Risk, and vendor platforms. What you will do Support the design … CDK. Assist in CI/CD pipeline development and maintenance (GitLab CI/Jenkins/GitHub Actions/AWS Code Pipeline). Contribute to observability efforts including metrics, logs, traces, and alerting systems. Participate in incident response and post-incident reviews; help reduce MTTR through automation. Automate operational tasks using ...

AWS Cloud DevOps Engineer

Hiring Organisation
Apexon
Location
Midlands, UK
Employment Type
Full-time
DevOps Engineer to support the design, automation, and operation of our cloud runtime platforms. You'll contribute to infrastructure-as-code, CI/CD, observability, and reliability efforts while learning from senior engineers and collaborating with application teams, Tech Risk, and vendor platforms. What you will do Support the design … CDK. Assist in CI/CD pipeline development and maintenance (GitLab CI/Jenkins/GitHub Actions/AWS Code Pipeline). Contribute to observability efforts including metrics, logs, traces, and alerting systems. Participate in incident response and post-incident reviews; help reduce MTTR through automation. Automate operational tasks using ...

AWS Cloud DevOps Engineer

Hiring Organisation
Apexon
Location
Glasgow, UK
Employment Type
Full-time
DevOps Engineer to support the design, automation, and operation of our cloud runtime platforms. You'll contribute to infrastructure-as-code, CI/CD, observability, and reliability efforts while learning from senior engineers and collaborating with application teams, Tech Risk, and vendor platforms. What you will do Support the design … CDK. Assist in CI/CD pipeline development and maintenance (GitLab CI/Jenkins/GitHub Actions/AWS Code Pipeline). Contribute to observability efforts including metrics, logs, traces, and alerting systems. Participate in incident response and post-incident reviews; help reduce MTTR through automation. Automate operational tasks using ...

AWS Cloud DevOps Engineer

Hiring Organisation
Apexon
Location
Leeds, UK
Employment Type
Full-time
DevOps Engineer to support the design, automation, and operation of our cloud runtime platforms. You'll contribute to infrastructure-as-code, CI/CD, observability, and reliability efforts while learning from senior engineers and collaborating with application teams, Tech Risk, and vendor platforms. What you will do Support the design … CDK. Assist in CI/CD pipeline development and maintenance (GitLab CI/Jenkins/GitHub Actions/AWS Code Pipeline). Contribute to observability efforts including metrics, logs, traces, and alerting systems. Participate in incident response and post-incident reviews; help reduce MTTR through automation. Automate operational tasks using ...

AWS Cloud DevOps Engineer

Hiring Organisation
Apexon
Location
Bristol, UK
Employment Type
Full-time
DevOps Engineer to support the design, automation, and operation of our cloud runtime platforms. You'll contribute to infrastructure-as-code, CI/CD, observability, and reliability efforts while learning from senior engineers and collaborating with application teams, Tech Risk, and vendor platforms. What you will do Support the design … CDK. Assist in CI/CD pipeline development and maintenance (GitLab CI/Jenkins/GitHub Actions/AWS Code Pipeline). Contribute to observability efforts including metrics, logs, traces, and alerting systems. Participate in incident response and post-incident reviews; help reduce MTTR through automation. Automate operational tasks using ...

AWS Cloud DevOps Engineer

Hiring Organisation
Apexon
Location
Bradford, UK
Employment Type
Full-time
DevOps Engineer to support the design, automation, and operation of our cloud runtime platforms. You'll contribute to infrastructure-as-code, CI/CD, observability, and reliability efforts while learning from senior engineers and collaborating with application teams, Tech Risk, and vendor platforms. What you will do Support the design … CDK. Assist in CI/CD pipeline development and maintenance (GitLab CI/Jenkins/GitHub Actions/AWS Code Pipeline). Contribute to observability efforts including metrics, logs, traces, and alerting systems. Participate in incident response and post-incident reviews; help reduce MTTR through automation. Automate operational tasks using ...

AWS Cloud DevOps Engineer

Hiring Organisation
Apexon
Location
Manchester, UK
Employment Type
Full-time
DevOps Engineer to support the design, automation, and operation of our cloud runtime platforms. You'll contribute to infrastructure-as-code, CI/CD, observability, and reliability efforts while learning from senior engineers and collaborating with application teams, Tech Risk, and vendor platforms. What you will do Support the design … CDK. Assist in CI/CD pipeline development and maintenance (GitLab CI/Jenkins/GitHub Actions/AWS Code Pipeline). Contribute to observability efforts including metrics, logs, traces, and alerting systems. Participate in incident response and post-incident reviews; help reduce MTTR through automation. Automate operational tasks using ...

AWS Cloud DevOps Engineer

Hiring Organisation
Apexon
Location
Coventry, UK
Employment Type
Full-time
DevOps Engineer to support the design, automation, and operation of our cloud runtime platforms. You'll contribute to infrastructure-as-code, CI/CD, observability, and reliability efforts while learning from senior engineers and collaborating with application teams, Tech Risk, and vendor platforms. What you will do Support the design … CDK. Assist in CI/CD pipeline development and maintenance (GitLab CI/Jenkins/GitHub Actions/AWS Code Pipeline). Contribute to observability efforts including metrics, logs, traces, and alerting systems. Participate in incident response and post-incident reviews; help reduce MTTR through automation. Automate operational tasks using ...

AWS Cloud DevOps Engineer

Hiring Organisation
Apexon
Location
Belfast, UK
Employment Type
Full-time
DevOps Engineer to support the design, automation, and operation of our cloud runtime platforms. You'll contribute to infrastructure-as-code, CI/CD, observability, and reliability efforts while learning from senior engineers and collaborating with application teams, Tech Risk, and vendor platforms. What you will do Support the design … CDK. Assist in CI/CD pipeline development and maintenance (GitLab CI/Jenkins/GitHub Actions/AWS Code Pipeline). Contribute to observability efforts including metrics, logs, traces, and alerting systems. Participate in incident response and post-incident reviews; help reduce MTTR through automation. Automate operational tasks using ...

AWS Cloud DevOps Engineer

Hiring Organisation
Apexon
Location
London, UK
Employment Type
Full-time
DevOps Engineer to support the design, automation, and operation of our cloud runtime platforms. You'll contribute to infrastructure-as-code, CI/CD, observability, and reliability efforts while learning from senior engineers and collaborating with application teams, Tech Risk, and vendor platforms. What you will do Support the design … CDK. Assist in CI/CD pipeline development and maintenance (GitLab CI/Jenkins/GitHub Actions/AWS Code Pipeline). Contribute to observability efforts including metrics, logs, traces, and alerting systems. Participate in incident response and post-incident reviews; help reduce MTTR through automation. Automate operational tasks using ...