Permanent Observability Jobs in the UK

1 to 25 of 600 Permanent Observability Jobs in the UK

Software Engineer - FTC

Oxford, Oxfordshire, United Kingdom
Hybrid / WFH Options
Nominet
control (Git) and testing practices (integration, automation). Problem-solving, collaboration, and growth mindset. Nice to have: Containerisation and orchestration (Docker, Kubernetes). Infrastructure as Code (Terraform, Ansible). Observability tools (Prometheus, Grafana, Databricks). What To Expect Next: 1st stage: Introduction call with a member of the TA team (30 mins) 2nd stage: Hiring manager interview (60 mins) What More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Stratospherec Ltd
and also with another public cloud provider such as AWS or GCP Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar. Proven track record of maintaining highly-available and performant production environments. Ability to identify and implement effective mitigation strategies and More ❯
Employment Type: Permanent
Salary: £80000 - £85000/annum Excellent Benefits package
Posted:

Reliability Engineer

London Area, United Kingdom
BGC Group
built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging More ❯
Posted:

Reliability Engineer

City of London, London, United Kingdom
BGC Group
built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging More ❯
Posted:

Reliability Engineer

slough, south east england, united kingdom
BGC Group
built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging More ❯
Posted:

Reliability Engineer

london, south east england, united kingdom
BGC Group
built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging More ❯
Posted:

Reliability Engineer

london (city of london), south east england, united kingdom
BGC Group
built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging More ❯
Posted:

Senior Development Enablement Engineer

Edinburgh, Midlothian, United Kingdom
Hybrid / WFH Options
Aberdeen Group
internal workshops, brown bags, or tech talks to share knowledge and promote adoption of tools and practices. About the Candidate The ideal candidate will possess the following: Experience with observability tools (e.g., Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security first mindset. Strong programming skills in languages such as .Net, JavaScript, Python or similar. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Platform Engineer

Manchester, England, United Kingdom
Hybrid / WFH Options
Suits Me
implementing AWS infrastructure and services using IaC (e.g. Terraform, CDK) Owning and improving CI/CD pipelines (e.g. GitHub Actions, Jenkins) to streamline secure, automated deployments Building and managing observability tooling (e.g. CloudWatch, Grafana, OpenTelemetry) for proactive system monitoring and alerting Developing event-driven containerised and serverless systems using Lambda, ECS and EKS Championing reliability and security, embedding best practices More ❯
Posted:

Platform Engineer

warrington, cheshire, north west england, united kingdom
Hybrid / WFH Options
Suits Me
implementing AWS infrastructure and services using IaC (e.g. Terraform, CDK) Owning and improving CI/CD pipelines (e.g. GitHub Actions, Jenkins) to streamline secure, automated deployments Building and managing observability tooling (e.g. CloudWatch, Grafana, OpenTelemetry) for proactive system monitoring and alerting Developing event-driven containerised and serverless systems using Lambda, ECS and EKS Championing reliability and security, embedding best practices More ❯
Posted:

Platform Engineer

bolton, greater manchester, north west england, united kingdom
Hybrid / WFH Options
Suits Me
implementing AWS infrastructure and services using IaC (e.g. Terraform, CDK) Owning and improving CI/CD pipelines (e.g. GitHub Actions, Jenkins) to streamline secure, automated deployments Building and managing observability tooling (e.g. CloudWatch, Grafana, OpenTelemetry) for proactive system monitoring and alerting Developing event-driven containerised and serverless systems using Lambda, ECS and EKS Championing reliability and security, embedding best practices More ❯
Posted:

Development Enablement Engineer

Edinburgh, Midlothian, United Kingdom
Hybrid / WFH Options
Aberdeen
internal workshops, brown bags, or tech talks to share knowledge and promote adoption of tools and practices. About the Candidate The ideal candidate will possess the following: Experience with observability tools (eg, Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security first mindset. Strong programming skills in languages such as .Net, JavaScript, Python or similar. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

DevOps/Platform Engineer

United Kingdom
iVedha Inc
AI-enhanced automation. Build and maintain CI/CD (Jenkins, GitLab CI, GitHub Actions, ArgoCD). Cloud infrastructure (AWS, Azure, GCP), container orchestration (Kubernetes, Docker). Logging, monitoring, and observability (Prometheus, Grafana, ELK/EFK), including AI-driven log analysis and incident prediction. Experience supporting MLOps: deploying ML workflows, ensuring model traceability and compliance. Use of AI assistants and workflow More ❯
Posted:

Cloud DevOps Engineer

Manchester, North West, United Kingdom
Frontier Resourcing Ltd
as Terraform or CloudFormation. Implement and manage CI/CD pipelines , enabling continuous integration and deployment of mission-critical applications. Monitor and optimise system performance, availability, and security, applying observability best practices. Collaborate in an Agile environment, engaging with stakeholders to gather requirements and deliver iterative improvements. This role allows you to apply your expertise to challenging problems while shaping More ❯
Employment Type: Permanent
Posted:

AWS Platform Engineer

Leeds, West Yorkshire, Yorkshire, United Kingdom
Hybrid / WFH Options
Fruition Group
DynamoDB, S3, IAM, and RDS. Understanding of DevOps practices, including CI/CD pipelines and automation. Strong knowledge of cloud security best practices, IAM policies, and networking. Experience with observability tools like CloudWatch, Prometheus, or Grafana. Preferred: Experience mentoring junior team members and promoting DevOps practices. Familiarity with multi-cloud environments (e.g., GCP, Azure). Knowledge of database performance optimisation. More ❯
Employment Type: Permanent, Work From Home
Salary: £75,000
Posted:

DevOps Engineer

Derbyshire, Burton upon Trent, Staffordshire, United Kingdom
Amtis Professional Ltd
CloudFormation or ARM templates Scripting & Automation - Proficient in PowerShell, Bash, or Python Infrastructure as Code (IaC) - Hands-on experience with Terraform, Bicep, or ARM Certified: Terraform Associate preferred Monitoring & Observability - Familiarity with tools like Azure Monitor, AWS CloudWatch, Prometheus, Grafana Security & Compliance - Strong understanding of IAM, cloud security, compliance frameworks Cloud Platform Expertise: Proven experience with AWS and Azure cloud More ❯
Employment Type: Permanent
Salary: £60000 - £65000/annum Bonus + Benefits
Posted:

DevOps Engineer

Burton-On-Trent, Staffordshire, West Midlands, United Kingdom
Amtis Professional Ltd
CloudFormation or ARM templates Scripting & Automation - Proficient in PowerShell, Bash, or Python Infrastructure as Code (IaC) - Hands-on experience with Terraform, Bicep, or ARM Certified: Terraform Associate preferred Monitoring & Observability - Familiarity with tools like Azure Monitor, AWS CloudWatch, Prometheus, Grafana Security & Compliance - Strong understanding of IAM, cloud security, compliance frameworks Cloud Platform Expertise: Proven experience with AWS and Azure cloud More ❯
Employment Type: Permanent
Salary: £65,000
Posted:

Senior MLOps Engineer

London Area, United Kingdom
Humanoid
and core infrastructure - from development and deployment to monitoring and continuous improvement. Build and maintain robust CI/CD pipelines for both software and ML workflows. Ensure reliability, scalability, observability, and security of production systems and ML infrastructure. Automate deployment, orchestration, and environment management using modern DevOps tooling. Collaborate closely with software engineers, data scientists, and product teams to bring More ❯
Posted:

Senior MLOps Engineer

City of London, London, United Kingdom
Humanoid
and core infrastructure - from development and deployment to monitoring and continuous improvement. Build and maintain robust CI/CD pipelines for both software and ML workflows. Ensure reliability, scalability, observability, and security of production systems and ML infrastructure. Automate deployment, orchestration, and environment management using modern DevOps tooling. Collaborate closely with software engineers, data scientists, and product teams to bring More ❯
Posted:

Senior MLOps Engineer

slough, south east england, united kingdom
Humanoid
and core infrastructure - from development and deployment to monitoring and continuous improvement. Build and maintain robust CI/CD pipelines for both software and ML workflows. Ensure reliability, scalability, observability, and security of production systems and ML infrastructure. Automate deployment, orchestration, and environment management using modern DevOps tooling. Collaborate closely with software engineers, data scientists, and product teams to bring More ❯
Posted:

Senior MLOps Engineer

london, south east england, united kingdom
Humanoid
and core infrastructure - from development and deployment to monitoring and continuous improvement. Build and maintain robust CI/CD pipelines for both software and ML workflows. Ensure reliability, scalability, observability, and security of production systems and ML infrastructure. Automate deployment, orchestration, and environment management using modern DevOps tooling. Collaborate closely with software engineers, data scientists, and product teams to bring More ❯
Posted:

Senior MLOps Engineer

london (city of london), south east england, united kingdom
Humanoid
and core infrastructure - from development and deployment to monitoring and continuous improvement. Build and maintain robust CI/CD pipelines for both software and ML workflows. Ensure reliability, scalability, observability, and security of production systems and ML infrastructure. Automate deployment, orchestration, and environment management using modern DevOps tooling. Collaborate closely with software engineers, data scientists, and product teams to bring More ❯
Posted:

Software Engineer

Cheltenham, England, United Kingdom
Hybrid / WFH Options
Argo DevOps Solutions Ltd
BDD approaches (e.g., Cucumber, Gherkin) for test automation Containerisation & Microservices Container Technologies: Practical understanding of Docker or equivalent solutions Microservice Patterns: Experience architecting microservice-based systems with built-in observability and security Cloud Services & Environments Cloud Providers: Demonstrable experience with AWS or Azure Security & Configuration: Ability to build, configure, and secure cloud environments effectively Security & CI/CD Security Integration More ❯
Posted:

Software Engineer

gloucester, south west england, united kingdom
Hybrid / WFH Options
Argo DevOps Solutions Ltd
BDD approaches (e.g., Cucumber, Gherkin) for test automation Containerisation & Microservices Container Technologies: Practical understanding of Docker or equivalent solutions Microservice Patterns: Experience architecting microservice-based systems with built-in observability and security Cloud Services & Environments Cloud Providers: Demonstrable experience with AWS or Azure Security & Configuration: Ability to build, configure, and secure cloud environments effectively Security & CI/CD Security Integration More ❯
Posted:

Platform Engineer

City of London, London, United Kingdom
REVYBE IT RECRUITMENT LIMITED
youllhelpshapethenextgenerationoftheirAWS-basedinfrastructureanddevelopertooling. Thisisanopportunitytohaverealtechnicalinfluenceina regulated,cloud-nativeenvironment ,drivingbestpracticesacrossDevOps,infrastructure,andplatformengineering. TechStack Cloud: AWS(EC2,RDS,S3,IAM,CloudWatch,Lambda) InfrastructureasCode: Terraform Containerisation&Orchestration: Docker,Kubernetes(EKS),Helm ConfigurationManagement: Ansible Monitoring&Observability: Grafana,Prometheus CI/CD: GitHubActions Automation&Scripting: Python,Bash,GoorJava WhatWereLookingFor Provenexperiencerunning AWScloudinfrastructure inaproductionorregulated(financial)environment. Hands-onexperiencemanaging Kubernetesclusters (preferablyEKS). Strongunderstandingof InfrastructureasCode usingTerraform. Familiaritywith monitoringandobservability stackssuchasPrometheusandGrafana. Experiencebuildingandmaintaining CI More ❯
Employment Type: Permanent
Salary: £80,000
Posted:
Observability
10th Percentile
£57,500
25th Percentile
£67,500
Median
£80,000
75th Percentile
£100,938
90th Percentile
£130,000