Permanent Observability Job Vacancies

1 to 25 of 1,004 Permanent Observability Jobs

DevOps/Site Reliability Engineer, Junior/Mid/Senior (m/f/ )

United Kingdom
Hybrid / WFH Options
Crane Venture Partners
such as Kubernetes, Docker Swarm, or HashiCorp Nomad. Excellent problem-solving, communication, and collaboration skills. Nice to have: Experience managing distributed systems, microservices, and event-driven architectures. Knowledge of observability tools such as Prometheus, Grafana, ELK Stack, or Datadog. Experience with security best practices, monitoring, and incident response. Familiarity with DevSecOps and compliance frameworks (ISO 27001, SOC 2, GDPR). More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Monitoring & Observability Engineer

South East London, London, United Kingdom
COMPUTACENTER (UK) LIMITED
GPS). Our teams operate across the UK, Germany, France, and India, delivering complex, enterprise-grade IT solutions and consultancy across infrastructure, cloud, and modern operations. As a Monitoring & Observability Engineer, you'll work in high-impact delivery teams that support some of the worlds most well-known organisations. Youll play a key role in helping our customers achieve greater … visibility, performance, and reliability across their IT estatescontributing to their operational success through proactive insight and incident prevention. What you'll do Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues Integrate monitoring platforms … with ITSM tools (e.g. ServiceNow) and CI/CD pipelines to enable proactive alerting and resolution workflows Act as a Monitoring & Observability SME within customer delivery teams Support incident response activities and postmortems by identifying patterns, root causes, and optimisation opportunities Work collaboratively with cross-functional teams to define and implement best practices in observability and monitoring Attend customer and More ❯
Employment Type: Permanent
Posted:

Monitoring & Observability Engineer

London, United Kingdom
Computacenter AG & Co. oHG
Select how often (in days) to receive an alert: Monitoring & Observability Engineer Life on the team At Computacenter, you'll be joining a world-class team of over 1,000 skilled professionals within Group Professional Services (GPS). Our teams operate across the UK, Germany, France, and India, delivering complex, enterprise-grade IT solutions and consultancy across infrastructure, cloud, and … modern operations. As a Monitoring & Observability Engineer, you'll work in high-impact delivery teams that support some of the world's most well-known organisations. You'll play a key role in helping our customers achieve greater visibility, performance, and reliability across their IT estates-contributing to their operational success through proactive insight and incident prevention. What you'll … do Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues Integrate monitoring platforms with ITSM tools (e.g. ServiceNow) and CI/CD pipelines to enable proactive alerting and resolution workflows Act as a Monitoring More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Monitoring & Observability Engineer

Lakenheath, Suffolk, United Kingdom
Computacenter AG & Co. oHG
Select how often (in days) to receive an alert: Monitoring & Observability Engineer Life on the team At Computacenter, you'll be joining a world-class team of over 1,000 skilled professionals within Group Professional Services (GPS). Our teams operate across the UK, Germany, France, and India, delivering complex, enterprise-grade IT solutions and consultancy across infrastructure, cloud, and … modern operations. As a Monitoring & Observability Engineer, you'll work in high-impact delivery teams that support some of the world's most well-known organisations. You'll play a key role in helping our customers achieve greater visibility, performance, and reliability across their IT estates-contributing to their operational success through proactive insight and incident prevention. What you'll … do Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues Integrate monitoring platforms with ITSM tools (e.g. ServiceNow) and CI/CD pipelines to enable proactive alerting and resolution workflows Act as a Monitoring More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Delivery Engineer

United Kingdom
Hybrid / WFH Options
Sportserve
Python (or other language), Bash/Shell, YAML including any Development frameworks Extensive experience and in-depth knowledge of the Linux operating system for effective troubleshooting activities Experience with Observability tools like Grafana, Prometheus, ELK, OCI Observability We highly value ownership and initiative with capabilities to drive projects independently Dealing with changes on a daily basis in a very dynamic More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Stratospherec Ltd
one or more public cloud providers such as Azure, AWS or GCP Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar. Proven track record of maintaining highly-available and performant production environments. Ability to identify and implement effective mitigation strategies and More ❯
Employment Type: Permanent
Salary: £85000 - £90000/annum Excellent Benefits package
Posted:

Senior DevOps Engineer, Clinical Software

United Kingdom
Waters Corporation
to maintain a CI build environment capable of running automation tests for effective feedback. Assist in designing, developing and implementing automation test frameworks. Develop and improve our monitoring and observability tooling. Coach and mentorteam matesto improve their own DevOps skills and experience Research emerging tools, trends and methodologies Assist in managing checked in source code from check-in through to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

Southampton, Hampshire, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management … principles and hands-on experience with tools such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless More ❯
Employment Type: Permanent
Posted:

Site Reliability Engineer

Hampshire, England, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management … principles and hands-on experience with tools such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless More ❯
Posted:

SysOps Engineer

Eastbourne, England, United Kingdom
Hybrid / WFH Options
AxisOps
and architecture through to production and operations. Our strength lies in software delivery, supported by deep expertise in platform engineering, built on an understanding of private cloud-native infrastructure, observability, and DevSecOps. Our culture We value sharp thinking, clear communication, and teams that look out for each other. At AxisOps, our core values are: Ingenuity – solving hard problems with elegant … runtimes is welcome but not required) Maintain and evolve microservice architecture built in Python and PHP, with deployment via GitLab CI/CD and runtime orchestration via Andromeda Deliver observability using Prometheus, Grafana, and the ELK stack, supporting metrics, logs, and alerting workflows Support and maintain internal ML infrastructure and pipelines , helping ensure that our AI and data workloads run … maintain standardised developer desktop environments , supporting our engineering team’s daily tooling and dev workflow Contribute to our IoT platform , including reliable edge infrastructure, secure messaging, and data flow observability Support and maintain our private datacentre , including rack-level hardware, networking, and server fleet resilience Continuously improve security posture , covering patching, firewall maintenance, secrets handling, and backup strategy Write markdown More ❯
Posted:

DV Cleared Platform Engineer

Swindon, UK
Global Technology Solutions Ltd
the provisioning and management of systems using Infrastructure as Code (IaC) Support containerisation and orchestration technologies such as Docker and Kubernetes Monitor platform performance, availability, and security using modern observability tools Collaborate with DevOps, security, and application teams to ensure seamless and secure delivery pipelines Implement and maintain CI/CD pipelines and deployment automation Manage secure configurations, patching, and More ❯
Posted:

Senior DevOps Engineer (SC Cleared)

City of London, London, United Kingdom
Hybrid / WFH Options
Amber Labs
working in Agile teams using tools like Git , Jira , and Confluence Eligible for SC and NPPV3 clearance Desirable: Container orchestration with Kubernetes HashiCorp tools: Vault , Consul , Packer Monitoring and observability with Grafana , Prometheus , or similar Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for technology Strong problem-solving mindset More ❯
Posted:

Senior DevOps Engineer (SC Cleared)

London Area, United Kingdom
Hybrid / WFH Options
Amber Labs
working in Agile teams using tools like Git , Jira , and Confluence Eligible for SC and NPPV3 clearance Desirable: Container orchestration with Kubernetes HashiCorp tools: Vault , Consul , Packer Monitoring and observability with Grafana , Prometheus , or similar Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for technology Strong problem-solving mindset More ❯
Posted:

Junior Delivery Engineer

United Kingdom
Hybrid / WFH Options
Sportserve
operating system for effective troubleshooting activities Awareness of any cloud infrastructure principles (like AWS, GCP or OCI), understanding basic principles of secure software delivery is a plus Familiar with Observability tools like Grafana or Prometheus, understanding the importance of giving the correct visibility to our platforms and environments We highly value ownership and initiative with capabilities to drive projects independently More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Cloud Technical Architect / Data DevOps Engineer

Bristol, Gloucestershire, United Kingdom
Hewlett Packard Enterprise Development LP
etc. Infrastructure as Code and CI/CD paradigms and systems such as: Ansible, Terraform, Jenkins, Bamboo, Concourse etc. Monitoring utilising products such as: Prometheus, Grafana, ELK, filebeat etc. Observability - SRE Big Data solutions (ecosystems) and technologies such as: Apache Spark and the Hadoop Ecosystem Edge technologies e.g. NGINX, HAProxy etc. Excellent knowledge of YAML or similar languages The following More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Manual Tester (DV Security Clearance)

Basingstoke, Hampshire, South East
CGI
Manual Tester (DV Security Clearance) Position Description Are you an experienced Test Analyst with a background in secure or classified programmes, ready to contribute to projects of national importance? Step into a role where you'll challenge the complex to More ❯
Employment Type: Permanent
Posted:

Senior Site Reliability Engineer - AWS Kubernetes

London, England, United Kingdom
SGI
and firewalls. • Experience with load balancers (F5, HAProxy, Nginx) and network monitoring tools. Experience in DNS management and troubleshooting. Experience in network security best practices. Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disaster … performance. Experience in tools like df, du, lsblk, and fdisk for managing and troubleshooting file systems and disk partitions. Familiarity with tools like Prometheus and Grafana for monitoring and observability More ❯
Posted:

Lead Site Reliability Engineer

Belgium
Tenth Revolution Group
engineering teams to deploy faster and more confidently-without compromising stability or uptime. As the SRE Lead, you'll mentor a growing team of SREs, drive best practices in observability, automation, and incident management, and collaborate cross-functionally to ensure a seamless experience for both our internal teams and customers. What You'll Be Doing: Leadership & Strategy -Lead and grow … Proven experience in an SRE or DevOps leadership role. -Deep understanding of networking, containers (Docker, Kubernetes), and cloud infrastructure (AWS/GCP/Azure). -Strong skills in monitoring, observability, and alerting systems (Prometheus, Grafana, Datadog, etc.). -Proficiency with infrastructure-as-code tools like Terraform or Pulumi. -Experience with CI/CD pipelines and GitOps practices. -Excellent communication and More ❯
Employment Type: Permanent
Salary: EUR Annual
Posted:

Senior Platform Developer

Edinburgh, United Kingdom
Hybrid / WFH Options
Registers of Scotland
WAF, CloudFront, API GW, AWS Organizations, S3, ECS, EKS, Route 53, ELBs, OpenShift, Kubernetes, Docker Languages: TypeScript, Python Security & Scanning: AWS Guardrails, Checkov, Prisma Cloud, OSV Scanner, SonarQube, Renovate Observability & Logging: CloudWatch, OpenSearch Operating System Management: RedHat Satellite, AMI lifecycle management, Ubuntu Landscape Testing Tools: Pytest, Jest, Cypress APIs/Microservices: RESTful APIs, API Gateway, containerised services Version Control: GitLab … to as Senior DevOps Engineer. On a typical day you will Design, build, and maintain scalable, high-quality software and platform systems Implement and manage CI/CD pipelines, observability, security automation, automated testing, and engineering standards Lead feature development from concept to production with focus on quality and performance Troubleshoot issues, ensuring resilience, reliability, and minimal user disruption Contribute More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Solace Messaging Administrator

City of London, London, United Kingdom
BGC Group
built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging More ❯
Posted:

Solace Messaging Administrator

London Area, United Kingdom
BGC Group
built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging More ❯
Posted:

Lead Site Reliability Engineer

Caldecotte, Milton Keynes, Buckinghamshire, England, United Kingdom
Connells Group HQ
for someone who has: Strong .NET framework knowledge (C#,ASP.NET Core etc..) Expertise in Windows Server administration Database administration (SQL Server primarily) Ability to instrument and consume monitoring and observability tools (Application Insights, Prometheus, Grafana) Experience using PowerShell, Azure CLI, and Bash for automation tasks Previous experience with Azure DevOps, Jenkins, GitHub Actions, or similar tools Containerisation and orchestration (Docker More ❯
Employment Type: Full-Time
Salary: Competitive salary
Posted:

Senior Azure Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Nordcloud group
to L3 networking Programming languages, such as C#, Python, Perl, Java, C++ CICD tools such as Azure DevOps, GitHub Actions, Gitlab, Jenkins, TeamCity Scripting languages such as PowerShell, bash Observability/Monitoring: Prometheus, Grafana, Splunk Containerisation tools such as Docker, K8S, OpenShift, EC, containers Hosting technologies such as IIS, nginx, Apache, App Service, LightSail Analytical and creative approach to problem More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Architect - Switzerland

Buchs, St. Gallen, United Kingdom
Proactive Global
scaffolding Collaborate with engineering teams, QA, DevOps, and product managers to deliver integrated solutions Mentor engineers in architectural thinking and AI-assisted development Ensure architectural alignment across systems with observability using Prometheus, Grafana, ELK Stack Required Skills & Qualifications: Master's degree in Computer Science, Software Engineering, or related field 8+ years of software engineering experience, with 3+ years in architectural More ❯
Employment Type: Permanent
Salary: £138118 - £164016/annum
Posted:

Platform(DevOps) Engineer

United Kingdom
Hybrid / WFH Options
Sportserve
Linux - RHEL and debian based flavours Cloud - Oracle Cloud CI/CD - gitlab-ci, jenkins, ansible, terraform, Helm, Fluxcd Web servers - nginx Caching - Redis Messaging queues - Kafka Monitoring and Observability - ELK, Grafana, Prometheus, Thanos Traffic Management - Haproxy, Keepalived, Cloud Load Balancing, etc Scripting - bash, python Virtualisation and Orchestration - docker, kubernetes Databases - Cloud Managed MySql Desireable OS Administration - Linux - debian based More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
Observability
10th Percentile
£57,500
25th Percentile
£65,000
Median
£80,000
75th Percentile
£97,500
90th Percentile
£120,000