Observability Jobs in Manchester

1 to 25 of 106 Observability Jobs in Manchester

Senior DevOps Engineer

Manchester, Lancashire, United Kingdom
Hybrid / WFH Options
Arm Limited
infrastructure. "Nice To Have" Skills and Experience: Experience in a GitOps solution such as ArgoCD, Flux or Fleet. Implementation of the Security Development Lifecycle (SDL) in infrastructure. Monitoring and observability using Prometheus and Grafana, ELK stack or equivalent. Use of Kubernetes management systems such as Rancher. Familiarity with open source project development cycles and contribution processes, particularly around CI/…
Employment Type: Permanent
Salary: GBP Annual

Infrastructure Engineer

Bury, England, United Kingdom
Stockford Recruitment
DevOps & Automation: Create and manage automation pipelines for deployments. Implement Infrastructure as Code (IaC) using tools such as Terraform or Ansible. Monitor and enhance system performance using logging and observability tools. Develop automation solutions for provisioning, scaling, and maintenance. Support containerization efforts with Docker/Kubernetes where applicable. Networking & System Administration: Configure and maintain network infrastructure, including firewalls, VLANs, and…

Senior Site Reliability Engineer

Manchester, United Kingdom
S&P Global, Inc
multiple stakeholders including development teams to implement and maintain reliable and scalable systems while adhering to industry best practices and security standards. Responsibilities and Impact: Design, implement, and maintain observability solutions to track system health and performance. Analyze observability data to identify and troubleshoot potential issues proactively. Develop and implement alerts and notifications for critical events. Collaborate with development teams … in Computer Science, Information Technology, or a related field. 5+ years of experience as a Site Reliability Engineer or equivalent in a similar role. Proficient in application and infrastructure observability, Splunk OpenTelemetry preferred. Experienced in production environments running in AWS. Comfortable with Infrastructure as Code, Terraform is preferred. Comfortable with CI/CD pipelines such as GitHub Actions, Azure DevOps…
Employment Type: Permanent
Salary: GBP Annual

Platform Engineer

Manchester, England, United Kingdom
Hybrid / WFH Options
Magentus Group
to implement robust solutions that improve system performance, security, and developer productivity. You will be responsible for maintaining and evolving platform services, adopting best practices in infrastructure as code, observability, and DevOps methodologies. Key Responsibilities of the role: Platform Development & Automation: Design, develop, and maintain cloud-native infrastructure and platform services. Automate provisioning, scaling, and monitoring of infrastructure and application … reliability. Implement Infrastructure as Code (IaC) using tools such as CDK, Terraform or CloudFormation. Reliability & Security: Ensure platform reliability, scalability, and security through best practices and proactive monitoring. Implement observability solutions including logging, metrics, and distributed tracing. Support incident response and post-mortem analysis, driving continuous improvements. Collaborate with security teams to ensure compliance with security and regulatory requirements. Collaboration … tools (GitHub Actions, GitLab CI, or similar). Experience with scripting or programming languages (Python, Go, Bash, etc.). Understanding of networking, security principles, and best practices. Knowledge of observability tools such as Datadog, Prometheus, Grafana, etc. Desired Attributes: Strong problem-solving skills with a proactive approach to improving systems and processes. Excellent communication and collaboration skills, able to work…

Senior AWS Engineer

Manchester, North West
Hybrid / WFH Options
BAE Systems
or DevOps. Expertise in microservices and API design. Docker, and container runtime platforms such as Kubernetes, EKS, ECS, etc. Strong understanding of operational concepts on AWS, particularly monitoring and observability, FinOps. Utilising CI/CD tools, such as Bamboo, Jenkins, TeamCity, Bitbucket, in order to streamline delivery of new features and fixes. Continual testing of code using Automated Testing Frameworks…
Employment Type: Permanent

Senior MLOps/GenAI Infrastructure Engineer

Salford, England, United Kingdom
Hybrid / WFH Options
BBC Group and Public Services
as-Code with AWS CDK, CloudFormation to provision and manage cloud environments. Build and maintain CI/CD pipelines using GitHub Actions, AWS CodePipeline, CodeBuild, Jenkins. Integrate monitoring and observability tools such as AWS CloudWatch, Prometheus, Grafana for infrastructure and model health tracking. Ensure software quality through Test-Driven Development (TDD), unit testing frameworks (e.g., pytest, unittest), and automated integration…

AWS Engineer

Manchester, United Kingdom
Hybrid / WFH Options
BAE Systems (New)
or DevOps. Expertise in microservices and API design. Docker, and container runtime platforms such as Kubernetes, EKS, ECS, etc. Strong understanding of operational concepts on AWS, particularly monitoring and observability, FinOps. Utilising CI/CD tools, such as Bamboo, Jenkins, TeamCity, Bitbucket, in order to streamline delivery of new features and fixes. Continual testing of code using Automated Testing Frameworks. A…
Employment Type: Permanent
Salary: GBP Annual

Automation Test Lead

Manchester, England, United Kingdom
Hybrid / WFH Options
N BROWN
/community of practice and mentor QA engineers across disciplines. Delivery Partnership: Collaborate with product, engineering, and platform leads to balance quality, speed, and risk - driving quality gates, test observability, and release readiness. Metrics & Governance: Define meaningful test/quality metrics, monitor performance, and communicate insights to senior stakeholders. What skills and experience will you have? Proven experience leading test…

Senior Site Reliability Engineer

Manchester, United Kingdom
Hybrid / WFH Options
Embarcaderomediagroup
ll sit at the heart of our engineering operations, bringing together SRE principles and modern platform engineering practices. This includes combining principles of SRE - such as service-level reliability, observability, incident response - with platform engineering practices like GitOps, Infrastructure as Code, DevSecOps automation, and self-service enablement, to help development teams ship faster, safer, and more cost-efficiently. What you … ll be doing: Designing and operating highly reliable, scalable, and secure Azure-based platforms. Applying SRE principles like SLOs, observability, and incident management to drive service reliability. Building Infrastructure as Code using Terraform (v1.7+) and GitOps workflows. Enabling teams through platform tools, reusable Terraform modules, and self-service infrastructure. Enhancing CI/CD pipelines (Azure DevOps, YAML-based) with security … knowledge (AKS, Functions, SQL, Cosmos DB, etc.). Strong Infrastructure as Code skills with Terraform (v1.7+). Experience with CI/CD pipelines, GitOps, and automation tools (PowerShell, Bash). Familiarity with observability and incident tools like Datadog, ELK, and synthetic monitoring. Solid understanding of networking (TCP/IP, Load Balancing, DNS, Routing). Good knowledge of DevSecOps practices - including security scanning, IAM, and…
Employment Type: Permanent
Salary: GBP Annual

Loan IQ DevOps Engineer

Manchester Area, United Kingdom
Hybrid / WFH Options
Revolent Group
related processes like data migrations and environment setup. Preferred (Nice to Have): Banking/Financial Services knowledge, especially around wholesale lending and Loan IQ. Experience with monitoring and observability tools such as APPD, ELK Stack, or Grafana. Understanding of DevSecOps principles, including vulnerability scanning, secrets management, and compliance automation. Further experience with CI/CD integration and pipeline automation…

DevOps Engineer - GammaLabs

Manchester, United Kingdom
Hybrid / WFH Options
Gamma Communications plc
position will align to a discipline where you will be expected to build and support solutions aligned with SDLC principles, providing technical excellence with a focus on scripting and observability coupled with a security mindset. What will you be doing day-to-day? Automation and Orchestration: Streamline the delivery and support processes by leveraging automation and IaC principles. Support and…
Employment Type: Permanent
Salary: GBP Annual

Principal MLOps/GenAI Infrastructure Engineer

Salford, Manchester, United Kingdom
Hybrid / WFH Options
BBC Group and Public Services
/CD pipelines using GitHub Actions, AWS CodePipeline, Jenkins, and other tools, with an emphasis on reliability, reusability, and performance. Contribute to the design and integration of monitoring and observability solutions (CloudWatch, Prometheus, Grafana) to ensure infrastructure and model health. Champion software engineering excellence through Test-Driven Development (TDD), rigorous test automation, and continuous quality assurance practices. Support architectural decisions…
Employment Type: Permanent
Salary: GBP Annual

CoE Lead - Observability & Tooling

Bury, England, United Kingdom
JD GROUP
The CoE Lead - Observability & Tools at JD Sports Fashion Plc is a critical, hands-on technical role focused on designing, building, and maintaining the company's Observability platform. This role ensures that our technology platforms operate efficiently and reliably, providing early insights for Engineering, Service Reliability, Service Delivery, and DevOps teams. The CoE Lead will manage the contract with third-party … performance indicators (KPIs). The position involves a 75% focus on the design of frameworks and a 25% focus on implementation and adoption. · Job Title – Centre Of Excellence Lead - Observability & Tooling · Location – BL9 8RR · Working rota – Monday to Friday · Working hours – 40. What You'll Be Doing: We are looking for an experienced CoE Lead to design, build, and maintain our … Observability platform. The CoE Lead will work closely with DevOps, Engineering, Service Reliability, and Service Delivery teams to continuously improve our Observability capabilities. This role is a technical, hands-on position with a 75% focus on framework design and 25% on implementation and adoption. You will contribute to pipeline design, enabling observability from the first deployment in test environments and…

Site Reliability Engineer

Manchester, England, United Kingdom
Hybrid / WFH Options
Couchbase
Reliability Engineers are hybrid software and systems engineers. They are the glue holding things together, whether that’s infrastructure/platform, tooling support for our cloud business or managing Observability posture for Couchbase. This role is on the Observability team, which is responsible for maintaining Reliability, Availability and Serviceability for the entire Couchbase … You will have an immediate impact on the day-to-day efficiency of cloud operations and an ongoing impact on growth. Responsibilities: Develop/maintain software features in the Observability stack, which includes the metrics pipeline, alerting, logging and notifications. Create/maintain monitoring dashboards that give insight into customer cluster health. Develop control plane features requiring observability needs. High … to identify and solve issues before they affect business productivity. Roll up your sleeves to be a full stack engineer as we build end-to-end software solutions in the Observability domain. Requirements: 2+ years' experience as a software developer. Proficiency with programming and scripting languages like Go, Python, Java, or Ruby. Strong ability to write code, understands basic DSA concepts…

Cloud Platform Lead

Stockport, England, United Kingdom
JR United Kingdom
AWS in a production environment. Expertise in Kubernetes, including AKS, EKS, containerization, and Helm. Proven ability to meet and maintain SOC 2 or equivalent compliance. Strong background in automation, observability, and GitOps workflows. Comfortable using AI coding tools like GitHub Copilot, Cursor, or Claude to enhance delivery. Bonus if you have experience supporting hybrid or disconnected deployment environments or working … Be Using Cloud: Azure (including AKS, API Management, and DevOps Pipelines) and AWS (including EKS, Lambda, and CloudFormation). Infrastructure as Code and GitOps: Terraform, Bicep, Pulumi, ArgoCD, and FluxCD. Observability: Prometheus, Grafana, OpenTelemetry, and Datadog. Security and Compliance: HashiCorp Vault, Azure Key Vault, AWS KMS, OPA Gatekeeper, and Drata or similar. Interested in exploring this further? This is a high…

Cloud Platform Lead

Manchester, England, United Kingdom
JR United Kingdom
AWS in a production environment. Expertise in Kubernetes, including AKS, EKS, containerization, and Helm. Proven ability to meet and maintain SOC 2 or equivalent compliance. Strong background in automation, observability, and GitOps workflows. Comfortable using AI coding tools like GitHub Copilot, Cursor, or Claude to enhance delivery. Bonus if you have experience supporting hybrid or disconnected deployment environments or working … Be Using Cloud: Azure (including AKS, API Management, and DevOps Pipelines) and AWS (including EKS, Lambda, and CloudFormation). Infrastructure as Code and GitOps: Terraform, Bicep, Pulumi, ArgoCD, and FluxCD. Observability: Prometheus, Grafana, OpenTelemetry, and Datadog. Security and Compliance: HashiCorp Vault, Azure Key Vault, AWS KMS, OPA Gatekeeper, and Drata or similar. Interested in exploring this further? This is a high…

Senior Java Developer

Manchester, North West, United Kingdom
Hybrid / WFH Options
Halian Technology Limited
in the team. Contribute to solution architecture and strategic technical direction. Build, integrate, and maintain REST APIs and backend services. Champion best practices in software quality, CI/CD, observability, and DevOps. Collaborate with cross-functional teams including Product, QA, and DevOps. Optionally take on people management responsibilities for engineers. Stay updated with emerging backend and cloud technologies. Key Skills…
Employment Type: Permanent, Work From Home
Salary: £90,000

Senior Software Engineer – Backend (Remote)

Bury, Greater Manchester, United Kingdom
Hybrid / WFH Options
Zettafleet
Cloud-native technologies: Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to…

Senior Software Engineer – Backend (Remote)

Altrincham, Greater Manchester, United Kingdom
Hybrid / WFH Options
Zettafleet
Cloud-native technologies: Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to…

Senior Software Engineer – Backend (Remote)

Leigh, Greater Manchester, United Kingdom
Hybrid / WFH Options
Zettafleet
Cloud-native technologies: Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to…

Senior Software Engineer – Backend (Remote)

Bolton, Greater Manchester, United Kingdom
Hybrid / WFH Options
Zettafleet
Cloud-native technologies: Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to…

Senior Software Engineer – Backend (Remote)

Ashton-Under-Lyne, Greater Manchester, United Kingdom
Hybrid / WFH Options
Zettafleet
Cloud-native technologies: Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to…

Senior Software Engineer

Manchester, England, United Kingdom
Hybrid / WFH Options
Lloyds Banking Group
Strong collaboration skills, with the ability to influence and align teams on a shared vision. Knowledge of DevOps practices and tools CI/CD pipelines. Knowledge of Monitoring and Observability tooling. Experience with ITIL processes and incident management. Ability to triage and prioritise incidents based on severity, ensuring timely resolution. Any experience of these would be really useful: Expertise in…

Remote Senior Software Engineer - Disney+

Manchester, Lancashire, United Kingdom
Hybrid / WFH Options
WorksHub
the infrastructure and deployment of those applications. We are actively expanding our Manchester-born SRE function, which aims to advance our knowledge and innovation globally in areas such as Observability, Reliability and Availability. We have the autonomy to choose the technologies and processes that help us achieve our objectives, so each team leverages the technology that fits their needs best.
Employment Type: Permanent
Salary: GBP Annual

ML Ops Lead

Manchester, England, United Kingdom
THG Ingenuity Ltd
deployment. Design and maintain scalable, reliable infrastructure, enabling seamless integration of machine learning models into production through automated deployment, monitoring, and optimisation pipelines. Implement and manage monitoring, logging, and observability tools, ensuring deployed models and infrastructure are stable, performant, and cost-efficient. Stay ahead of the curve on emerging AI/ML trends, identifying and integrating relevant technologies to ensure…

Observability Salary Percentiles, Manchester
10th Percentile: £54,500
25th Percentile: £64,285
Median: £75,000
75th Percentile: £87,500
90th Percentile: £146,300