Crewe, Cheshire, United Kingdom Hybrid / WFH Options
Manchester Digital
platform security, reliability, and performance across systems deployed in Canada, the UK, and AWS cloud environments. Contribute to key projects, platform optimizations, and ongoing maintenance initiatives. Help drive scalability, observability, and operational excellence. If you're passionate about infrastructure, cloud, and systems engineering, and want to help shape the future of mobility, we want to hear from you! Requirements We … configurations (Azure AD, Ory, Cognito, Firebase) - Understanding of Site Reliability Engineering and key concepts - Proficient in Infrastructure as Code pipeline deployments and pipeline version control within Terraform or CloudFormation. - Observability Systems, e.g., Nagios, New Relic - Able to troubleshoot and work under pressure, and meet deadlines. - Previous experience in a cloud engineering role. - AWS certified as SysOps Administrator/Solutions Architect/… understanding of Infrastructure as Code principles and related tech such as Terraform or CloudFormation - Enhanced experience of AWS cloud technologies, e.g., ECS, EC2, VPC, Lambda, CFS. Ideally AWS certified. - Observability Systems, e.g., New Relic, CloudWatch, SquadCast - ITIL Qualified or awareness of the framework. Bonus Qualifications: - Experience with Linux system administration and troubleshooting. - Basic knowledge of AWS cloud technologies such as More ❯
Manchester Area, United Kingdom Hybrid / WFH Options
Revolent Group
related processes like data migrations and environment setup. ✅ Preferred (Nice to Have): Banking/Financial Services knowledge, especially around wholesale lending and Loan IQ. Experience with monitoring and observability tools such as APPD, ELK Stack, or Grafana. Understanding of DevSecOps principles, including vulnerability scanning, secrets management, and compliance automation. Further experience with CI/CD integration and pipeline automation More ❯
this role, you will assist in upgrading the Elastic DP estate to Kubernetes, moving away from obsolete technology (Cloudera), upgrading to RHEL 8, and contributing to improving stability and observability of the platform. You will provide advanced analytics tooling and services for modeling analytics, working across continuous integration, development, build, and deployment using automation and cloud technologies to support the More ❯
position will align to a discipline where you will be expected to build and support solutions aligned with SDLC principles, providing technical excellence with a focus on scripting and observability coupled with a security mindset. What will you be doing day-to-day? Automation and Orchestration: Streamline the delivery and support processes by leveraging automation and IaC principles. Support and More ❯
Salford, Manchester, United Kingdom Hybrid / WFH Options
BBC Group and Public Services
/CD pipelines using GitHub Actions, AWS CodePipeline, Jenkins, and other tools, with an emphasis on reliability, reusability, and performance. Contribute to the design and integration of monitoring and observability solutions (CloudWatch, Prometheus, Grafana) to ensure infrastructure and model health. Champion software engineering excellence through Test-Driven Development (TDD), rigorous test automation, and continuous quality assurance practices. Support architectural decisions More ❯
Newcastle Upon Tyne, Tyne And Wear, United Kingdom Hybrid / WFH Options
BBC Group and Public Services
/CD pipelines using GitHub Actions, AWS CodePipeline, Jenkins, and other tools, with an emphasis on reliability, reusability, and performance. Contribute to the design and integration of monitoring and observability solutions (CloudWatch, Prometheus, Grafana) to ensure infrastructure and model health. Champion software engineering excellence through Test-Driven Development (TDD), rigorous test automation, and continuous quality assurance practices. Support architectural decisions More ❯
The CoE Lead - Observability & Tools at JD Sports Fashion Plc is a critical, hands-on technical role focused on designing, building, and maintaining the company's Observability platform. This role ensures that our technology platforms operate efficiently and reliably, providing early insights for Engineering, Service Reliability, Service Delivery, and DevOps teams. The CoE Lead will manage the contract with third-party … performance indicators (KPIs). The position involves a 75% focus on the design of frameworks and a 25% focus on implementation and adoption. · Job Title – Centre Of Excellence Lead - Observability & Tooling · Location – BL9 8RR · Working rota – Monday to Friday · Working hours – 40 What You'll Be Doing: We are looking for an experienced CoE Lead to design, build, and maintain our … Observability platform. The CoE Lead will work closely with DevOps, Engineering, Service Reliability, and Service Delivery teams to continuously improve our Observability capabilities. This role is a technical, hands-on position with a 75% focus on framework design and 25% on implementation and adoption. You will contribute to pipeline design, enabling observability from the first deployment in test environments and More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Couchbase
Reliability Engineers are hybrid software and systems engineers. They are the glue holding things together, whether that’s infrastructure/platform, tooling support for our cloud business or managing Observability posture for Couchbase. This role sits in the Observability team, which is responsible for maintaining Reliability, Availability and Serviceability for the entire Couchbase … You will have an immediate impact on the day-to-day efficiency of cloud operations and an ongoing impact on growth. Responsibilities Develop/maintain software features in the Observability stack, which includes metrics pipeline, alerting, logging and notifications Create/maintain monitoring dashboards that give insight into customer cluster health Develop control plane features requiring observability needs High … to identify and solve issues before they affect business productivity Roll up your sleeves to be a full stack engineer as we build end-to-end software solutions in the Observability domain Requirements 2+ years experience as a software developer Proficiency with programming and scripting languages like Go, Python, Java, or Ruby Strong ability to write code and an understanding of basic DSA concepts More ❯
and refine queue-based processing to support asynchronous workflows and event-driven architecture. Work collaboratively with cross-functional teams, including DevOps, Infrastructure, and Product, to deliver robust systems. Leverage observability tools to monitor, alert, and troubleshoot application and integration health. Stay current on AI-driven software development practices (e.g., GPT-assisted development, Agentic AI workflows) and suggest practical implementations. Participate … Prior experience building middleware for data sync, order processing, and internal APIs in a multi-system e-commerce environment Understanding of architecture patterns: Microservices , SOA , Hexagonal , Modular Monolith Monitoring & Observability: Grafana , Prometheus , CloudWatch , New Relic , Datadog , etc. Solid grasp of AI trends in software development , particularly in using GPT tools and agentic systems Education: Mathematics or Computer Science degree (or More ❯
AWS in a production environment Expertise in Kubernetes, including AKS, EKS, containerization, and Helm Proven ability to meet and maintain SOC 2 or equivalent compliance Strong background in automation, observability, and GitOps workflows Comfortable using AI coding tools like GitHub Copilot, Cursor, or Claude to enhance delivery Bonus if you have experience supporting hybrid or disconnected deployment environments or working … Be Using Cloud: Azure (including AKS, API Management, and DevOps Pipelines) and AWS (including EKS, Lambda, and CloudFormation) Infrastructure as Code and GitOps: Terraform, Bicep, Pulumi, ArgoCD, and FluxCD Observability: Prometheus, Grafana, OpenTelemetry, and Datadog Security and Compliance: HashiCorp Vault, Azure Key Vault, AWS KMS, OPA Gatekeeper, and Drata or similar Interested in exploring this further? This is a high More ❯
Liverpool, England, United Kingdom Hybrid / WFH Options
Bellrock Group
and DBAs to improve platform design and release workflows. Implement and promote best practices for operational readiness, reliability, and fault tolerance. Guide the platform team on tooling, automation, instrumentation, observability and best practice in Azure. Build a high-quality platform aligned to the Microsoft Cloud Adoption Framework, with Well Architected design, Defender, Advisor, Policy and governance in mind. Design and … background in CI/CD tools—GitHub Actions and Octopus Deploy. Proficient in writing and managing Infrastructure as Code (Terraform, ARM templates). Experienced in setting up and maintaining observability stacks (e.g. Application Insights, Prometheus, Grafana). Familiar with container orchestration concepts; Kubernetes experience is a plus. Scripting or programming experience in PowerShell, Python, or similar languages. Comfortable balancing speed More ❯
Sheffield, England, United Kingdom Hybrid / WFH Options
KnowBe4
and communication skills. Some of the technologies we use: Programming Languages - Python, Ruby, Rust Infrastructure as Code - Terraform, AWS CDK Source Code Management and CI/CD - GitLab, Snyk Observability - DataDog, Airbrake Containerized Workloads - Docker Cloud-native infrastructure in AWS - ECS, Lambda, Step Functions, SNS/SQS, Transit Gateway, Aurora, DynamoDB, CloudFront, S3, AppSync, API Gateway, and many more. Responsibilities … build highly scalable and resilient applications and infrastructure in AWS Maintain and improve extensible infrastructure-as-code using Terraform Learn, maintain, and improve our existing deployment strategies Deliver effective observability, monitoring, and alerting patterns for KnowBe4’s applications and infrastructure Minimum Qualifications: BS/MS/Ph.D. or equivalent plus 5 years experience Training in secure coding practices (preferred) Proficient More ❯
Liverpool, England, United Kingdom Hybrid / WFH Options
Concerto
and DBAs to improve platform design and release workflows. Implement and promote best practices for operational readiness, reliability, and fault tolerance. Guide the platform team on tooling, automation, instrumentation, observability and best practice in Azure. Build a high-quality platform aligned to the Microsoft Cloud Adoption Framework, with Well Architected design, Defender, Advisor, Policy and governance in mind. Design and … background in CI/CD tools—GitHub Actions and Octopus Deploy. Proficient in writing and managing Infrastructure as Code (Terraform, ARM templates). Experienced in setting up and maintaining observability stacks (e.g. Application Insights, Prometheus, Grafana). Familiar with container orchestration concepts; Kubernetes experience is a plus. Scripting or programming experience in PowerShell, Python, or similar languages. Comfortable balancing speed More ❯
Blackpool, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
company's data platforms, ensuring high availability, performance, and security Implement data governance policies and procedures, ensuring compliance with data quality standards and regulatory requirements Design and implement data observability and data quality monitoring solutions, enabling proactive identification and resolution of data issues Key Behaviours Technical Passion & Innovation: Demonstrates a strong passion for data technologies and a commitment to staying … as GitHub or Azure DevOps Experience with Azure DevOps for CI/CD pipeline development and data operations (DataOps) Experience with Python or other relevant coding languages Experience with Data Observability tools Exposure to Agile Project Methodology, i.e. Scrum The Application Timeline A first stage video call with the internal recruitment team (15 minute call) A face to face or video More ❯
Preston, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
company's data platforms, ensuring high availability, performance, and security Implement data governance policies and procedures, ensuring compliance with data quality standards and regulatory requirements Design and implement data observability and data quality monitoring solutions, enabling proactive identification and resolution of data issues Key Behaviours Technical Passion & Innovation: Demonstrates a strong passion for data technologies and a commitment to staying … GitHub or Azure DevOps Experience with Azure DevOps for CI/CD pipeline development and data operations (DataOps) Experience with Python or other relevant coding languages Experience with Data Observability tools Exposure to Agile Project Methodology, i.e. Scrum A first stage video call with the internal recruitment team (15 minute call) A face to face or video call with the More ❯
migration technologies like Azure Data Factory. Knowledge of core policy platforms like Bravura Sonata, TCS BaNCS, FNZ and bespoke legacy systems. Skills to modernise architectural estates and drive serviceability, observability dashboarding and metrics in end products. Good understanding of information principles, guardrails and processes spanning data within operational contexts such as Pensions Platforms. Knowledge of system development life cycle methodologies … Kafka, API Management Cloud migration frameworks (Azure Functions, Logic Apps, App Services) Java frameworks and technologies Database technologies and platforms like SQL, NoSQL, Data Lake, Snowflake, Databricks, MongoDB, Oracle Observability platforms like Splunk, Dynatrace, Datadog, Grafana About Royal London We're the UK's largest mutual life, pensions and investment company, offering protection, long-term savings and asset management products More ❯
migration technologies like Azure Data Factory. Knowledge of core policy platforms like Bravura Sonata, TCS BaNCS, FNZ and bespoke legacy systems. Skills to modernise architectural estates and drive serviceability, observability dashboarding and metrics in end products. Good understanding of information principles, guardrails and processes spanning data within operational contexts such as Policy Administration Systems. Knowledge of system development life cycle … frameworks (Azure Functions, Logic Apps, App Services) Java or Microsoft technologies, Azure platform and .Net ecosystem Database technologies and platforms like SQL, NoSQL, Data Lake, Snowflake, Databricks, MongoDB, Oracle Observability platforms like Splunk, Dynatrace, Datadog, Grafana About Royal London We're the UK's largest mutual life, pensions and investment company, offering protection, long-term savings and asset management products More ❯
in the team Contribute to solution architecture and strategic technical direction Build, integrate, and maintain REST APIs and backend services Champion best practices in software quality, CI/CD, observability, and DevOps Collaborate with cross-functional teams including Product, QA, and DevOps Optionally take on people management responsibilities for engineers Stay updated with emerging backend and cloud technologies Key Skills More ❯
Manchester, North West, United Kingdom Hybrid / WFH Options
Halian Technology Limited
in the team Contribute to solution architecture and strategic technical direction Build, integrate, and maintain REST APIs and backend services Champion best practices in software quality, CI/CD, observability, and DevOps Collaborate with cross-functional teams including Product, QA, and DevOps Optionally take on people management responsibilities for engineers Stay updated with emerging backend and cloud technologies Key Skills More ❯