with GitOps tools (e.g., ArgoCD, Flux). CI/CD - Skilled in building and managing pipelines using Azure DevOps, GitHub Actions, etc. Monitoring - Experience with Prometheus, Grafana, and other observability tools. Application Stack - Familiarity with .NET, Node.js, React, and web server technologies like Nginx. Relevant certifications or the ability to demonstrate equivalent experience, such as: Terraform Associate About Acorn Insurance More ❯
Liverpool, Lancashire, United Kingdom Hybrid / WFH Options
The Acorn Group
with GitOps tools (e.g., ArgoCD, Flux). CI/CD - Skilled in building and managing pipelines using Azure DevOps, GitHub Actions, etc. Monitoring - Experience with Prometheus, Grafana, and other observability tools. Application Stack - Familiarity with .NET, Node.js, React, and web server technologies like Nginx. Relevant certifications or the ability to demonstrate equivalent experience, such as: Terraform Associate About Acorn Insurance More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Arm Limited
infrastructure "Nice To Have" Skills and Experience: Experience in a GitOps solution such as ArgoCD, Flux or Fleet Implementation of the Security Development Lifecycle (SDL) in infrastructure Monitoring and observability using Prometheus and Grafana, ELK stack or equivalent Use of Kubernetes management systems such as Rancher Familiarity with open source project development cycles and contribution processes, particularly around CI/ More ❯
multiple stakeholders including development teams to implement and maintain reliable and scalable systems while adhering to industry best practices and security standards. Responsibilities and Impact: Design, implement, and maintain observability solutions to track system health and performance. Analyze observability data to identify and troubleshoot potential issues proactively. Develop and implement alerts and notifications for critical events. Collaborate with development teams … in Computer Science, Information Technology, or a related field. 5+ years of experience as a Site Reliability Engineer or equivalent in a similar role. Proficient in application and infrastructure observability, Splunk OpenTelemetry preferred Experienced in production environments running in AWS Comfortable with Infrastructure as Code, Terraform is preferred Comfortable with CI/CD pipelines such as GitHub Actions, Azure DevOps More ❯
of new software and tools into the platform. Support scalable, resilient cloud environments with modern DevOps practices. Promote GitOps deployment strategies and mentor peers in DevOps best practice. Enhance observability using tools like Prometheus and Grafana. This role is ideal for someone looking to take the next step in a DevOps career while working with a modern tech stack in More ❯
or DevOps Expertise in microservices and API design Docker, and container runtime platforms such as Kubernetes, EKS, ECS etc Strong understand of operational concepts on AWS, particularly monitoring and observability, FinOps Utilising CI/CD tools, such as Bamboo, Jenkins, TeamCity, Bitbucket, in order to streamline delivery of new features and fixes Continual testing of code using Automated Testing Frameworks More ❯
a minimum of two years working with us post training Nice to have: Domain knowledge: Banking, Financial Services, Lending (Very nice to have – understanding the wholesale lending lifecycle) Monitoring & Observability: Experience in managing Tools like APPD, ELK stack, Grafana Security Practices: DevSecOps principles, vulnerability scanning, compliance automation, Certificate/vault/user role management. Strong attention to detail a passion More ❯
or DevOps Expertise in microservices and API design Docker, and container runtime platforms such as Kubernetes, EKS, ECS etc Strong understand of operational concepts on AWS, particularly monitoring and observability, FinOps UtilisingCI/CD tools, such as Bamboo, Jenkins, TeamCity, Bitbucket, in order to streamline delivery of new features and fixes Continual testing of code using Automated Testing Frameworks A More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
BAE Systems (New)
or DevOps Expertise in microservices and API design Docker, and container runtime platforms such as Kubernetes, EKS, ECS etc Strong understand of operational concepts on AWS, particularly monitoring and observability, FinOps UtilisingCI/CD tools, such as Bamboo, Jenkins, TeamCity, Bitbucket, in order to streamline delivery of new features and fixes Continual testing of code using Automated Testing Frameworks A More ❯
ll sit at the heart of our engineering operations, bringing together SRE principles and modern platform engineering practices. This includes combining principles of SRE - such as service-level reliability, observability, incident response - with platform engineering practices like GitOps, Infrastructure as Code, DevSecOps automation, and self-service enablement, to help development teams ship faster, safer, and more cost-efficiently. What you … ll be doing: Designing and operating highly reliable, scalable, and secure Azure-based platforms Applying SRE principles like SLOs, observability, and incident management to drive service reliability Building Infrastructure as Code using Terraform (v1.7+) and GitOps workflows Enabling teams through platform tools, reusable Terraform modules, and self-service infrastructure Enhancing CI/CD pipelines (Azure DevOps, YAML-based) with security … knowledge (AKS, Functions, SQL, Cosmos DB, etc.) Strong Infrastructure as Code skills with Terraform (v1.7+) Experience with CI/CD pipelines, GitOps, and automation tools (PowerShell, Bash) Familiarity with observability and incident tools like Datadog, ELK, and synthetic monitoring Solid understanding of networking (TCP/IP, Load Balancing, DNS, Routing) Good knowledge of DevSecOps practices - including security scanning, IAM, and More ❯
Crewe, Cheshire, United Kingdom Hybrid / WFH Options
Manchester Digital
platform security, reliability, and performance across systems deployed in Canada, the UK, and AWS cloud environments Contribute to key projects, platform optimizations, and ongoing maintenance initiatives Help drive scalability, observability, and operational excellence If you're passionate about infrastructure, cloud, and systems engineering-and want to help shape the future of mobility-we want to hear from you! Requirements We … configurations (Azure AD , Ory, Cognito, Firebase) - Understanding of Site Reliability Engineering and key concepts - Proficient in Infrastructure as Code pipeline deployments and pipeline version control within Terraform or CloudFormation. - Observability Systems, e.g., Nagios, New Relic - Able to troubleshoot/work under pressure, meet deadlines. - Previous experience in a cloud engineering role. - AWS certified as SysOps Administrator/Solutions Architect/… understanding of Infrastructure as Code principles and related tech such as Terraform or CloudFormation - Enhanced experience of AWS cloud technologies, e.g., ECS, EC2, VPC, Lambda, CFS. Ideally AWS certified. - Observability Systems, e.g., New Relic, CloudWatch, SquadCast - ITIL Qualified or awareness of the framework. Bonus Qualifications: -Experience with Linux system administration and troubleshooting. -Basic knowledge of AWS cloud technologies such as More ❯
Manchester Area, United Kingdom Hybrid / WFH Options
Revolent Group
related processes like data migrations and environment setup. ✅ Preferred (Nice to Have): Banking/Financial Services knowledge — especially around wholesale lending and Loan IQ . Experience with monitoring and observability tools such as APPD, ELK Stack, or Grafana. Understanding of DevSecOps principles , including vulnerability scanning, secrets management, and compliance automation. Further experience with CI/CD integration and pipeline automation More ❯
this role, you will assist in upgrading the Elastic DP estate to Kubernetes, moving away from obsolete technology (Cloudera), upgrading to RHEL 8, and contributing to improving stability and observability of the platform. You will provide advanced analytics tooling and services for modeling analytics, working across continuous integration, development, build, and deployment using automation and cloud technologies to support the More ❯
position will align to a discipline where you will be expected to build and support solutions aligned with SDLC principles, providing technical excellence with a focus on scripting and observability coupled with a security mindset. What will you be doing day-to-day? Automation and Orchestration: Streamline the delivery and support processes by leveraging automation and IaC principles. Support and More ❯
Salford, Manchester, United Kingdom Hybrid / WFH Options
BBC Group and Public Services
/CD pipelines using GitHub Actions, AWS CodePipeline, Jenkins, and other tools, with an emphasis on reliability, reusability, and performance. Contribute to the design and integration of monitoring and observability solutions (CloudWatch, Prometheus, Grafana) to ensure infrastructure and model health. Champion software engineering excellence through Test-Driven Development (TDD), rigorous test automation, and continuous quality assurance practices. Support architectural decisions More ❯
and refine queue-based processing to support asynchronous workflows and event-driven architecture. Work collaboratively with cross-functional teams, including DevOps, Infrastructure, and Product, to deliver robust systems. Leverage observability tools to monitor, alert, and troubleshoot application and integration health. Stay current on AI-driven software development practices (e.g., GPT-assisted development, Agentic AI workflows) and suggest practical implementations. Participate … Prior experience building middleware for data sync, order processing, and internal APIs in a multi-system e-commerce environment Understanding of architecture patterns: Microservices , SOA , Hexagonal , Modular Monolith Monitoring & Observability: Grafana , Prometheus , CloudWatch , New Relic , Datadog , etc. Solid grasp of AI trends in software development , particularly in using GPT tools and agentic systems Education: Mathematics or Computer Science degree (or More ❯
migration technologies like Azure Data Factory. Knowledge of core policy platforms like Bravura Sonata, TCS BaNCS, FNZ and bespoke legacy systems. Skills to modernise architectural estates and drive serviceability, observability dashboarding and metrics in end products. Good understanding of information principles, guardrails and processes spanning data within operational contexts such as Pensions Platforms. Knowledge of system development life cycle methodologies … Kafka, API Management Cloud migration frameworks (Azure functions, Logic Apps, App Services) Java frameworks and technologies Database technologies and platforms like SQL, NoSQL, Data Lake, Snowflake, Databricks, MongoDB, Oracle Observability platforms like Splunk, Dynatrace, Datadog, Grafana About Royal London We're the UK's largest mutual life, pensions and investment company, offering protection, long-term savings and asset management products More ❯
migration technologies like Azure Data Factory. Knowledge of core policy platforms like Bravura Sonata, TCS BaNCS, FNZ and bespoke legacy systems Skills to modernise architectural estates and drive serviceability, observability dashboarding and metrics in end products. Good understanding of information principles, guardrails and processes spanning data within operational contexts such as Policy Administration Systems. Knowledge of system development life cycle … frameworks (Azure functions, Logic Apps, App Services) Java or Microsoft technologies, Azure platform and .Net ecosystem Database technologies and platforms like SQL, NoSQL, Data Lake, Snowflake, Databricks, MongoDB, Oracle Observability platforms like Splunk, Dynatrace, Datadog, Grafana About Royal London We're the UK's largest mutual life, pensions and investment company, offering protection, long-term savings and asset management products More ❯
Manchester, North West, United Kingdom Hybrid / WFH Options
Halian Technology Limited
in the team Contribute to solution architecture and strategic technical direction Build, integrate, and maintain REST APIs and backend services Champion best practices in software quality, CI/CD, observability, and DevOps Collaborate with cross-functional teams including Product, QA, and DevOps Optionally take on people management responsibilities for engineers Stay updated with emerging backend and cloud technologies Key Skills More ❯
this role you will, assist in upgrading the Elastic DP estate to Kubernetes thereby moving away from Obsolete technology (Cloudera), uplifting to RHEL 8, contributing towards improving stability and observability of the platform and providing advanced analytics tooling and services for modelling analytics. Working across continuous integration, development, build and deployment using automation & cloud technologies to support the growth of More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
WorksHub
the infrastructure and deployment of those applications. We are actively expanding our Manchester born SRE function, which aims to advance our knowledge and innovation globally in areas such as Observability, Reliability and Availability. We have the autonomy to choose the technologies and processes that help us achieve our objectives. So each team leverages the technology that fits their needs best. More ❯
across multiple squads to ensure our platform is scalable, secure, and designed for rapid deployment and operational excellence. You'll contribute to the development and automation of cloud infrastructure, observability systems, CI/CD pipelines, and event-based services that power key parts of our product ecosystem. About Suits Me Suits Me is a multi-award-winning, ethical fintech dedicated … pipelines (e.g. GitHub Actions) to enable rapid and reliable delivery of services Contributing to the design of scalable and secure platform components that enable developer productivity Building and improving observability tooling (e.g. CloudWatch, Grafana) to support rapid detection and resolution of issues Collaborating with developers and stakeholders across squads to understand infrastructure needs and ensure best practices are applied Writing More ❯
stakeholders to define solutions Mentor junior developers and promote engineering best practices Drive improvements in development processes, CI/CD pipelines, and tooling Investigate and resolve production issues Ensure observability through logging, metrics, and diagnostics Contribute to event-driven architecture and distributed systems design What You Bring 5+ years of backend development experience Expertise in C#, .NET (preferably .NET 6+ More ❯
CI/CD pipelines Understanding of security protocols, authentication (OAuth, JWT), and data protection best practices Solid grasp of scalable architecture, SOLID principles, and clean coding standards Experience with observability tools (eg Grafana, Loki) and automated testing frameworks Comfortable working in agile, cooperative teams with product-first thinking Apply with your CV below More ❯
of practices (e.g., Cloud, Platforms. AI, Strategy, Custom Application Development, Network & Edge, Security, Resiliency, etc.) Articulate the vision for modern engineering (e.g., agile, cloud-native, DevOps) and operations (e.g., observability, automated response, SRE, etc.) and able to articulate a path toward a target operating model (people, process, and tools) SoftServe is an Equal Opportunity Employer. All qualified applicants will receive More ❯