monitoring platforms such as IBM Netcool, Moogsoft, BigPanda, PagerDuty, ServiceNow AIOps. Proficiency in Python, and hands-on knowledge of Ansible Automation Platform. Other highly valued skills include: Knowledge of Observability Platforms: Prometheus, Grafana, ELK, Splunk. Experience with integration into ITSM platforms such as ServiceNow. Experience with Kafka. You may be assessed on the key critical skills relevant for success in More ❯
deployment pipelines to enhance efficiency and reliability. Quality, Stability & Standards: Establish quality standards to meet performance, reliability, and maintainability of the systems. With a strong production-first mindset, drive observability, maintain Service Level Objectives (SLOs), and ensure efficient incident resolution. Oversee the maintenance of existing systems, ensuring continuous improvements and prompt resolution of issues. Agile Delivery & Collaboration: Working closely with More ❯
deployment pipelines to enhance efficiency and reliability. Quality, Stability & Standards: Establish quality standards to meet performance, reliability, and maintainability of the systems. With a strong production-first mindset, drive observability, maintain Service Level Objectives (SLOs), and ensure efficient incident resolution. Oversee the maintenance of existing systems, ensuring continuous improvements and prompt resolution of issues. Agile Delivery & Collaboration: Working closely with More ❯
Manchester Area, United Kingdom Hybrid / WFH Options
Explore Group
Design, build, and maintain secure, scalable infrastructure on AWS Develop and manage Infrastructure as Code (Terraform/CDK/etc.) Set up and optimize CI/CD pipelines and observability tooling Work closely with engineering and data science teams to enable smooth delivery What we're looking for: Proven experience with AWS and infrastructure engineering Strong understanding of cloud-native More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
BJSS
cloud architecture and as an engineer designing and implementing cloud solutions Extensive knowledge of one (or more) public cloud providers across hosting, networking, CI/CD, IAM, security and observability Deep understanding and use of infrastructure-as-code Exceptional analytical and problem-solving skills Strong record of technical leadership Ability to work collaboratively in a team environment Developing and maintaining More ❯
data analysis, and delivery methods. Other highly valued skills may include: Strong understanding of modern infrastructure architecture (containerization, virtualization, public cloud) and Site Reliability Engineering practices, including metrics and observability tools. Experience working in a finance, banking, or fintech company with an internal customer base. Certified Product Owner You may be assessed on the key critical skills relevant for success More ❯
our engineers Lead and contribute to cross-team initiatives from design through deployment and operations Write maintainable, well-tested, high-quality code and uphold engineering best practices Focus on observability and maintain Service Level Objectives, take operational responsibility for the Identity Platform, including joining the on-call rota Foster a strong engineering culture through mentorship, code reviews, and collaboration Lead More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
eBay Inc
platform guidelines Responsible for subsystems within a large domain, possessing in-depth knowledge and expertise. Taking full responsibility for the entire software life cycle, including design, development, testing, deployment, observability and monitoring. Serve as a key player in influencing and guiding others towards the right technical decisions. Identifying areas of features likely to change and integrating flexibility in the design More ❯
Manchester Area, United Kingdom Hybrid / WFH Options
On the Beach
native services (AWS). Experience with Kubernetes, Terraform, and CI/CD pipelines for scalable and automated deployments. Understanding of GraphQL and modern API-driven architectures. Strong focus on observability and monitoring, ensuring system reliability and performance. Knowledge of internet scale real-time data processing, supplier integrations, and transactional systems. A pragmatic approach to balancing technical debt, product velocity, and More ❯
Knutsford, England, United Kingdom Hybrid / WFH Options
Experis
resource is required to assist in upgrading the Elastic DP estate to Kubernetes thereby moving away from Obsolete technology (Cloudera), uplifting to RHEL 8, contributing towards improving stability and observability o... JBLK1_UKTJ More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Manchester Digital
all aspects of the product they work on, from ideation through to development, testing, and deployment, so you should expect to champion and mentor on best practices like TDD, Observability, and IaC. Skills: CI/CD, TDD, SOLID The salary is competitive - up to £90k plus benefits including hybrid working (1-2 days per month in Manchester). If you More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Manchester Digital
work on, from ideation through to development, testing and deployment, so you should expect to maintain high standards in code quality through TDD and be comfortable with practices like Observability and IaC. As a Senior Software Engineer, you'll also be given the opportunity to shape the overall approach to software engineering and mentor less experienced team members. Skills: TypeScript More ❯
Bolton, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
technical teams, including hiring, coaching, and performance management Solid commercial C++ experience on complex systems Proven experience with large, multi-component systems and distributed team practices Strong background in observability and logging Familiarity with infrastructure-as-code and automated deployments (Terraform, Helm or Flux) We make an active choice to be inclusive towards everyone every day. Please let us know More ❯
Stockport, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
technical teams, including hiring, coaching, and performance management Solid commercial C++ experience on complex systems Proven experience with large, multi-component systems and distributed team practices Strong background in observability and logging Familiarity with infrastructure-as-code and automated deployments (Terraform, Helm or Flux) We make an active choice to be inclusive towards everyone every day. Please let us know More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
technical teams, including hiring, coaching, and performance management Solid commercial C++ experience on complex systems Proven experience with large, multi-component systems and distributed team practices Strong background in observability and logging Familiarity with infrastructure-as-code and automated deployments (Terraform, Helm or Flux) We make an active choice to be inclusive towards everyone every day. Please let us know More ❯
Chester, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
technical teams, including hiring, coaching, and performance management Solid commercial C++ experience on complex systems Proven experience with large, multi-component systems and distributed team practices Strong background in observability and logging Familiarity with infrastructure-as-code and automated deployments (Terraform, Helm or Flux) We make an active choice to be inclusive towards everyone every day. Please let us know More ❯
Liverpool, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
technical teams, including hiring, coaching, and performance management Solid commercial C++ experience on complex systems Proven experience with large, multi-component systems and distributed team practices Strong background in observability and logging Familiarity with infrastructure-as-code and automated deployments (Terraform, Helm or Flux) We make an active choice to be inclusive towards everyone every day. Please let us know More ❯
Warrington, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
technical teams, including hiring, coaching, and performance management Solid commercial C++ experience on complex systems Proven experience with large, multi-component systems and distributed team practices Strong background in observability and logging Familiarity with infrastructure-as-code and automated deployments (Terraform, Helm or Flux) We actively promote inclusivity. Please inform us if you require accessibility adjustments during the application or More ❯
Preston, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
technical teams, including hiring, coaching, and performance management Solid commercial C++ experience on complex systems Proven experience with large, multi-component systems and distributed team practices Strong background in observability and logging Familiarity with infrastructure-as-code and automated deployments (Terraform, Helm or Flux) We make an active choice to be inclusive towards everyone every day. Please let us know More ❯
performing digital product. What You’ll Be Doing: Designing and building robust backend services to support personalisation, fulfilment, and customer experience. Enhancing developer tooling, CI/CD pipelines, and observability across the platform. Supporting internal teams by improving the engineering platform and workflows (including tools like Backstage). Collaborating across product and engineering teams to deliver new features with real More ❯
and modernization practices. Knowledge of SDLC methodologies like Agile, DevOps, and principles such as Domain Driven Design is essential. Engineering experience in cloud, Java/.NET, database technologies, and observability platforms is preferred. About Royal London We’re the UK’s largest mutual life, pensions, and investment company, offering protection, savings, and asset management products. Our values include empowerment, trust More ❯
its structure, and define what good should look like Strong understanding of architectural trade-offs, balancing scalability, maintainability, performance, and business needs Proficiency in DevOps, Infrastructure as Code (IaC), observability, performance tuning and core computer science principles Strong security mindset, understanding of best practices for secure coding, authentication, and data protection Experience in migrating legacy applications to modern architectures Demonstrated More ❯
Role: Observability Engineer - Splunk/Dynatrace Start Date: ASAP End Date: End of 2025 Rate £635 per day, Inside IR35 Work Pattern: Hybrid (3 Days Onsite) Location: Chester/Bromley/Camberley Summary A new role has opened for a Splunk SME on a contract basis until the end of the year, but likely to extend click apply for full More ❯
Altrincham, Greater Manchester, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AI Ops | Grafana | Observability | Pagerduty | Prometheus | SRE | Site Reliability Engineer | Telecommunications | Consultant | Dashboard | Systems Engineer Looking to make a step into SRE? Excited by the prospect of AI Ops? I've partnered an exciting business who've recently been acquired by a European leader in the AI Ops Consultancy space. Taking on their UK market to replicate their consistent success … right through to SME size businesses. Working with clients across the world, this one will be scaling a team of 4 to join their expert SRE's in the Observability domain. This one, in particular is hiring multiple people such as Grafana and PagerDuty to take on modern state of the art AI Ops projects and streamline businesses Cloud Operations. … really get hands on with AI Ops then get in touch. In return the role offers £55k and an opportunity to work remotely within the UK. AI Ops | Grafana | Observability | Pagerduty | Prometheus | SRE | Site Reliability Engineer | Telecommunications | Consultant | Dashboard | Systems Engi More ❯
Bolton, Greater Manchester, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AI Ops | Grafana | Observability | Pagerduty | Prometheus | SRE | Site Reliability Engineer | Telecommunications | Consultant | Dashboard | Systems Engineer Looking to make a step into SRE? Excited by the prospect of AI Ops? I've partnered an exciting business who've recently been acquired by a European leader in the AI Ops Consultancy space. Taking on their UK market to replicate their consistent success … right through to SME size businesses. Working with clients across the world, this one will be scaling a team of 4 to join their expert SRE's in the Observability domain. This one, in particular is hiring multiple people such as Grafana and PagerDuty to take on modern state of the art AI Ops projects and streamline businesses Cloud Operations. … really get hands on with AI Ops then get in touch. In return the role offers £55k and an opportunity to work remotely within the UK. AI Ops | Grafana | Observability | Pagerduty | Prometheus | SRE | Site Reliability Engineer | Telecommunications | Consultant | Dashboard | Systems Engi More ❯