implement observability solutions that provide real-time insight into the health, performance, and reliability of Anglian Water's digital platforms and products. Your work will enable proactive incident detection, rootcauseanalysis, and continuous improvement, embedding observability as a core engineering discipline across our organisation. What will you be doing as an Observability Engineer? Design and implement … aligned to user and business needs. Integrate observability tooling into CI/CD pipelines and infrastructure-as-code. Standardise tooling across teams and support automation of alert responses and rootcause analysis. Collaborate with development, operations, and platform teams to define SLIs, SLOs, and error budgets. Conduct rootcauseanalysis and post-incident reviews to More ❯
Deploy, configure, and optimize Wiz for continuous cloud security monitoring and compliance management. Identify vulnerabilities, misconfigurations, and risks across AWS, Azure, and GCP environments, and drive remediation efforts. Lead rootcauseanalysis (RCA) for security incidents and coordinate escalations as needed. Partner with software engineering and infrastructure teams to integrate security best practices into CI/CD … engineering using Wiz , AWS , Azure , and GCP . Strong understanding of cybersecurity principles , risk and controls , and internal control frameworks . Proficiency in incident response , security issue escalation , and rootcauseanalysis . Hands-on experience with security automation , DevSecOps tools , and infrastructure as code (e.g., Terraform, CloudFormation). Excellent problem-solving skills and ability to think More ❯
processes better. What You’ll Be Doing Owning and coordinating day-to-day application support Becoming the go-to expert for core business applications (high level, not coding) Driving root-causeanalysis to eliminate recurring issues Raising and presenting improvement initiatives for approval Supporting and coordinating the Change Advisory Board (CAB) Working closely with projects to minimise … days on-site per week) Permanent Up to £37,000 + £2,000 performance bonus What We’re Looking For Must-have: Background in application support Someone who enjoys root-causeanalysis and problem ownership Confident communicating across teams and taking the lead Nice-to-have: ITIL exposure or certification Experience with Azure DevOps (or similar) Why More ❯
and shape the future of Problem Management at scale. What You’ll Do: • Take full ownership of the problem lifecycle, from identification to resolution, preventing recurring service issues.• Lead RootCauseAnalysis (RCA) and trend analysis to uncover systemic problems and deliver actionable solutions.• Maintain and optimise the Known Error Database (KEDB) and ensure key information … excellence.• Exceptional analytical, problem-solving, and critical thinking skills.• Confident stakeholder management and influencing skills, able to engage both technical and non-technical audiences.• Experience with RCA methodologies, trend analysis, and embedding CSI initiatives.• Background in utilities, energy, or critical national infrastructure is a strong advantage.• Resilient, proactive, and thrives in a fast-paced, complex environment. Join a team More ❯
and shape the future of Problem Management at scale. What You’ll Do: • Take full ownership of the problem lifecycle, from identification to resolution, preventing recurring service issues. • Lead RootCauseAnalysis (RCA) and trend analysis to uncover systemic problems and deliver actionable solutions. • Maintain and optimise the Known Error Database (KEDB) and ensure key information … excellence. • Exceptional analytical, problem-solving, and critical thinking skills. • Confident stakeholder management and influencing skills, able to engage both technical and non-technical audiences. • Experience with RCA methodologies, trend analysis, and embedding CSI initiatives. • Background in utilities, energy, or critical national infrastructure is a strong advantage. • Resilient, proactive, and thrives in a fast-paced, complex environment. Join a team More ❯
Wigan, Lancashire, England, United Kingdom Hybrid/Remote Options
Searchability
Hands-on experience with AWS or another major cloud platform Knowledge of Kubernetes, Terraform, and Infrastructure as Code Strong networking fundamentals and distributed systems knowledge Proven incident response and rootcauseanalysis experience Excellent collaboration and communication skills TO BE CONSIDERED: Please either apply through this advert or emailing me directly via . For further information please More ❯
Wigan, Greater Manchester, United Kingdom Hybrid/Remote Options
Searchability (UK) Ltd
Hands-on experience with AWS or another major cloud platform Knowledge of Kubernetes, Terraform, and Infrastructure as Code Strong networking fundamentals and distributed systems knowledge Proven incident response and rootcauseanalysis experience Excellent collaboration and communication skills TO BE CONSIDERED: Please either apply through this advert or emailing me directly via (url removed). For further More ❯
of AWS, Terraform, and Ansible. Technical Skills Linux system administration & shell scripting. Networking fundamentals, containerization, and infrastructure security best practices. Version control experience (e.g., Git). Strong troubleshooting and rootcauseanalysis skills. Desirable Skills Experience with Kubernetes and/or other cloud platforms. Familiarity with Nagios, Datadog, or similar monitoring tools. Exposure to CI/CD More ❯
their fulfilment status. · Ensure that all service requests comply with established policies and procedures. Problem Management: · Assist with the identification and resolution of recurring incidents and problems. · Contribute to rootcauseanalysis and the development of permanent solutions. · Maintain documentation of known issues and solutions for future reference. Customer Service: · Provide excellent customer service to all end More ❯
the determination of equipment criticality across all instrumentation asset classes and business functions, acting as the technical authority for whole-life asset care strategies. Apply Failure Mode and Effects Analysis (FMEA) to develop predictive and preventative maintenance strategies for critical assets, and to propose efficient spare parts holding strategies. Use field feedback and performance data to conduct regular Preventative … Maintenance Optimisation (PMO), ensuring maintenance activities remain effective and efficient. Perform detailed reliability, asset health, and performance analysis—prioritising high-risk and high-cost assets—to initiate strategy reviews and quantify improvements such as Mean Time Between Failures (MTBF), cost savings, and resource efficiency. Facilitate defect elimination studies and solutions, by collating and analysing submitted rootcauseanalysis (RCA) conclusions, carried out locally by Maintenance Specialists, and using the companies ‘bad actor’ report to identify repeat cause assets which require investigation. Provide technical support and content for job plans, work instructions, and internal training courses; collaborate with operational and maintenance teams to improve reliability of operational equipment, and compliance with statutory and regulatory requirements More ❯
of collaboration, innovation, and continuous improvement. Assist with the design, implementation, and maintenance of systems to ensure high availability, scalability, and performance. Develop and implement strategies for incident response, rootcauseanalysis, and post-mortem reviews to prevent future incidents. Work closely with business and technology teams to understand their needs and ensure alignment with reliability and More ❯
Employment Type: Permanent
Salary: £90000 - £100000/annum To £140,000 package
of collaboration, innovation, and continuous improvement. Assist with the design, implementation, and maintenance of systems to ensure high availability, scalability, and performance. Develop and implement strategies for incident response, rootcauseanalysis, and post-mortem reviews to prevent future incidents. Work closely with business and technology teams to understand their needs and ensure alignment with reliability and More ❯
Manchester, Lancashire, United Kingdom Hybrid/Remote Options
Smart DCC
refreshes, ensuring new solutions are secure by design and aligned with DCC's technology strategy. Incident & Problem Response: Lead the Technology Office representation in post incident reviews, ensuring credible rootcauseanalysis (RCA) and delivery of corrective actions. Cross Domain Collaboration: Drive alignment and coherence across domain architectures to ensure performance, security, and operational integrity. Compliance & Governance More ❯
with other members of the team or directly with business users to understand and document business requirements, Undertake/support the monitoring of BAU processes as directed, including undertaking rootcauseanalysis, advising remediation options and if required delivering a solution including delivering any early lifecycle support as needed. Ensure that all work is carried through the More ❯
Employment Type: Permanent
Salary: £70000 - £80000/annum Hybrid working 10% pension
Didsbury, Manchester, Lancashire, England, United Kingdom
Great Places Housing Association
and the project management team to deliver new applications and system projects. Work with colleagues, engineers and consultants to resolve complex on going technical issues and provide input into rootcauseanalysis reports as requested. Assist with the development and implementation of disaster recovery (DR) plans including testing and documentation of the network infrastructure. Familiarise yourself with More ❯
Level 3/HNC/HND Degree Professional experience 3 years supervisory experience within E&I Maintenance Environment Specific training: (detailed in specialised functions) Modern E&I Maintenance Techniques, RootCauseAnalysis, Q.S.E. knowledge/training Iosh/Nebosh Skills for this position Health and Safety Communication Commitment Leadership and Man Management Experience People management and development More ❯
The Tea Factory, 82 Wood Street, Liverpool, England
AMPITO GROUP
Sector. Role Troubleshooting and resolution of complex networking incidents and escalations via tickets Act as an escalation point to our 1st line team Monitoring and fixing of networking alerts Rootcauseanalysis and reporting of network trends Major incident resolution often working with 3rd party on-site engineers On-call support (escalations only, as part of rota More ❯
testing of large Telco systems. - Have wide experience of Operational Acceptance Testing. - Take full responsibility for the prevention of production incidents with regard to non functional requirements and provide RootCauseAnalysis for issues that occur in production so they do not reoccur - Monitor, manage and enforce quality gate processes to prevent defect seepage into downstream test More ❯
BAE Systems, Channel Way, Preston, Lancashire, England
BAE SYSTEMS PLC
the UK, giving you the opportunity to make a rewarding contribution to the work we do. You will experience placements in Products & Applications and Solutions Architecture. Work may include rootcauseanalysis, contributing to the design of software solutions. You will be involved in implementing innovative digital solutions to drive efficiency, with opportunities to evaluate, initiate, create More ❯
on complex process and/or high-speed production equipment Experience troubleshooting, testing, and repairing instrumentation devices Experience in PLC touch screen controls and electrical maintenance Problem solving and rootcauseanalysis experience PLC programming skills Time served Electrical Engineer Automation/Controls apprentice with several years post qualification work experience within a manufacturing environment HNC in More ❯
England, United Kingdom, Manchester Hybrid/Remote Options
Vermelo RPO
however, we are open to largely remote working with the occasional travel to an office. Role Responsibility A key change role, responsible for the successful delivery of Change Business Analysis Services to Programmes, Projects & Small Changes, to support the business in meeting its objectives and goals through the delivery of Change. A Senior Change Business Analyst will be assigned … to Larger Projects and Programmes, with high or very high complexity, engaging with Director or C-Suite stakeholders. Typical Accountabilities Change Business Analysis Service Delivery Deliver Change Business Analysis Service Offerings to Programmes, Projects and Small Changes, including (but not limited to): Identify and document the Business Functions, Products, Processes, Systems and Services that are (or may be … impacted by a Change Undertake analysis to identify and quantify financial and non financial benefits of a Change Requirements Elicitation Work with identified stakeholders to elicit their business, functional and nonfunctional requirements/stories Use appropriate tools and techniques to capture and document those requirements/stories, that meet the needs of downstream ‘customers’ (e.g. Technology, 3rd Parties, Operational More ❯
Salford, Lancashire, England, United Kingdom Hybrid/Remote Options
Vermelo RPO
however, we are open to largely remote working with the occasional travel to an office. Role Responsibility A key change role, responsible for the successful delivery of Change Business Analysis Services to Programmes, Projects & Small Changes, to support the business in meeting its objectives and goals through the delivery of Change. A Senior Change Business Analyst will be assigned … to Larger Projects and Programmes, with high or very high complexity, engaging with Director or C-Suite stakeholders. Typical Accountabilities Change Business Analysis Service Delivery Deliver Change Business Analysis Service Offerings to Programmes, Projects and Small Changes, including (but not limited to): Identify and document the Business Functions, Products, Processes, Systems and Services that are (or may be … impacted by a Change Undertake analysis to identify and quantify financial and non financial benefits of a Change Requirements Elicitation Work with identified stakeholders to elicit their business, functional and nonfunctional requirements/stories Use appropriate tools and techniques to capture and document those requirements/stories, that meet the needs of downstream ‘customers’ (e.g. Technology, 3rd Parties, Operational More ❯