Slough, South East England, United Kingdom Hybrid / WFH Options
Anecdote
Take basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost tuning. Participate in on‐call for your area and drive root‐cause analysis with crisp follow‐ups. 15% Collaborate: Pair with back‐end & front‐end to wire extractors/detectors and agents into ticketing, voice, and analytics stacks … ADK preferred). Proven track record shipping AI agents and building RAG pipelines. LLM + DS depth: Prompting/tooling, retrieval design, LLM evals; hands‐on with time‐series analysis (forecasting, change‐point, drift). Cloud & ops: Basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost control. Communication: You explain …
London (City of London), South East England, United Kingdom Hybrid / WFH Options
Anecdote
Take basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost tuning. Participate in on‐call for your area and drive root‐cause analysis with crisp follow‐ups. 15% Collaborate: Pair with back‐end & front‐end to wire extractors/detectors and agents into ticketing, voice, and analytics stacks … ADK preferred). Proven track record shipping AI agents and building RAG pipelines. LLM + DS depth: Prompting/tooling, retrieval design, LLM evals; hands‐on with time‐series analysis (forecasting, change‐point, drift). Cloud & ops: Basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost control. Communication: You explain …
Los Angeles, California, United States Hybrid / WFH Options
INSPYR Solutions
You should have strong coding skills, a passion for automation, and a focus on reliability engineering to deliver robust and maintainable systems. You will work on network design, traffic analysis and engineering, maintaining CI/CD pipeline and creating tools to enhance observability and streamline troubleshooting for core infrastructure services. Your role will include: Designing, deploying, and operating the … strategies. QoS experience across multiple vendor hardware implementations. Troubleshooting and Incident Response: Skilled at troubleshooting live incidents, with a proactive approach to minimizing downtime and service impact. Familiarity with Root Cause Analysis (RCA) processes to identify, document, and drive long-term solutions to recurring issues. Automation and Scripting: Proficiency in scripting and programming languages like Python and …
Bletchley, Buckinghamshire, United Kingdom Hybrid / WFH Options
Tria
across the business. Key Responsibilities: Create new and adjust existing Tableau reports (Desktop, Server, and/or Cloud). Create, amend, and maintain documentation related to reporting processes. Conduct root cause analysis and troubleshoot reporting incidents. Required Experience: Proficient in Tableau. Strong SQL skills, with the ability to read and write queries. Excellent communication skills, with the …
Southborough, Kent, United Kingdom Hybrid / WFH Options
Vermelo RPO
Firm understanding of Agile concepts. Experienced in Web and Device testing (mobile and cross browser). Excellent written and spoken English. Excellent attention to detail. Able to troubleshoot issues with root cause analysis. Desired Skills: Experience with Azure. Experience with Microsoft Visual Studio. Experience with .NET. Experience with Big Data Database Technologies, DataLake, CosmosDb, SQL. Experience with Telemetry …
Tunbridge Wells, Kent, United Kingdom, Southborough Hybrid / WFH Options
Vermelo RPO
Firm understanding of Agile concepts. Experienced in Web and Device testing (mobile and cross browser). Excellent written and spoken English. Excellent attention to detail. Able to troubleshoot issues with root cause analysis. Desired Skills: Experience with Azure. Experience with Microsoft Visual Studio. Experience with .NET. Experience with Big Data Database Technologies, DataLake, CosmosDb, SQL. Experience with Telemetry …
knowledge: VPCs, subnets, routing, security groups, load balancing and Route 53. Automation skills with Python, Bash or PowerShell. Git version control expertise (GitHub/GitLab). Proven incident response, root cause analysis and debugging skills. Strong communicator, able to work collaboratively in multi-disciplinary teams. AWS certifications (Solutions Architect, Developer, SysOps) would be a bonus. Familiarity with …
City, Birmingham, United Kingdom Hybrid / WFH Options
Plum Personnel
be doing: • Collecting, validating, and analysing data to ensure accuracy and integrity. • Producing insightful reports and dashboards to track KPIs and identify opportunities for improvement. • Solving operational challenges through root-cause analysis and data-driven recommendations. • Collaborating with stakeholders across the EMEA North Region to enhance service delivery and supply chain efficiency. • Driving continuous improvement and contributing …
Cheltenham, Gloucestershire, United Kingdom Hybrid / WFH Options
Big Red Recruitment Midlands Limited
Developer, you'll have the opportunity to work across the end-to-end software development lifecycle (SDLC). You'll play a crucial role in ensuring code quality, performing root cause analysis, and conducting peer code reviews. Working with cloud-based software, you'll also take part in design discussions and product specification meetings, contributing to both …
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
Other responsibilities will encompass proactive monitoring of production environments, design and implementation of automation and processes to improve efficiency and effectiveness, and taking a lead in incident response, troubleshooting and root cause analysis activities to mitigate future issues. You'll collaborate with senior business stakeholders to gather requirements, address concerns and provide updates on projects and systems status …
Leatherhead, Surrey, South East, United Kingdom Hybrid / WFH Options
Recruitvirt
data protection, recovery, and disaster recovery (DR) across on-prem and hybrid workloads. - Manage incidents, service requests, and change controls via standard ITIL-based processes. - Lead and participate in root cause analysis for infrastructure-related incidents and issues. - Maintain and update detailed technical documentation and configuration records. - Act as a senior point of contact for customers, attending …
Washington, Washington DC, United States Hybrid / WFH Options
Corelight, Inc
playbooks for SOC/IR workflow automation based on Corelight data. Ad-hoc (as requested) written summary reports on equipment and security problems. Technical input to major service outage root cause analysis and corrective action reports. Leading project status meetings and wrap-up/post-mortem meetings. Some on-site work required. Qualifications: US Citizen. TS/ …
subject matter expert for studio technology systems, staying current on best practices, new tools, and updates. Collaborate with internal and vendor support teams to resolve complex technical issues. Perform root cause analysis and implement corrective/preventative measures. Maintain accurate documentation of incidents, resolutions, maintenance activities, and system configurations. Manage technical inventory, parts ordering, and equipment logistics …
Grantham, Lincolnshire, East Midlands, United Kingdom Hybrid / WFH Options
Recruitment Revolution
and ensuring a seamless support experience. • Keep everything running efficiently by accurately updating our CRM/ticketing system in real time - every detail matters. • Dive into incident resolution and root cause analysis, helping to prevent future issues before they start. • Ability to replicate problems and validate issues, using localised environments and copies of anonymised customer data. • Collaborate …
be willing to travel to other sites as needed. KEY RESPONSIBILITIES: Troubleshoot and resolve incidents, major incidents, problems and service requests, providing regular updates to the end user. Conduct root cause analysis of major incidents and problem records, contributing to the implementation of remedies and preventative measures. To be the first point of escalation for the 1st …
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Outcomes First Group
be willing to travel to other sites as needed. KEY RESPONSIBILITIES: Troubleshoot and resolve incidents, major incidents, problems and service requests, providing regular updates to the end user. Conduct root cause analysis of major incidents and problem records, contributing to the implementation of remedies and preventative measures. To be the first point of escalation for the 1st …
Bolton, Greater Manchester, North West, United Kingdom Hybrid / WFH Options
Outcomes First Group
be willing to travel to other sites as needed. KEY RESPONSIBILITIES: Troubleshoot and resolve incidents, major incidents, problems and service requests, providing regular updates to the end user. Conduct root cause analysis of major incidents and problem records, contributing to the implementation of remedies and preventative measures. To be the first point of escalation for the 1st …
Manchester, North West England, United Kingdom Hybrid / WFH Options
Outcomes First Group
be willing to travel to other sites as needed. KEY RESPONSIBILITIES: Troubleshoot and resolve incidents, major incidents, problems and service requests, providing regular updates to the end user. Conduct root cause analysis of major incidents and problem records, contributing to the implementation of remedies and preventative measures. To be the first point of escalation for the 1st …
Edinburgh, Midlothian, United Kingdom Hybrid / WFH Options
Aberdeen
Infrastructure as Code (IaC) practices using Bicep or Terraform. Set up and maintain observability for integration components using Azure Monitor, Application Insights, and Log Analytics. Support incident response and root cause analysis for integration-related issues. Apply security best practices across integration solutions, including authentication, encryption, and access control. Ensure compliance with internal and external standards (e.g. …
key service indicators. Mentor and develop the team, building leadership capabilities and succession planning across the team. Workforce Planning: Hold teams accountable to SLA targets with rapid diagnosis and root cause analysis of deviations. Implement robust capacity planning and resource optimisation models and processes. Lead flexible resourcing strategies, optimising the mix of permanent, locum, and BPO staff … AI teams to optimise clinical workflows and provider tools. Lead new service launches with Product and Medical teams, ensuring smooth rollouts with minimal prescriber friction. Budget Management and Performance Analysis: Set rolling 12-month Clinical Operations budget and cost-to-serve targets with Finance. Deliver cost-saving initiatives on time and within budget parameters. Lead team to provide timely …
Milton Keynes, Buckinghamshire, England, United Kingdom Hybrid / WFH Options
MHA
end user technologies, e.g. PCs, Audio Visual, Mobile Phones, Telephone systems. Experience working with ITIL service and support processes. Strong analytical and diagnostic skills for problem resolution and root cause analysis. A broad understanding of technology and a good level of awareness of technical concepts. Excellent knowledge of Microsoft products. Any experience supporting business applications would … Managers to create a customer engagement plan for your region. The primary focus is to continuously improve services, saving time and removing frustration, using your own knowledge and detailed analysis of tickets and current trends to identify and drive opportunities for improvement. A key part of the role is to help our customers to achieve value from IT through …
Staines-upon-Thames, Middlesex, England, United Kingdom Hybrid / WFH Options
Salt Search
product areas, create automation scripts for faster troubleshooting, and deliver knowledge sessions to peers globally. You will work closely with customers and internal teams to troubleshoot, resolve, and provide root cause analysis for technical issues, while maintaining high standards of documentation and case management. Key Responsibilities: Troubleshoot and resolve technical issues related to UX and platform performance. …
City of London, London, United Kingdom Hybrid / WFH Options
DGH Recruitment
Overall management and maintenance of the CrowdStrike platform, including configuring EDR policies, tuning SIEM rules, and optimizing the system for performance - Leading or participating in incident response efforts, conducting root cause analysis, and developing runbooks for incident handling - Monitoring for security threats, analysing alerts, and responding to incidents using CrowdStrike and other security tools. Conduct vulnerability scans …
Corsham, Wiltshire, United Kingdom Hybrid / WFH Options
CBSbutler Holdings Limited trading as CBSbutler
networks, and applications. Installing updates, patches, and operating systems (Windows/Linux), plus security software. Monitoring complex environments and troubleshooting across hardware, networks, and applications. Diagnosing issues, carrying out root-cause analysis, and implementing fixes. Managing backups, inventories, and hardware installs (yes, sometimes lifting heavy kit). Maintaining connectivity, documenting changes, and ensuring smooth operations. Must-haves …