of reliability engineering principles, including SLIs, SLOs, and error budgets. Excellent communication and stakeholder management skills. Ability to lead by influence and build consensus across diverse teams. Experience with incidentresponse, root cause analysis, and implementing preventative measures. Comfortable working in a fast-paced, results-oriented contract environment. Please apply with your up-to-date CV in English. More ❯
City of London, London, United Kingdom Hybrid / WFH Options
RED Global
of reliability engineering principles, including SLIs, SLOs, and error budgets. Excellent communication and stakeholder management skills. Ability to lead by influence and build consensus across diverse teams. Experience with incidentresponse, root cause analysis, and implementing preventative measures. Comfortable working in a fast-paced, results-oriented contract environment. Please apply with your up-to-date CV in English. More ❯
london, south east england, united kingdom Hybrid / WFH Options
RED Global
of reliability engineering principles, including SLIs, SLOs, and error budgets. Excellent communication and stakeholder management skills. Ability to lead by influence and build consensus across diverse teams. Experience with incidentresponse, root cause analysis, and implementing preventative measures. Comfortable working in a fast-paced, results-oriented contract environment. Please apply with your up-to-date CV in English. More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
RED Global
of reliability engineering principles, including SLIs, SLOs, and error budgets. Excellent communication and stakeholder management skills. Ability to lead by influence and build consensus across diverse teams. Experience with incidentresponse, root cause analysis, and implementing preventative measures. Comfortable working in a fast-paced, results-oriented contract environment. Please apply with your up-to-date CV in English. More ❯
. Managing Infrastructure as Code (Terraform, Pulumi, CloudFormation) for scalable, repeatable deployments. Automating with PowerShell, Python, or Bash to drive efficiency. Supporting Kubernetes and AKS environments in production. Leading incidentresponse, postmortems, and continuous improvement processes. Driving cost optimisation, capacity planning, and load testing. Championing best practices in cloud security and resilience. Key Skills & Experience Required: Proven Site More ❯
. Managing Infrastructure as Code (Terraform, Pulumi, CloudFormation) for scalable, repeatable deployments. Automating with PowerShell, Python, or Bash to drive efficiency. Supporting Kubernetes and AKS environments in production. Leading incidentresponse, postmortems, and continuous improvement processes. Driving cost optimisation, capacity planning, and load testing. Championing best practices in cloud security and resilience. Key Skills & Experience Required: Proven Site More ❯
Working closely with development teams to integrate reliability and performance into the software lifecycle. Managing and evolving CI/CD pipelines to ensure smooth deployments and rollbacks. Contributing to incidentresponse , post-mortems, and reliability improvements. Championing SRE principles such as error budgets, SLIs/SLOs, and automation-first thinking. What Were Looking For Strong experience running cloud More ❯
london, south east england, united kingdom Hybrid / WFH Options
Mimecast
throughput platforms. Collaborate & Lead: Work closely with product, UX, and engineering teams in Agile sprints, shaping platform capabilities and mentoring peers. Champion Resilience: Participate in on-call rotations and incidentresponse, driving proactive improvements for platform uptime and reliability. Code with Excellence: Write clean, efficient, and maintainable code, and set the standard through thoughtful code reviews and technical More ❯
Hammersmith, London, England, United Kingdom Hybrid / WFH Options
Proactive Appointments
managing and administering IT Service governance, ideally aligned with ITIL principles. Including: User provisioning and access management (Starter/movers/leavers, access reviews, etc) Change management Asset management Incident/Problem management Service catalogue, CMDB, etc End user device and software patching Familiarity with Information Security Operational management The role of service desk in incidentresponseMore ❯
experience with: Planning and executing Purple tests Hands on IT Security analysis and security engineering in particular: Security systems Securing Networks On - call experience Malware analysis, Intrusion detection and IncidentResponse across a hybrid estate Ensuring a risk based approach to security Monitoring and tuning technical IT Security Controls The ideal Purple Team Analyst/Cyber Security Analyst More ❯
Tower Hamlets, Greater London, Canary Wharf, United Kingdom
Spencer Rose
experience with: Planning and executing Purple tests Hands on IT Security analysis and security engineering in particular: Security systems Securing Networks On - call experience Malware analysis, Intrusion detection and IncidentResponse across a hybrid estate Ensuring a risk based approach to security Monitoring and tuning technical IT Security Controls The ideal Purple Team Analyst/Cyber Security Analyst More ❯
performance cloud infra for ML workloads Build and manage GPU clusters, storage systems, and distributed training environments Set up and optimise containerised workflows (Docker, Kubernetes, Terraform) Implement robust monitoring, incidentresponse, and CI/CD practices Collaborate closely with researchers to integrate and scale experiments This person must have experience building ML Infrastructure and cloud architecture from scratch More ❯
performance cloud infra for ML workloads Build and manage GPU clusters, storage systems, and distributed training environments Set up and optimise containerised workflows (Docker, Kubernetes, Terraform) Implement robust monitoring, incidentresponse, and CI/CD practices Collaborate closely with researchers to integrate and scale experiments This person must have experience building ML Infrastructure and cloud architecture from scratch More ❯
performance cloud infra for ML workloads Build and manage GPU clusters, storage systems, and distributed training environments Set up and optimise containerised workflows (Docker, Kubernetes, Terraform) Implement robust monitoring, incidentresponse, and CI/CD practices Collaborate closely with researchers to integrate and scale experiments This person must have experience building ML Infrastructure and cloud architecture from scratch More ❯
london (city of london), south east england, united kingdom
Harnham
performance cloud infra for ML workloads Build and manage GPU clusters, storage systems, and distributed training environments Set up and optimise containerised workflows (Docker, Kubernetes, Terraform) Implement robust monitoring, incidentresponse, and CI/CD practices Collaborate closely with researchers to integrate and scale experiments This person must have experience building ML Infrastructure and cloud architecture from scratch More ❯
BMS, CAFM, etc. Act as Senior Authorised Person (SAP) for High and Low Voltage systems. Manage the Permit to Work (PTW) system and review RAMS for all activities. Lead incidentresponse, root cause analysis, and corrective action processes. Deliver robust Planned Preventative Maintenance (PPM) and reactive maintenance schedules. Monitor site KPIs, service levels, and operational risks, ensuring swift More ❯
BMS, CAFM, etc. Act as Senior Authorised Person (SAP) for High and Low Voltage systems. Manage the Permit to Work (PTW) system and review RAMS for all activities. Lead incidentresponse, root cause analysis, and corrective action processes. Deliver robust Planned Preventative Maintenance (PPM) and reactive maintenance schedules. Monitor site KPIs, service levels, and operational risks, ensuring swift More ❯
BMS, CAFM, etc. Act as Senior Authorised Person (SAP) for High and Low Voltage systems. Manage the Permit to Work (PTW) system and review RAMS for all activities. Lead incidentresponse, root cause analysis, and corrective action processes. Deliver robust Planned Preventative Maintenance (PPM) and reactive maintenance schedules. Monitor site KPIs, service levels, and operational risks, ensuring swift More ❯
london (city of london), south east england, united kingdom
PRS
BMS, CAFM, etc. Act as Senior Authorised Person (SAP) for High and Low Voltage systems. Manage the Permit to Work (PTW) system and review RAMS for all activities. Lead incidentresponse, root cause analysis, and corrective action processes. Deliver robust Planned Preventative Maintenance (PPM) and reactive maintenance schedules. Monitor site KPIs, service levels, and operational risks, ensuring swift More ❯
to staff and volunteers to strengthen awareness and understanding across the Diocese. Additionally, you will: - Review and advise on third-party processing and sharing - Support statutory records management and incidentresponse - Handle data subject rights and third-party requests - Contribute to reviews and improvements of privacy governance About You To be considered as a Deputy Data Protection Officer More ❯
managed platform to secure and recover data from all threats. The Druva Data Security Cloud ensures the availability, confidentiality, and fidelity of data - providing customers with autonomous protection, rapid incidentresponse, and guaranteed data recovery. The company is trusted by its more than 6,000 customers, including 65 of the Fortune 500, to defend business data in today More ❯
authentication/authorisation workflows, entitlements, permissions, and account lifecycle operations. Collaborate closely with InfoSec and Engineering teams to design and implement secure-by-default identity solutions and to support incidentresponse or troubleshooting. Contribute to documentation (playbooks and runbooks) and build enablement tooling or training for internal support teams. Apply IAM security best practices and ensure alignment with More ❯
Officer and be the primary contact dealing with ICO registrations, DSARs, privacy notices, risk assessments and mitigations (e.g. business process assessments, DPIAs); data-related supply chain due diligence; data incident responses; and co-ordinating with Technology team security risk assessments Records Management: Work with Grosvenors archivists on longer term retention and records preservation, including digital preservation. Training and Awareness More ❯
Service Management Engineer to join a growing team. In this role, you'll be at the heart of the IT monitoring and event management services, ensuring smooth operations, rapid incidentresponse, and proactive service improvement. If you're passionate about technology, problem-solving, and delivering excellent service, this could be the role for you. What you'll be … maintenance. Skilled in creating and managing operational checklists for system health. Solid understanding of IT processes and event management within ITSM frameworks. Desirable Familiarity with ITIL best practices (Event, Incident, and Problem Management). Experience in IT support or operations roles. About you Proactive, flexible, and eager to learn. Strong analytical and problem-solving skills. Organised with excellent communication More ❯