london (city of london), south east england, united kingdom Hybrid / WFH Options
Oho Group Ltd
infrastructure A thoughtful, pragmatic engineering approach Curiosity about security and detection (no prior experience required) Bonus if you’ve worked with: Event-driven or distributed systems Security tooling or incidentresponse workflows Why Join? Work on hard, meaningful problems in cybersecurity Be part of a fast, technical, remote-first team Competitive salary and meaningful equity Founding Engineer - London More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Oho Group Ltd
infrastructure A thoughtful, pragmatic engineering approach Curiosity about security and detection (no prior experience required) Bonus if you’ve worked with: Event-driven or distributed systems Security tooling or incidentresponse workflows Why Join? Work on hard, meaningful problems in cybersecurity Be part of a fast, technical, remote-first team Competitive salary and meaningful equity Founding Engineer - London More ❯
Lead Site Reliability Engineer to bring innovation, leadership, and technical excellence to our growing team. What You'll Do: Design and implement scalable, efficient systems for maximum reliability. Lead incidentresponse and implement monitoring solutions to maintain high system uptime. Optimize performance through in-depth analysis and continuous improvement. Develop preventive maintenance programs and carry out Root Cause More ❯
Lead Site Reliability Engineer to bring innovation, leadership, and technical excellence to our growing team. What You'll Do: Design and implement scalable, efficient systems for maximum reliability. Lead incidentresponse and implement monitoring solutions to maintain high system uptime. Optimize performance through in-depth analysis and continuous improvement. Develop preventive maintenance programs and carry out Root Cause More ❯
Infrastructure as Code) Work with virtualisation (VMware/vSphere, etc.) Configure/manage SAN/storage, Fibre Channel, zoning, LUN provisioning Participate in vulnerability assessments, patches, security hardening, and incidentresponse Required Skills & Experience NPPV3 clearance, either current or active within the last 12 months (non-negotiable) Strong track record with Windows 11 deployment (imaging, upgrade, Autopilot, Intune More ❯
Infrastructure as Code) Work with virtualisation (VMware/vSphere, etc.) Configure/manage SAN/storage, Fibre Channel, zoning, LUN provisioning Participate in vulnerability assessments, patches, security hardening, and incidentresponse Required Skills & Experience NPPV3 clearance, either current or active within the last 12 months (non-negotiable) Strong track record with Windows?11 deployment (imaging, upgrade, Autopilot, Intune More ❯
Infrastructure as Code) Work with virtualisation (VMware/vSphere, etc.) Configure/manage SAN/storage, Fibre Channel, zoning, LUN provisioning Participate in vulnerability assessments, patches, security hardening, and incidentresponse Required Skills & Experience NPPV3 clearance, either current or active within the last 12 months (non-negotiable) Strong track record with Windows 11 deployment (imaging, upgrade, Autopilot, Intune More ❯
united kingdom, united kingdom Hybrid / WFH Options
Mercor
and fast. You will work across every layer of the stack to drive measurable reliability improvements. Responsibilities Mentor engineers on best practices for observability, alert management, and instrumentation. Lead incidentresponse from triage through post-mortem and remediation. Own and improve load-testing, disaster-recovery, and chaos-engineering programs. Automate reliability checks, capacity planning, and service-level monitoring. More ❯
for automation, cost savings, performance improvement, and scalability. Own capacity planning, infrastructure budgeting, and vendor management. Operational Excellence Ensure high availability, performance, and security of all infrastructure services. Oversee incidentresponse and root cause analysis for infrastructure-related issues. Monitor KPIs and SLAs, ensuring service delivery meets or exceeds expectations. Collaboration & Communication Work closely with cross-functional teams More ❯
for automation, cost savings, performance improvement, and scalability. Own capacity planning, infrastructure budgeting, and vendor management. Operational Excellence Ensure high availability, performance, and security of all infrastructure services. Oversee incidentresponse and root cause analysis for infrastructure-related issues. Monitor KPIs and SLAs, ensuring service delivery meets or exceeds expectations. Collaboration & Communication Work closely with cross-functional teams More ❯
monitoring, cost optimization, invoice reconciliation, and contract renewals. Monitor and remediate device compliance and security posture (encryption, passcode, OS version minimums, managed open-in, DLP); coordinate with Security for incidentresponse and hardening. Maintain accurate asset and SIM inventory; track chain of custody and ensure audit readiness. Create and maintain documentation, runbooks, and end-user guides for enrollment More ❯
monitoring, cost optimization, invoice reconciliation, and contract renewals. Monitor and remediate device compliance and security posture (encryption, passcode, OS version minimums, managed open-in, DLP); coordinate with Security for incidentresponse and hardening. Maintain accurate asset and SIM inventory; track chain of custody and ensure audit readiness. Create and maintain documentation, runbooks, and end-user guides for enrollment More ❯
handling distributed data processing workloads. Minimum 1 years' experience with Microsoft PowerBI Desirable Skills Solid understanding of SRE principles applied to data platforms (pipeline reliability, monitoring, CI/CD, incidentresponse, automation/self-healing). GitHub for version control, collaboration, and workflow management. Datadog for application health and performance monitoring. General database fundamentals and query optimization Experience More ❯
is reliable, scalable, and secure. Ensure the reliability, availability, and scalability of the systems, platforms, and technology through the application of software engineering techniques, automation, and best practices in incident response. To be successful in this role as an Infrastructure Engineer - Production Network Engineering, you should possess the following skillsets: Extensive experience as an individual contributor in the design More ❯
is reliable, scalable, and secure. Ensure the reliability, availability, and scalability of the systems, platforms, and technology through the application of software engineering techniques, automation, and best practices in incident response. AccountabilitiesBuild Engineering: Development, delivery, and maintenance of high-quality infrastructure solutions to fulfil business requirements ensuring measurable reliability, performance, availability, and ease of use. Including the identification of More ❯
is reliable, scalable, and secure. Ensure the reliability, availability, and scalability of the systems, platforms, and technology through the application of software engineering techniques, automation, and best practices in incident response. AccountabilitiesBuild Engineering: Development, delivery, and maintenance of high-quality infrastructure solutions to fulfil business requirements ensuring measurable reliability, performance, availability, and ease of use. Including the identification of More ❯
St. Albans, Hertfordshire, England, United Kingdom
Method Resourcing
you'll do Lead the design, build, deployment, and operation of critical software systems. Architect and deliver the shift to an event-driven microservices environment. Improve automation, monitoring, and incidentresponse capability. Partner with Product and stakeholders to define and execute the roadmap. Mentor and develop engineers, driving a culture of quality and accountability. What you'll bring More ❯
St. Albans, Hertfordshire, South East, United Kingdom
Method-Resourcing
you'll do Lead the design, build, deployment, and operation of critical software systems. Architect and deliver the shift to an event-driven microservices environment. Improve automation, monitoring, and incidentresponse capability. Partner with Product and stakeholders to define and execute the roadmap. Mentor and develop engineers, driving a culture of quality and accountability. What you'll bring More ❯
Hayes, Hillingdon, Greater London, UK Hybrid / WFH Options
The Electric Car Scheme
proactive monitoring, and identifying potential risks. Proven ability to lead technical initiatives from concept to completion, often involving multiple team members or complex integrations. Well versed in production operations, incidentresponse, and performance optimisation. You proactively identify and mitigate risks to ensure system stability and scalability. Benefits: Hybrid working with 2 days in the office (Hayes, London More ❯
Hayes, England, United Kingdom Hybrid / WFH Options
The Electric Car Scheme
proactive monitoring, and identifying potential risks. Proven ability to lead technical initiatives from concept to completion, often involving multiple team members or complex integrations. Well versed in production operations, incidentresponse, and performance optimisation. You proactively identify and mitigate risks to ensure system stability and scalability. Benefits: Hybrid working with 2 days in the office (Hayes, London More ❯
Hayes, south east england, united kingdom Hybrid / WFH Options
The Electric Car Scheme
proactive monitoring, and identifying potential risks. Proven ability to lead technical initiatives from concept to completion, often involving multiple team members or complex integrations. Well versed in production operations, incidentresponse, and performance optimisation. You proactively identify and mitigate risks to ensure system stability and scalability. Benefits: Hybrid working with 2 days in the office (Hayes, London More ❯
Manage and monitor the performance of internal tooling and fraud rules. Work cross-functionally across Operations, Engineering, Product, and Finance to mitigate areas of risk. Represent Trust & Safety during incidentresponse and create mitigation processes. About You: 3+ years of experience in fraud or chargeback operations, investigations, or a related Trust & Safety vertical. Experience with user restriction systems More ❯
Milton Keynes, Buckinghamshire, United Kingdom Hybrid / WFH Options
Rightmove PLC
metrics (CSAT, quality, speed, backlog health) to drive improvements. Analyse service data to identify trends, risks, and opportunities. Oversee resource planning and workload forecasting to maintain smooth operations. Manage incidentresponse standards and escalation processes, reducing friction across CX teams. Leading Teams Lead, coach, and support Team Leaders to build confident, high-performing teams. Take accountability for team More ❯
united kingdom, united kingdom Hybrid / WFH Options
Odin
growth and delivery Work closely with Product to deliver high-quality releases that customers love, balancing speed, quality, and maintainability Own and continuously improve processes across QA, support, and incidentresponse, ensuring reliability and operational excellence Hire exceptional engineers, define the bar, and coach others to interview and assess effectively Guide architectural decisions, coding standards, and security practices More ❯
software development and systems engineering. A high bar for code and configuration quality and readability. A good understanding of current observability and reliability practices. Experienced and comfortable in running incident response. Big picture thinking - you can make trade offs on technical work streams against business impact. Fantastic communication skills. You're able to articulate what you're working on More ❯