policies, principles and criteria covering connectivity, interfacing, resilience, recovery and access. Provides definitive and expert advice in the specialist area of engineering into other cyber security activities, such as incident response. Develops secure architecture patterns for emerging technologies within the engineering domain, in line with safety, integrity and availability requirements. Acts as a single point of contact for senior More ❯
is reliable, scalable, and secure. Ensure the reliability, availability, and scalability of the systems, platforms, and technology through the application of software engineering techniques, automation, and best practices in incident response. AccountabilitiesBuild Engineering: Development, delivery, and maintenance of high-quality infrastructure solutions to fulfil business requirements ensuring measurable reliability, performance, availability, and ease of use. Including the identification of More ❯
is reliable, scalable, and secure. Ensure the reliability, availability, and scalability of the systems, platforms, and technology through the application of software engineering techniques, automation, and best practices in incident response. AccountabilitiesBuild Engineering: Development, delivery, and maintenance of high-quality infrastructure solutions to fulfil business requirements ensuring measurable reliability, performance, availability, and ease of use. Including the identification of More ❯
is reliable, scalable, and secure. Ensure the reliability, availability, and scalability of the systems, platforms, and technology through the application of software engineering techniques, automation, and best practices in incident response. AccountabilitiesBuild Engineering: Development, delivery, and maintenance of high-quality infrastructure solutions to fulfil business requirements ensuring measurable reliability, performance, availability, and ease of use. Including the identification of More ❯
Chester, Cheshire, United Kingdom Hybrid / WFH Options
Whelen Engineering
and Responsibilities Lead and mentor the IT help desk, systems,and network teams, ensuring high performance and professional growth. Oversee the day-to-day delivery of IT services, including incidentresponse, service requests, system availability, and infrastructure support, while prioritizing and maintaining production systems uptime Manage work in the ticketing system (Jira), ensuring timely response, prioritization, and More ❯
Leading Consultancy continues to expand its EMEA presence and seek an Associate Director to join. As an Associate Director, you'll lead technical investigations involving cybersecurity breaches, digital forensics, and eDiscovery. Your work will span both hands-on investigation and More ❯
as job-specific technical skillsThis role can be based in our London, Knutsford or Glasgow locations. Purpose of the roleTo apply software engineering techniques, automation, and best practices in incidentresponse, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. AccountabilitiesAvailability, performance, and scalability of systems and services through proactive monitoring, maintenance … and capacity planning.Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring.Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience.Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning.Collaboration with development teams More ❯
as job-specific technical skillsThis role can be based in our London, Knutsford or Glasgow locations. Purpose of the roleTo apply software engineering techniques, automation, and best practices in incidentresponse, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. AccountabilitiesAvailability, performance, and scalability of systems and services through proactive monitoring, maintenance … and capacity planning.Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring.Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience.Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning.Collaboration with development teams More ❯
specific technical skills This role can be based in our London, Knutsford or Glasgow locations. Purpose of the role To apply software engineering techniques, automation, and best practices in incidentresponse, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactive monitoring … maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance More ❯
specific technical skills This role can be based in our London, Knutsford or Glasgow locations. Purpose of the role To apply software engineering techniques, automation, and best practices in incidentresponse, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactive monitoring … maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance More ❯
specific technical skills This role can be based in our London, Knutsford or Glasgow locations. Purpose of the role To apply software engineering techniques, automation, and best practices in incidentresponse, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactive monitoring … maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance More ❯
and video services. Oversee live event execution, SLA compliance, service bookings, and customer support. Act as the senior point of escalation for complex incidents (Tier 3 support). Drive incidentresponse, root cause analysis, and proactive monitoring/reporting. Develop and implement TOC strategy, staffing models, and documentation standards. Participate in systems architecture, new tech evaluation, and vendor … a TOC, NOC, or MCR environment. Strong understanding of live broadcast workflows, encoding, transmission, and routing. Deep knowledge of TCP/IP networking (switching, routing, multicast). Excellent leadership, incident management, and performance development skills. Strong documentation and process optimisation experience. High-pressure decision-making and problem-solving capabilities. Proficiency with Excel/Google Sheets; adaptable across Windows, MacOS More ❯
and video services. Oversee live event execution, SLA compliance, service bookings, and customer support. Act as the senior point of escalation for complex incidents (Tier 3 support). Drive incidentresponse, root cause analysis, and proactive monitoring/reporting. Develop and implement TOC strategy, staffing models, and documentation standards. Participate in systems architecture, new tech evaluation, and vendor … a TOC, NOC, or MCR environment. Strong understanding of live broadcast workflows, encoding, transmission, and routing. Deep knowledge of TCP/IP networking (switching, routing, multicast). Excellent leadership, incident management, and performance development skills. Strong documentation and process optimisation experience. High-pressure decision-making and problem-solving capabilities. Proficiency with Excel/Google Sheets; adaptable across Windows, MacOS More ❯
london (shoreditch), south east england, united kingdom
Hamilton Barnes 🌳
and video services. Oversee live event execution, SLA compliance, service bookings, and customer support. Act as the senior point of escalation for complex incidents (Tier 3 support). Drive incidentresponse, root cause analysis, and proactive monitoring/reporting. Develop and implement TOC strategy, staffing models, and documentation standards. Participate in systems architecture, new tech evaluation, and vendor … a TOC, NOC, or MCR environment. Strong understanding of live broadcast workflows, encoding, transmission, and routing. Deep knowledge of TCP/IP networking (switching, routing, multicast). Excellent leadership, incident management, and performance development skills. Strong documentation and process optimisation experience. High-pressure decision-making and problem-solving capabilities. Proficiency with Excel/Google Sheets; adaptable across Windows, MacOS More ❯
Edinburgh, Midlothian, United Kingdom Hybrid / WFH Options
Aberdeen
Actions. Contribute to Infrastructure as Code (IaC) practices using Bicep or Terraform. Set up and maintain observability for integration components using Azure Monitor, Application Insights, and Log Analytics. Support incidentresponse and root cause analysis for integration-related issues. Apply security best practices across integration solutions, including authentication, encryption, and access control. Ensure compliance with internal and external … such as Logic Apps, Function Apps, Service Bus, Event Grid, Event Hub, and API Management. Experience with RESTful APIs, JSON, and integration patterns (eg, pub/sub, request/response, ETL). Understanding of DevOps practices and tools (Azure DevOps, GitHub, CI/CD). Knowledge of security and identity management in Azure (eg, OAuth2, Managed Identities, RBAC). More ❯
ll design scalable infrastructure, automate operations, and embed SRE principles to improve reliability and reduce toil. This is a highly influential role where you'll guide engineering standards, support incident management, and mentor others in building robust, cloud-native systems using modern DevOps practices. What You'll Bring: Strong experience supporting complex web applications and distributed systems, including Micro … DevOps, GitHub Actions) Solid grasp of cloud infrastructure (Azure or GCP), networking, and security best practices for web platforms Knowledge of SRE frameworks including SLOs, SLIs, error budgets, and incidentresponse Familiarity with testing tools such as Playwright, Vitest, and Jest Understanding of infrastructure-as-code (Terraform) and DevSecOps is a plus Why You Should Apply: You'll More ❯
ensuring high availability, security, and optimal performance. Core MPLS & ISP Infrastructure - Operate and maintain core MPLS and ISP backbone systems, including BGP peering and collaboration with upstream providers. Monitoring & IncidentResponse - Monitor alerts, enhance visibility via internal/customer-facing monitoring tools, and proactively address performance issues. Ticket & Workflow Management - Own incident resolution from start to finish … roles. Strong working knowledge of Juniper, Cisco ASA, Fortinet FortiGate, and Aruba network solutions. Expertise in MPLS, BGP, and ISP backbone infrastructure. Experience with network monitoring tools and proactive incident prevention. Solid understanding of routing, switching, VPN, firewall, and wireless networking technologies. Familiarity with virtualised networking environments and integration. Proficiency with packet analysis tools (eg, Wireshark) for deep troubleshooting. More ❯
it's about enabling defenders to act as one and sharing intelligence that drives action. Key Responsibilities: Triage and escalate reports as part of the Watch Officer rota. Support incidentresponse during high-alert periods. Monitor and assess emerging cyber threats. Share actionable threat intelligence via reports and briefings. Manage tooling (e.g. malware sandboxes, TIPs) and collaborate across More ❯
ensuring world-class reliability, safety, and performance. What You'll Do Lead and develop on-site engineering teams, fostering a culture of safety, compliance, and operational excellence. Oversee maintenance, incidentresponse, upgrades, and vendor management. Ensure uptime, customer satisfaction, and seamless service delivery. Manage budgets, contracts, and site financial performance (OPEX & CapEx). Drive continuous improvement and support More ❯
s head office in Plymouth, this role is integral to their expansion and ongoing success. As the Contact Centre Operator, you will play a key role in both reactive incidentresponse and proactive monitoring. Day-to-day of the role: Monitor security systems, alarms, and CCTV feeds to detect and respond to incidents. Escalate issues in line with More ❯
security monitoring tools and processes to improve threat detection and reduce false positives. Define detection use cases and recommend security investments to improve monitoring coverage. Create playbooks, standards, and incidentresponse processes for the OT environment. About You Strong experience in security operations , including analysing logs and detecting indicators of compromise. Proven background in working within Operational Technology More ❯
vulnerabilities. - Coordinate and lead disaster recovery drills and tests to ensure readiness and effectiveness. - Collaborate with IT and suppliers to design and document backup and recovery solutions. - Support Manage incidentresponse and ensure rapid recovery of operations in the event of a disaster. - Document and report on disaster recovery activities and outcomes. - Stay updated with industry best practices More ❯
RAG, and prompt engineering Familiarity with Azure services and cloud ecosystems Excellent communication and presentation skills A passion for mentoring and developing engineering talent Experience with distributed systems and incidentresponse Benefits: Flexible remote working Competitive salary 25 days holiday Private health insurance (after 1 year) Enhanced parental leave And more Please Note: This is a permanent role More ❯
RAG, and prompt engineering Familiarity with Azure services and cloud ecosystems Excellent communication and presentation skills A passion for mentoring and developing engineering talent Experience with distributed systems and incidentresponse Benefits: Flexible remote working Competitive salary 25 days holiday Private health insurance (after 1 year) Enhanced parental leave And more Please Note: This is a permanent role More ❯
RAG, and prompt engineering Familiarity with Azure services and cloud ecosystems Excellent communication and presentation skills A passion for mentoring and developing engineering talent Experience with distributed systems and incidentresponse Benefits: Flexible remote working Competitive salary 25 days holiday Private health insurance (after 1 year) Enhanced parental leave And more Please Note: This is a permanent role More ❯