Commercial awareness with a track record of delivering profitable portfolio growth. Confident communicator with strong broker relationships and market presence. Familiarity with cyber risk scoring tools, pricing models, and incidentresponse frameworks. Professional qualifications such as ACII, or relevant technical certifications (e.g. CISSP) are an advantage. Experience working in a Lloyd’s, MGA cyber underwriting environment. Exposure to More ❯
Commercial awareness with a track record of delivering profitable portfolio growth. Confident communicator with strong broker relationships and market presence. Familiarity with cyber risk scoring tools, pricing models, and incidentresponse frameworks. Professional qualifications such as ACII, or relevant technical certifications (e.g. CISSP) are an advantage. Experience working in a Lloyd’s, MGA cyber underwriting environment. Exposure to More ❯
london (city of london), south east england, united kingdom
Santam Syndicate 1918
Commercial awareness with a track record of delivering profitable portfolio growth. Confident communicator with strong broker relationships and market presence. Familiarity with cyber risk scoring tools, pricing models, and incidentresponse frameworks. Professional qualifications such as ACII, or relevant technical certifications (e.g. CISSP) are an advantage. Experience working in a Lloyd’s, MGA cyber underwriting environment. Exposure to More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Tenth Revolution Group
others. Strong analytical and communication skills in a distributed team environment. Customer-first mindset with a data-driven approach to problem-solving. Experience with large-scale distributed systems and incident response. Familiarity with LLM technologies (RAG, prompt engineering, evaluation methods). Knowledge of Azure services and Microsoft's ecosystem. Ability to work independently and collaboratively. Degree in Data Science More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Tenth Revolution Group
others. Strong analytical and communication skills in a distributed team environment. Customer-first mindset with a data-driven approach to problem-solving. Experience with large-scale distributed systems and incident response. Familiarity with LLM technologies (RAG, prompt engineering, evaluation methods). Knowledge of Azure services and Microsoft's ecosystem. Ability to work independently and collaboratively. Degree in Data Science More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Tenth Revolution Group
others. Strong analytical and communication skills in a distributed team environment. Customer-first mindset with a data-driven approach to problem-solving. Experience with large-scale distributed systems and incident response. Familiarity with LLM technologies (RAG, prompt engineering, evaluation methods). Knowledge of Azure services and Microsoft's ecosystem. Ability to work independently and collaboratively. Degree in Data Science More ❯
stakeholder management skills. Comfortable working in a fast-paced, iterative, and agile environment. Preferred Experience: Solid understanding of cyber security concepts such as threat detection, SIEM, anomaly detection, and incident response. Experience with tools for tracking ML models in production (e.g., MLflow). We encourage you to apply even if your experience is not a 100% match with the More ❯
and historical treasury data with focus on accuracy, reliability, and performance Contributing to deployment, monitoring, and scaling of systems while working with operations teams to ensure uptime and quick incidentresponse Key Requirements 5-15 years of professional experience in software engineering, ideally in front office finance , fintech , high-frequency trading , or technology-driven startups Expert-level Python More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
QBE Management Services (UK) Limited
and emerging threats. Monitor evolving attacker tactics (TTPs), integrating insights into detection and prevention strategies. Collaborate across Cyber Security, Engineering, and Global IT teams to enhance security posture and incident response. About you Strong experience across both offensive and defensive cyber security disciplines. Deep understanding of attacker tactics, techniques, and procedures (TTPs), with expertise in the MITRE ATT&CK More ❯
that impact the support team Understand and stay updated on the regulatory environment in EMEA to ensure compliance with all relevant laws and regulations. Drive continuous process improvements Coordinate incidentresponse efforts and client communications Supporting new product releases Assist with supporting vendor relationships and due diligence Manage production support oversight and coordinate incident triage Ensure client More ❯
markets expertise across areas such as Fixed Income, Equities, FX, Derivatives, or Futures Maintain and optimise monitoring infrastructure for Front and Middle office technology stack Take ownership of production incident workflows and resolution processes Drive incidentresponse protocols and coordinate resolution efforts Support and mentor Level 1 (L1) team members through technical challenges Build relationships with business More ❯
support team to drive continuous improvement in service delivery quality. Provide professional insights into AC/DC charging technologies , including fault diagnosis and issue analysis. Lead maintenance process optimization, incidentresponse mechanisms, and standardization of service workflows. Act as a coordination and technical interface in major service issues, ensuring efficient problem resolution for customers. Service Operations Support Support … the development and optimization of preventive maintenance, troubleshooting, and spare parts management processes. Monitor and promote the execution of Service Level Agreements (SLAs) to improve response times and customer satisfaction. Work with customer success, sales teams, and third-party service providers to ensure consistent service delivery. Responsible for the selection, onboarding, and management of service partners , ensuring their competence … requirements and coordinate resources to ensure efficient closure of technical and service issues. Remote Monitoring & Fault Management Collaboration Coordinate with remote monitoring teams to enhance proactive alerting and issue response mechanisms. Support the application and advancement of remote diagnostics and predictive maintenance capabilities. Qualifications & Requirements Education & Experience Bachelor's degree or above in Electrical Engineering, Mechanical Engineering, or a More ❯
support team to drive continuous improvement in service delivery quality. Provide professional insights into AC/DC charging technologies , including fault diagnosis and issue analysis. Lead maintenance process optimization, incidentresponse mechanisms, and standardization of service workflows. Act as a coordination and technical interface in major service issues, ensuring efficient problem resolution for customers. Service Operations Support Support … the development and optimization of preventive maintenance, troubleshooting, and spare parts management processes. Monitor and promote the execution of Service Level Agreements (SLAs) to improve response times and customer satisfaction. Work with customer success, sales teams, and third-party service providers to ensure consistent service delivery. Responsible for the selection, onboarding, and management of service partners , ensuring their competence … requirements and coordinate resources to ensure efficient closure of technical and service issues. Remote Monitoring & Fault Management Collaboration Coordinate with remote monitoring teams to enhance proactive alerting and issue response mechanisms. Support the application and advancement of remote diagnostics and predictive maintenance capabilities. Qualifications & Requirements Education & Experience Bachelor's degree or above in Electrical Engineering, Mechanical Engineering, or a More ❯
family, retirement Employee, friends and family discounts across 1200+ retail, hospitality and lifestyle brands Key Responsibilities Be responsible for the development and refinement of processes for issue management and incidentresponse Oversee and ensure effective processes are in place for the handling and communication of incidents and issues across Product Clusters, Internal Teams and Stakeholders Establish metrics and … reporting systems to track incidents, issues and their resolutions Lead and run the monthly Ops and Incident review meetings with regions/stakeholders Manage the product support process to ensure efficient resolution of issues advanced to the T2 support team and customer satisfaction Own and refine our quarterly planning process Accountable for the creation and implementation of scalable product … ideally in B2B SaaS or technology-focused companies Deep understanding of product management best practices-user research, market insights, goal setting, prioritisation, execution, and leadership Familiarity with monitoring tools, incident management protocols, and collaboration with Site Reliability Engineering (SRE) teams Proven ability to develop relationships and align teams across product, engineering, and leadership to ensure the effective execution of More ❯
and manage SIEM solutions using Splunk, including data onboarding, creating correlation rules, and developing dashboards and reports to improve security visibility. Develop and maintain detection use cases, alerts, and response playbooks based on threat intelligence and compliance requirements. Monitor, tune, and optimise Splunk performance to ensure availability, scalability, and accuracy of security event logging and analysis. Work closely with … SOC analysts, incidentresponse, and threat hunting teams to enhance threat detection, investigation capabilities, and response effectiveness. More ❯
City of London, London, United Kingdom Hybrid / WFH Options
REC SOLUTIONS LIMITED
clients. Collaborate with development, networks, ops and product teams on strategic IT initiatives. Assist with planning, management and resource allocation of inter-departmental projects alongside the PM team. Oversee incident management, root cause analysis, and rapid resolution of system outages or performance degradation. Ensure compliance of procedures such as change management, patch management and security and audit processes. Assist … in the maintenance of these procedures. Support regular security audits and penetration tests, addressing findings and oversee any remediation work. Improve system monitoring, alerting, documentation, operating procedures and incidentresponse processes. Manage, mentor, plan and coordinate the activities of both teams. Required Skills/Experience Ideally 7+ years Linux system administration experience with at least 3 years in … and rollback strategies. Proficiency in managing bare-metal servers, virtualization platforms such as VMware, containerised systems as well as familiarity with hybrid cloud environments and DevOPS practices. Familiarity with incident/problem management and change control workflows a plus. Previous experience with low latency systems or financial experience desirable. Excellent verbal and written communication skills, with the ability to More ❯
City of London, London, United Kingdom Hybrid / WFH Options
REC SOLUTIONS LIMITED
clients. Collaborate with development, networks, ops and product teams on strategic IT initiatives. Assist with planning, management and resource allocation of inter-departmental projects alongside the PM team. Oversee incident management, root cause analysis, and rapid resolution of system outages or performance degradation. Ensure compliance of procedures such as change management, patch management and security and audit processes. Assist … in the maintenance of these procedures. Support regular security audits and penetration tests, addressing findings and oversee any remediation work. Improve system monitoring, alerting, documentation, operating procedures and incidentresponse processes. Manage, mentor, plan and coordinate the activities of both teams. Required Skills/Experience Ideally 7+ years Linux system administration experience with at least 3 years in … and rollback strategies. Proficiency in managing bare-metal servers, virtualization platforms such as VMware, containerised systems as well as familiarity with hybrid cloud environments and DevOPS practices. Familiarity with incident/problem management and change control workflows a plus. Previous experience with low latency systems or financial experience desirable. Excellent verbal and written communication skills, with the ability to More ❯
Uxbridge, Middlesex, England, United Kingdom Hybrid / WFH Options
Network IT
reference point for network security , advising IT teams and stakeholders. Perform vulnerability assessments, audits, and continuous traffic monitoring for anomalies. Analyse and respond to security incidents, developing remediation and incident reports. Provide 2nd/3rd line support for network services, ensuring smooth operation and timely resolutions. Contribute to the design, implementation, and improvement of network infrastructure. Lead on monitoring … project and change management, providing expertise on network security in planning and delivery. Experience Requirements: Strong technical background in network infrastructure and security operations . Experience with vulnerability assessments, incidentresponse, and security tools. Knowledge of enterprise networking (wired, wireless, remote access). Familiarity with monitoring, alerting, and reporting solutions. Strong problem-solving skills and the ability to More ❯
Flink, Kafka, and Python. This is a fantastic opportunity to step into a SRE role focused on data reliability in a modern cloud native environment, with full ownership of incident management, architecture, and performance. The Role: Maintaining and monitoring real-time and batch data pipelines using Flink, Kafka, Python, and AWS Act as an escalation point for critical data … of Apache Flink, Kafka, and Python in production environments Hands-on AWS experience with AWS (Lambda, EMR, Step Functions, Redshift, etc.) Comfortable with monitoring tools, distributed systems debugging, and incidentresponse Reference Number: BBBH259303 To apply for this role or for to be considered for further roles, please click "Apply Now" or contact Tommy Williams at Rise Technical More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
Flink, Kafka, and Python. This is a fantastic opportunity to step into a SRE role focused on data reliability in a modern cloud native environment, with full ownership of incident management, architecture, and performance. The Role: *Maintaining and monitoring real-time and batch data pipelines using Flink, Kafka, Python, and AWS *Act as an escalation point for critical data … of Apache Flink, Kafka, and Python in production environments *Hands-on AWS experience with AWS (Lambda, EMR, Step Functions, Redshift, etc.) *Comfortable with monitoring tools, distributed systems debugging, and incidentresponse Reference Number: BBBH259303 To apply for this role or for to be considered for further roles, please click 'Apply Now' or contact Tommy Williams at Rise Technical More ❯
the CMDB as a real-time, trusted system of record for configuration items across cloud, on-prem, and hybrid environments. Embed SACM capabilities into core IT processes including observability, incidentresponse, service management, and architecture governance. Champion automation, transparency, and traceability of all infrastructure, software, and asset relationships. Automation & Integration: Build and operate a fully automated CMDB with … Alignment: Apply SRE principles to ensure reliability, performance, and resilience of the SACM platform. Embed SACM into 24x7 operations and observability platforms to support real-time decision-making. Support incident prevention, root cause analysis, and continuous improvement through data-driven insights. Define and enforce service level objectives (SLOs) and key performance indicators (KPIs) for SACM health and value. Governance More ❯
the CMDB as a real-time, trusted system of record for configuration items across cloud, on-prem, and hybrid environments. Embed SACM capabilities into core IT processes including observability, incidentresponse, service management, and architecture governance. Champion automation, transparency, and traceability of all infrastructure, software, and asset relationships. Automation & Integration: Build and operate a fully automated CMDB with … Alignment: Apply SRE principles to ensure reliability, performance, and resilience of the SACM platform. Embed SACM into 24x7 operations and observability platforms to support real-time decision-making. Support incident prevention, root cause analysis, and continuous improvement through data-driven insights. Define and enforce service level objectives (SLOs) and key performance indicators (KPIs) for SACM health and value. Governance More ❯
using tools like Prometheus, Grafana, ELK, or Datadog Collaborate with development teams to optimize build, test, and deployment workflows Ensure system reliability, scalability, and performance through proactive monitoring and incidentresponse Implement security best practices across infrastructure and deployment processes Containerize applications using Docker and orchestrate with Kubernetes or similar platforms Maintain version control systems and branching strategies More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Infoplus Technologies UK Limited
using tools like Prometheus, Grafana, ELK, or Datadog Collaborate with development teams to optimize build, test, and deployment workflows Ensure system reliability, scalability, and performance through proactive monitoring and incidentresponse Implement security best practices across infrastructure and deployment processes Containerize applications using Docker and orchestrate with Kubernetes or similar platforms Maintain version control systems and branching strategies More ❯
london, south east england, united kingdom Hybrid / WFH Options
Infoplus Technologies UK Limited
using tools like Prometheus, Grafana, ELK, or Datadog Collaborate with development teams to optimize build, test, and deployment workflows Ensure system reliability, scalability, and performance through proactive monitoring and incidentresponse Implement security best practices across infrastructure and deployment processes Containerize applications using Docker and orchestrate with Kubernetes or similar platforms Maintain version control systems and branching strategies More ❯