arena that arise. When issues, incidents, and/or problems arise, the Senior Network Engineer drives the situation and oversees the Networking team's troubleshooting and problem resolution, including Root Cause Analysis (RCA). The Senior Network Engineer is responsible for ensuring the Networking team as a whole produces and maintains documentation of the network systems(s … IISRP, OSPP, PC, QoS, STP, VPC, VRF). Shall have 5 or more years of demonstrated understanding and hands-on experience in the following networking concepts: network traffic flow analysis, network management, network topology design, network security, performance, high availability, load balancing, and fault-tolerant architectures. Shall have 5 or more years of experience working with secure encrypted networking
process improvement projects using methodologies such as Lean Six Sigma. Analyze IT systems, workflows, and performance metrics to identify areas for enhancement. Facilitate working sessions to collect data, perform root cause analysis, and design solutions. Develop, implement, and track key performance indicators to measure improvement success. Prepare process documentation, standard operating procedures (SOPs), and best practice guides.
across AWS, Azure, or other government cloud environments Monitor and manage cloud security posture using tools like CSPM and CWPP solutions Respond to and investigate cloud security incidents, perform root cause analysis, and recommend remediation actions Hands-on experience with at least three of the following: CrowdStrike, Microsoft Defender for Endpoint, Cisco Firepower, ExtraHop, ForeScout, Gigamon. Collaborate
Greater London, England, United Kingdom Hybrid / WFH Options
Larbey Evans
system health and performance; initiate health checks and proactively remediate issues Respond to and resolve incidents and service requests in line with SLAs Provide break/fix troubleshooting and root cause analysis across supported systems Collaborate with infrastructure teams to support system scalability and optimization Facilitate alignment between delivery teams and Information Security, Infrastructure Partner with stakeholders
London, South East, England, United Kingdom Hybrid / WFH Options
Larbey Evans
system health and performance; initiate health checks and proactively remediate issues Respond to and resolve incidents and service requests in line with SLAs Provide break/fix troubleshooting and root cause analysis across supported systems Collaborate with infrastructure teams to support system scalability and optimization Facilitate alignment between delivery teams and Information Security, Infrastructure Partner with stakeholders
Slough, South East England, United Kingdom Hybrid / WFH Options
Larbey Evans
system health and performance; initiate health checks and proactively remediate issues Respond to and resolve incidents and service requests in line with SLAs Provide break/fix troubleshooting and root cause analysis across supported systems Collaborate with infrastructure teams to support system scalability and optimization Facilitate alignment between delivery teams and Information Security, Infrastructure Partner with stakeholders
performance. The Role: Maintaining and monitoring real-time and batch data pipelines using Flink, Kafka, Python, and AWS Act as an escalation point for critical data incidents and lead root cause analysis Optimising system performance, define SLIs/SLOs, and drive reliability Working closely with various other departments and teams to architect scalable, fault-tolerant data solutions
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
performance. The Role: *Maintaining and monitoring real-time and batch data pipelines using Flink, Kafka, Python, and AWS *Act as an escalation point for critical data incidents and lead root cause analysis *Optimising system performance, define SLIs/SLOs, and drive reliability *Working closely with various other departments and teams to architect scalable, fault-tolerant data solutions
Kansas City Metropolitan Area. Responsibilities: Architect and develop Python-based microservices (FastAPI, Flask, or custom). Translate data formats (JSON, Parquet, Avro) and develop automation/scripting solutions. Drive root cause analysis and troubleshooting across staging and production. Lead integration efforts with DevOps, security, and cloud infrastructure teams. Guide CI/CD improvements, observability tooling, and service
support of the company's technologies, including email, voicemail, and other enterprise systems. Take an active role in the Incident Response and Problem Management processes, representing the desktop environment. Provide root cause analysis for problems and measures to mitigate future occurrences. Supervise the daily activities of the end user and desktop support function including, but not limited to
qualification, audits, and corrective actions. Review and approve quality documentation to ensure compliance with specifications and regulatory standards. Monitor product performance and customer feedback for areas of improvement. Conduct root cause analysis and implement corrective and preventive actions. Collaborate with R&D, manufacturing, and supply chain teams to embed quality throughout the product lifecycle. Manage internal and
Leatherhead, Surrey, England, United Kingdom Hybrid / WFH Options
Recruitvirt Ltd
data protection, recovery, and disaster recovery (DR) across on-prem and hybrid workloads. Manage incidents, service requests, and change controls via standard ITIL-based processes. Lead and participate in root cause analysis for infrastructure-related incidents and issues. Maintain and update detailed technical documentation and configuration records. Act as a senior point of contact for customers, attending
San Diego, California, United States Hybrid / WFH Options
Gridiron IT Solutions
using Infrastructure as Code with Terraform and configuration management tools like Ansible Automate repetitive tasks to eliminate toil and drive consistency + repeatability Actively participate in incident response and root-cause analysis, support a blameless post-mortem culture Qualifications: Eligible for Top Secret/SCI Security clearance 5+ Years Experience working in a Security culture Experience working
Work collaboratively with development, DevOps, and security teams to ensure data governance, compliance, and operational efficiency. Implement monitoring and alerting solutions using tools like CloudWatch, Datadog, or Prometheus. Conduct root cause analysis (RCA) and develop long-term preventive strategies. Maintain and enforce database standards, documentation, and operational procedures. Required Qualifications: 7+ years of experience in database engineering
/experience with workstations, laptops, printers, smartphones, and tablets. Working knowledge/experience of PC imaging tools, diagnosis and remote-control tools, documentation, and ticketing. Excellent troubleshooting, problem-solving, and root cause analysis skills. Excellent customer service skills - Must be able to interact in person with customers who are experiencing network/technology-related issues. Ability and willingness
and introducing new tools or automations. Understanding of the ITIL framework or service management methodology Experience in incident management, including owning the response process for urgent issues and ensuring root cause analysis is performed and documented. Excellent communication skills, both written and verbal. Hands-on experience with service desk tools, e.g. Jira, Zendesk, ServiceNow. If that's
in alignment with ITSM principles to ensure consistent service quality. Incident & Change Management - Direct incident, problem, and change management activities in accordance with ITIL standards, ensuring rapid issue resolution, root-cause analysis, and long-term service stability. Technical Collaboration - Liaise closely with Salesforce and AWS specialists to coordinate upgrades, patch releases, and enhancements, ensuring minimal service disruption
North West London, London, United Kingdom Hybrid / WFH Options
SEFE MARKETING & TRADING LIMITED
Oracle estate is secured, up-to-date with security patches, operating system updates, and aligned with company policies. Maintain proper database security and monitor compliance. You'll provide prompt, precise root cause analysis and work closely with IT Development and Infrastructure teams to resolve issues and improve performance. Disaster Recovery & Incident Management: Participate in disaster recovery exercises, ensure
Bradford, Yorkshire and the Humber, United Kingdom
Alscient
in alignment with ITSM principles to ensure consistent service quality. Incident & Change Management - Direct incident, problem, and change management activities in accordance with ITIL standards, ensuring rapid issue resolution, root-cause analysis, and long-term service stability. Technical Collaboration - Liaise closely with Salesforce and AWS specialists to coordinate upgrades, patch releases, and enhancements, ensuring minimal service disruption
adjustments. Coordinate with procurement agents, suppliers, and repair centers to ensure timely and quality repairs. Monitor inventory levels of repairable spares and manage the logistics of parts movement. Conduct root cause analysis to identify and address recurring issues with spare parts. Maintain detailed records of repair activities, costs, and inventory status. Ensure compliance with company policies and
Cheltenham, Gloucestershire, South West, United Kingdom
LM RECRUITMENT SOLUTIONS LTD
Lead the adoption of proactive monitoring and automation tools to help transition the business from reactive support to predictive, streamlined operations. Lead on service management excellence: ticket discipline, root cause analysis, and continuous improvement. Ensure all backup strategies (on-premises and cloud) are fit for purpose, with robust monitoring and management to maintain data integrity and
troubleshoot application functionality within VDI sessions in partnership with application owners Create and manage desktop pools in Horizon Administrator/Console, including both persistent and non-persistent configurations Perform root cause analysis for recurring issues and implement permanent fixes Maintain user entitlement mappings and access to appropriate VDI pools Monitor system health and generate reports on performance
device, configuration, able to easily navigate the CLI, deploy applicable patches, and make configuration changes as needed. Have strong analytical and problem-solving skills. Candidates are expected to perform root cause analysis to troubleshoot and identify issues at all layers of the network. Expertise with WAN/Transport and IP routing technologies and protocols, candidates should have an