Document as you go, to help colleagues follow what's been done and why. Drive tasks forward with energy and enthusiasm. Create proactive monitoring solutions using standard tooling. Conduct RCA (root cause analysis) for incidents. Develop and maintain self-managing infrastructure services and dashboards. Define the metrics of success and report on progress. Implement Infrastructure as Code for … automation tools (Puppet, Ansible, Git), pipelines (Azure DevOps) and test automation. Experience with CI/CD tooling (Azure DevOps). Comfortable with Elasticsearch log standardisation, Kibana dashboard creation and data analysis. Hands-on AWS: CloudFormation, Route 53, S3, DynamoDB, CloudWatch, Lambda, security and troubleshooting; Azure experience also useful. Certificate management and automation. Strong troubleshooting and diagnosis skills. Able …
and non-technical staff across numerous areas. * Proven ability to work independently on multiple tasks with commitment and willingness to see issues through to resolution * Excellent problem-solving and root cause analysis skills * Proficiency in understanding, analysing and defining corrective actions for any tickets raised by users * Understanding of virtualization and environments; ability to understand Intune administration * Knowledge …
are stable, scalable, performant, accessible, testable and secure by utilising endorsed technologies and applying common build patterns to minimise technical debt. Promote active application custodianship by participating in incident root-cause analysis to identify and act on learnings. Lead and mentor software engineers in their technical development within the team, driving a collaborative culture. Provide Domain Expert …
features. Authors and maintains comprehensive technical documentation including detailed system configurations, governance models, and operational procedures. Acts as a senior escalation point for Level 3/4 support, performing root cause analysis and driving long-term resolution of complex issues. Manages the technical scope, delivery timelines, and risk mitigation strategies for cloud engineering initiatives. Tracks and reports …
and in-depth experience of Oracle Engineered systems and subsystems, especially Exadata. Ability to troubleshoot and resolve complex hardware/software issues, restore environments to an operational state, perform root cause analysis and provide forward-thinking mitigation strategies. Good communication and analytical skills. Familiarity with security practices in web application delivery and general knowledge of network topology …
data. Perform regular vulnerability assessments, patch management, and security audits to safeguard infrastructure and prevent unauthorized access. Monitor systems for security incidents, respond to threats, and conduct investigations and root cause analysis to mitigate future risks. Manage relationships with vendors and external Managed Service Providers (MSPs) to ensure timely and effective support. Develop and maintain comprehensive documentation …
and peripheral equipment for executives. Mobile device support and advanced troubleshooting skills (Apple & Android technologies). Proactively identify potential technical issues and implement preventive solutions, with advanced troubleshooting and root cause analysis. Liaising with and delegating tasks to relevant teams for escalation. Supporting the Exec Support Specialist and escalating support issues to the Head of IT where necessary. …
infrastructure in a LiveOps environment. Scoping, design and implementation of cloud architecture. Implement, maintain and consolidate cloud testing and automation tools. Identifying and deploying cybersecurity measures. Incident management and root cause analysis. Working with our code and build teams to ensure a streamlined workflow. Minimum Requirements: Prior experience working in DevOps at a Senior level. Strong experience with …
Sheffield, South Yorkshire, England, United Kingdom
Massenhove Recruitment Limited
that cannot be addressed by First or Second Line support. You will play a key role in maintaining and improving the organisation’s IT infrastructure, performing deep-dive diagnostics and root cause analysis, and implementing long-term solutions. In addition to supporting escalated incidents, you will contribute to system design, strategic projects, and continuous service improvement. The ideal … Line Support Engineer - Key duties: Expert-Level Support & Issue Resolution: Take ownership of high-level, complex incidents and problems escalated from Second Line Support. Perform in-depth diagnostics and root cause analysis across infrastructure, systems, and applications. Develop and implement long-term fixes and preventative measures to reduce repeat incidents. Infrastructure Management & Improvement: Maintain, monitor, and optimise … support role. Strong expertise in server administration, networking, virtualisation, and storage solutions. Solid understanding of IT security principles and best practices. Ability to carry out detailed troubleshooting and perform root cause analysis. Experience managing or contributing to technical projects and service improvements. Proficiency in tools such as Active Directory, Group Policy, Office 365, Exchange, and Windows Server …
and operational health across critical payment systems. You will collaborate with our application support DevOps Labs to utilize observability platforms such as Splunk and Dynatrace for monitoring, incident response, root cause analysis, and system performance optimization. About us: We are an innovative bank committed to shaping finance as a force for good, empowering our people to innovate … SLAs and performance indicators. Knowledge of configuring alerts, reducing noise, and automating alert routing in collaboration with Technical Recovery Managers. Proficiency in Splunk SPL, report scheduling, app management, and root cause analysis. Ability to conduct proactive system performance analysis and generate reports for stakeholders. Development of scripts for automation and integration of observability tools with CI/ …
partnership with our application support DevOps Labs to deliver deep insights using observability platforms such as Splunk and Dynatrace. Your responsibilities will involve providing direct support for incident response, root cause analysis, performance optimization, and system performance improvement! About us: If you think all banks are the same, you'd be wrong. We're an innovative, fast … develop automated alert routing. Hold advanced knowledge of Splunk SPL, dashboard development, report scheduling and app management. Proficient in crafting service-level dashboards, setting up custom metrics, and conducting root cause analysis using advanced technology. Analysis & Reporting: Conduct proactive analysis on system performance, availability and failures. Generate regular reports for senior stakeholders, summarising trends, anomalies …
systems in near real time using automated tools and perform direct system checks. Respond to events and alerts from monitoring systems and external vendors like AT&T. Conduct early analysis of incidents and contact points of contact for event confirmation. Troubleshoot affected critical infrastructure including network devices and system interfaces. Escalate incidents by contacting DOT Tier III teams and … support tickets with external vendors and service providers. Generate incident management correspondence including notifications and status reports. Initiate Critical Incident Management processes and anchor Incident Response Bridge calls. Document root cause analysis for incidents and maintain knowledge management repositories. Provide on-site troubleshooting support during off-hours including data center operations. Maintain point of contact information and … Current Public Trust clearance (DOJ Public Trust preferred). Nice-to-Haves: Familiarity with ServiceNow ticketing system. Experience with BMC Remedy. Knowledge of network troubleshooting tools and techniques. Experience with root cause analysis documentation. Background in government or federal agency environments. ITIL certification or similar incident management training. Education: Bachelor's degree in Computer Science, Information Technology, System Administration …
our client excellence and regulatory teams to interpret new requirements and how they can be accommodated into their Target Operating Models. Leveraging analytics tools & core methods to assist with root cause analysis and identify opportunities for improvement. Partnering with our technology engineers & data strategists on defining architecture strategies and controls. Collaborating across teams to take designs into … the overall design of our strategy, processes and technology architecture and sharing this knowledge across Investment Operations. Proven track record driving and successfully delivering complex, cross-function projects. Strong root-cause analysis and creative problem-solving skills, coupled with the self-motivation and leadership to drive projects to completion. Ability to consistently produce high quality project documentation … moving targets. Demonstrated success collaborating, ability to influence without authority. 7+ years' experience. PREFERRED QUALIFICATIONS: Functional Architecture Design. Business Process Re-Engineering. Experience across a project lifecycle from scoping & analysis to delivery. Degree in Operational Strategy. Technical knowledge of financial service industry utilities. At Wellington Management, our approach to compensation is designed to help us attract, inspire and retain …
and performance management. You'll develop and manage insightful dashboards and reports, monitor key performance indicators, and present actionable insights to stakeholders. Your role will also involve predictive analytics, root-cause analysis, risk management collaboration, and ensuring compliance with industry standards. Proficiency in tools like Power BI, SQL, and Python, along with familiarity with PRISM or Primavera … risk performance. Monitor key project performance indicators and identify trends, deviations, and improvement opportunities. Present clear and actionable insights to stakeholders to enable effective project control decisions. Data Modelling & Analysis: Develop predictive analytics models to assess potential project outcomes based on current data and trends. Conduct root-cause analyses of project variances and propose mitigation strategies to … reporting. Project Controls Knowledge: Familiarity with project controls disciplines such as cost control, scheduling, and risk management. Understanding of Earned Value Management (EVM) and techniques like cost forecasting, variance analysis, and benchmarking. About You: Essential: Bachelor's degree in Data Science, Engineering, Project Controls, Finance, or a related discipline. Professional certifications (e.g., AACE, PMI-PMP) are a plus. At …
Birmingham, Staffordshire, United Kingdom Hybrid / WFH Options
Cooper Moss Rutland LLP
and performance management. You'll develop and manage insightful dashboards and reports, monitor key performance indicators, and present actionable insights to stakeholders. Your role will also involve predictive analytics, root-cause analysis, risk management collaboration, and ensuring compliance with industry standards. Proficiency in tools like Power BI, SQL, and Python, along with familiarity with PRISM or Primavera … risk performance. Monitor key project performance indicators and identify trends, deviations, and improvement opportunities. Present clear and actionable insights to stakeholders to enable effective project control decisions. Data Modelling & Analysis: Develop predictive analytics models to assess potential project outcomes based on current data and trends. Conduct root-cause analyses of project variances and propose mitigation strategies to … reporting. Project Controls Knowledge: Familiarity with project controls disciplines such as cost control, scheduling, and risk management. Understanding of Earned Value Management (EVM) and techniques like cost forecasting, variance analysis, and benchmarking. About You: Essential: Bachelor's degree in Data Science, Engineering, Project Controls, Finance, or a related discipline. Professional certifications (e.g., AACE, PMI-PMP) are a plus. At …
Roseville, California, United States Hybrid / WFH Options
LHH Recruitment Solutions
in system-level validation, test plan development, and hands-on debugging. Key Responsibilities: Lead product validation activities, including test plan creation, execution, and failure analysis. Conduct performance characterization and root-cause analysis of anomalies in ASIC and storage systems. Develop and maintain detailed test documentation, including test plans, matrices, and reports for new features and regression testing. …/programming (Python preferred, Bash, C/C++ also valuable). Solid understanding of system architecture, power management flows, and platform integration. Experience with debug tools and methodologies for root-cause analysis. Comfortable operating in Linux-based test environments. Preferred Qualifications: Familiarity with security protocols such as TLS 1.3, IPsec, Secure Boot, Secure Firmware Download. Experience with Wireshark … or other traffic analysis tools. Background in ASIC validation or firmware/hardware integration. Strong automation skills and experience integrating tests into CI/CD pipelines. Prior experience working in Agile/Scrum teams. Why Work with Us? Opportunity to join a respected industry leader with a track record of innovation. Collaborative environment that values problem-solving and continuous …
Reading, Berkshire, United Kingdom Hybrid / WFH Options
DCL
escalations. Conduct advanced threat hunting using the Microsoft Security Stack. Build, optimise and maintain workbooks, rules, analytics etc. Correlate data across Microsoft 365 Defender, Azure Defender and Sentinel. Perform root cause analysis and post-incident reporting. Aid in mentoring and upskilling Level 1 and 2 SOC analysts. Required Skills & Experience: The ability to achieve UK Security Clearance …
Leicester, Leicestershire, England, United Kingdom Hybrid / WFH Options
Oliver James
satisfaction. Key Responsibilities: Lead and manage the Application Support team in resolving incidents, service requests, and change requests. Serve as an escalation point for complex technical issues requiring in-depth analysis and resolution. Perform hands-on troubleshooting, root cause analysis, and issue resolution using SQL and system diagnostics tools. Design and execute test cases for application upgrades …
challenges and demands of countering the Cyber Threat. The successful applicant will drive a proactive ethos in an ever-changing cyber security environment and provide robust threat detection and analysis within the 24x7 SOC. Responsibilities: To support the SOC Manager in assisting Information Management UK to meet the challenges and demands of countering the Cyber Threat. Support for the operational … UK SOC. To work with other UK SOC members, including the UK InfoSec Team and the IM Domains (Customer Support and Infrastructure/Information Systems). It will cover analysis, monitoring, reporting, alerting and investigation activity utilising a wide variety of security platforms including AI/ML and behavioural analytics, SIEM (Security Information and Event Management), Network Packet Capture platform … technologies and platforms across the UK Network Perimeter, working with the best standard technologies. The SOC Analyst reports to the SOC Manager. The SOC Analyst conducts a range of analysis and assists the incident response team with investigations that need to be escalated to an Embedded member of staff. The SOC Analyst's key responsibilities are: Effective Tier 1 to …
Helm charts, and pod definition. • Kubernetes Administration: Manage and configure Kubernetes clusters for high availability, scalability, and security. • Debugging and Defect Correction: Troubleshoot and resolve software defects with effective root cause analysis and debugging techniques. • GPU Configuration and Support: Configure and optimize GPU resources using CUDA or other technologies for compute-intensive workloads. • Automated Testing and Deployment … Strong analytical and problem-solving mindset • Excellent verbal and written communication skills • Adaptability and a drive for continuous learning and improvement Bonus If You Have: • Understanding of RF signal analysis or satellite communications systems Engineer smarter, build bolder. Apply now and learn more about our extensive benefits and customizable compensation packages …
Burke, Virginia, United States Hybrid / WFH Options
ALTA IT Services
and ensure accurate data visualization. • Design and maintain dashboards for network health, application performance, and security insights. • Integrate LiveAction solutions with SIEMs, orchestration tools, and other monitoring platforms. • Conduct root-cause analysis of performance bottlenecks, latency, and packet loss across large-scale networks. • Create custom reports and alerts for performance metrics, compliance monitoring, and capacity planning. • Assist … with an understanding of cybersecurity frameworks. Preferred Qualifications: • LiveAction product certifications (e.g., LiveAction Certified Professional). • Experience integrating LiveAction with Splunk, Elastic, or other SIEMs. • Familiarity with packet-level analysis tools like Wireshark or Riverbed. • Experience with scripting and automation (Python, Ansible, PowerShell). • Prior experience supporting agencies such as DoD, DHS, VA, or IC. …
Loughborough, Leicestershire, East Midlands, United Kingdom Hybrid / WFH Options
Oscar Associates (UK) Limited
migration, working with project teams and senior stakeholders. Drive process improvement, automation, and security best practice using PowerShell, deployment tools (SCCM/Intune/PDQ), and cloud technologies. Conduct root cause analysis for critical incidents and coordinate long-term resolutions. Develop and maintain comprehensive technical documentation, knowledge base articles, and asset records. Build strong relationships with senior … and best practice. Participate in an on-call rota (with enhanced allowance) to ensure 24/7 service continuity. Ideal Candidate: Significant 2nd/3rd line support or application analysis experience in a complex/regulated environment. Deep knowledge of Microsoft technologies: Windows Server, Active Directory, PowerShell, Exchange, SharePoint, Teams, Azure. Track record delivering major upgrades, migrations, or automation …
test plans (ITPs), ensuring proper verification and validation of hardware components and systems against design requirements and customer specifications. Develop quality metrics and dashboards to monitor process health, drive root cause analysis, and support strategic quality planning. Collaborate with Engineering and Production to evaluate design feasibility, manufacturability, and adherence to quality standards during reviews and milestone phases. … Skills: Demonstrated experience with ISO 9001 QMS, along with working knowledge of additional standards such as ISO 45001, 14001, 27001, 31000, and ITIL. Strong analytical skills with experience in root cause analysis, corrective actions, and statistical process control (SPC). Proven leadership in guiding teams to adhere to internal procedures and government/industry regulations. Experience writing, revising …