and peripheral equipment for executives. Mobile device support and advanced troubleshooting skills (Apple & Android technologies). Proactively identify potential technical issues and implement preventive solutions and advanced troubleshooting and rootcause analysis. Liaising with and delegating tasks to relevant teams for escalation. Supporting the Exec Support Specialist and escalating support issues to the Head of IT where necessary. More ❯
and peripheral equipment for executives. Mobile device support and advanced troubleshooting skills (Apple & Android technologies). Proactively identify potential technical issues and implement preventive solutions and advanced troubleshooting and rootcause analysis. Liaising with and delegating tasks to relevant teams for escalation. Supporting the Exec Support Specialist and escalating support issues to the Head of IT where necessary. More ❯
the phone, and by email. Ability to work in a hands-on team environment. Familiarity with the Agile development process Experience with software programming, can diagnose issues and perform rootcause analysis. Knowledge of at least one programming language, C/C++, Java, Python, etc Experience with Ansible scripting, proficient at using Ansible to automate processes Experience with More ❯
the phone, and by email. Ability to work in a hands-on team environment. Familiarity with the Agile development process Experience with software programming, can diagnose issues and perform rootcause analysis. Knowledge of at least one programming language, C/C++, Java, Python, etc Experience with Ansible scripting, proficient at using Ansible to automate processes Experience with More ❯
infrastructure in a LiveOps environment. Scoping, design and implementation of cloud architecture. Implement, maintain and consolidate cloud testing and automation tools. Identifying and deploying cybersecurity measures. Incident management and rootcause analysis. Working with our code and build teams to ensure a streamlined workflow. Minimum Requirements: Prior experience working in DevOps at a Senior level. Strong experience with More ❯
Sheffield, South Yorkshire, England, United Kingdom
Massenhove Recruitment Limited
that cannot be addressed by First or Second Line support. You will play a key role in maintaining and improving the organisation’s IT infrastructure, performing deep-dive diagnostics, rootcauseanalysis, and implementing long-term solutions. In addition to supporting escalated incidents, you will contribute to system design, strategic projects, and continuous service improvement. The ideal … Line Support Engineer - Key duties Expert-Level Support & Issue Resolution Take ownership of high-level, complex incidents and problems escalated from Second Line Support Perform in-depth diagnostics and rootcauseanalysis across infrastructure, systems, and applications Develop and implement long-term fixes and preventative measures to reduce repeat incidents Infrastructure Management & Improvement Maintain, monitor, and optimise … support role Strong expertise in server administration, networking, virtualisation, and storage solutions Solid understanding of IT security principles and best practices Ability to carry out detailed troubleshooting and perform rootcauseanalysis Experience managing or contributing to technical projects and service improvements Proficiency in tools such as Active Directory, Group Policy, Office 365, Exchange, and Windows Server More ❯
and operational health across critical payment systems. You will collaborate with our application support DevOps Labs to utilize observability platforms such as Splunk and Dynatrace for monitoring, incident response, rootcauseanalysis, and system performance optimization. About us We are an innovative bank committed to shaping finance as a force for good, empowering our people to innovate … SLAs and performance indicators. Knowledge of configuring alerts, reducing noise, and automating alert routing in collaboration with Technical Recovery Managers. Proficiency in Splunk SPL, report scheduling, app management, and rootcause analysis. Ability to conduct proactive system performance analysis and generate reports for stakeholders. Development of scripts for automation and integration of observability tools with CI/ More ❯
partnership with our application support DevOps Labs to deliver deep insights using observability platforms such as Splunk and Dynatrace. Your responsibilities will involve providing direct support for incident response, rootcauseanalysis, performance optimization, and system performance improvement! About us If you think all banks are the same, you'd be wrong. We're an innovative, fast … develop automated alert routing. Hold advanced knowledge of Splunk SPL, dashboard development, report scheduling and app management. Proficient in crafting service-level dashboards, setting up custom metrics, and conducting rootcauseanalysis using advanced technology. Analysis & Reporting: Conduct proactive analysis on system performance, availability and failures. Generate regular reports for senior stakeholders, summarising trends, anomalies More ❯
systems in near real time using automated tools and perform direct system checks Respond to events and alerts from monitoring systems and external vendors like AT&T Conduct early analysis of incidents and contact points of contact for event confirmation Troubleshoot affected critical infrastructure including network devices and system interfaces Escalate incidents by contacting DOT Tier III teams and … support tickets with external vendors and service providers Generate incident management correspondence including notifications and status reports Initiate Critical Incident Management processes and anchor Incident Response Bridge calls Document RootCauseAnalysis for incidents and maintain knowledge management repositories Provide on-site troubleshooting support during off-hours including data center operations Maintain point of contact information and … Current Public Trust clearance (DOJ Public Trust preferred) Nice-to-Haves Familiarity with ServiceNow ticketing system Experience with BMC Remedy Knowledge of network troubleshooting tools and techniques Experience with rootcauseanalysis documentation Background in government or federal agency environments ITIL certification or similar incident management training Education: Bachelors degree in Computer Science, Information Technology, System Administration More ❯
partnership with our application support DevOps Labs to deliver deep insights using observability platforms such as Splunk and Dynatrace. Your responsibilities will involve providing direct support for incident response, rootcauseanalysis, performance optimization, and system performance improvement! About us If you think all banks are the same, you'd be wrong. We're an innovative, fast … develop automated alert routing. Hold advanced knowledge of Splunk SPL, dashboard development, report scheduling and app management. Proficient in crafting service-level dashboards, setting up custom metrics, and conducting rootcauseanalysis using advanced technology. Analysis & Reporting: Conduct proactive analysis on system performance, availability and failures. Generate regular reports for senior stakeholders, summarising trends, anomalies More ❯
Problem Management Support and facilitate cross departmental Incident Retrospectives and documenting the outcome in the form of an Incident Report Analyse incident trends and log problem records to drive rootcauseanalysis and workarounds Tracking and reporting on Problem Management actions and activities and presenting trends and service risks at relevant forums Continuous Service Improvement (CSI) Assist … DevOps methodologies (desirable) Able to organise and prioritise responsibilities across different workstreams at the same time Experience or good understanding of Problem Management practices and facilitating reviews to drive rootcauseanalysis and improvements Experience of analysing problem trends and driving through suggested solutions An interest in media, technology and service management (desirable) Exposure to or hands More ❯
our client excellence and regulatory teams to interpret new requirements and how they can be accommodated into their Target Operating Models Leveraging analytics tools & core methods to assist with rootcauseanalysis and identify opportunities for improvement Partnering with our technology engineers & data strategists on defining architecture strategies and controls Collaborating across teams to take designs into … the overall design of our strategy, processes and technology architecture and sharing this knowledge across Investment Operations Proven track record driving and successfully delivering complex, cross-function projects Strong root-causeanalysis and creative problem-solving skills, coupled with the self-motivation and leadership to drive projects to completion Ability to consistently produce high quality project documentation … moving targets Demonstrated success collaborating, ability to influence without authority 7+ years experience PREFERRED QUALIFICATIONS Functional Architecture Design Business Process Re-Engineering Experience across a project lifecycle from scoping & analysis to delivery Degree in Operational Strategy Technical knowledge of financial service industry utilities At Wellington Management, our approach to compensation is designed to help us attract, inspire and retain More ❯
Roseville, California, United States Hybrid / WFH Options
LHH Recruitment Solutions
in system-level validation, test plan development, and hands-on debugging. Key Responsibilities Lead product validation activities, including test plan creation, execution, and failure analysis. Conduct performance characterization and root-causeanalysis of anomalies in ASIC and storage systems. Develop and maintain detailed test documentation, including test plans, matrices, and reports for new features and regression testing. …/programming (Python preferred, Bash, C/C++ also valuable). Solid understanding of system architecture, power management flows, and platform integration. Experience with debug tools and methodologies for root-cause analysis. Comfortable operating in Linux-based test environments. Preferred Qualifications Familiarity with security protocols such as TLS 1.3, IPsec, Secure Boot, Secure Firmware Download. Experience with Wireshark … or other traffic analysis tools. Background in ASIC validation or firmware/hardware integration. Strong automation skills and experience integrating tests into CI/CD pipelines. Prior experience working in Agile/Scrum teams. Why Work with Us? Opportunity to join a respected industry leader with a track record of innovation. Collaborative environment that values problem-solving and continuous More ❯
JOB As a Reliability Engineer at Anduril, you will work within the organization to support product development, define and execute processes for continuous improvement and risk mitigation, and lead rootcause & corrective action efforts throughout the new product development life cycle to ensure a smooth transition to production. You will support field performance monitoring and track trends/… lessons learned to recommend improvements to upstream teams. The ideal candidate for this role has comprehensive knowledge of design, development, test, analysis, manufacturing, operations, asset tracking, maintenance, and continuous improvement. If you are someone who has hands-on experience throughout the entire new product development life cycle from concept to customer delivery, loves to build world-class Reliability processes … redundancy, and accuracy against requirements and concept of operations. Partner with Design & Development Engineers to document requirements and proactively identify risk through tools such as Failure Modes and Effects Analysis, Physics of Failure Methodologies, and Analysis + Test Data Reviews in R&D. Support Development Milestone Reviews through identification of appropriate entry and exit criteria, and work with More ❯
applications of technical principles, theories, and concepts in the field and apply general knowledge of other related disciplines. Support technical solutions to a wide range of difficult problems. Provide analysis for approach to solutions. Work closely with other engineering teams. Responsibilities and Duties Provide guidance and expertise on Azure cloud migrations Design and implement full-stack, in-house solutions … commercial cloud environment Experience with multiple operating systems, including UNIX, Linux, and Windows Experience in troubleshooting very complex distributed environments, including following stack traces back to code to determine rootcauseanalysis Experience with running Internet facing or auto-deployed environments while adhering to service level agreements Agile Development, Cloud systems, or ITIL related Certifications Experience within More ❯
and provide day to day support to enable timely payments. This position will lead efforts to dive deep into vendor payment data, drive process improvements, standardize processes, and eliminate root-cause defects within existing systems. By working together on behalf of our customers, we are building the future one innovative product, service, and idea at a time. Are … intake, escalate, and triage customer payments (payables and billings), contacts, and issues and follow-through to resolution, including executive-level communication. - Recognize escalation trends and identify underlying defects and root causes. - Partner with and influence upstream and downstream teams - Meet or exceed monthly collections goals - Leverage data from numerous databases and internal tools (using Excel, SQL, or other data … manage past due balances - Create statement of accounts, communication plans, SOPs for specific customer accounts - Report collections related key metrics on daily, weekly and monthly basis - Create and share rootcauseanalysis & write documents for dive deep sessions - Escalate to senior leadership key issues that could impact collections - Deliver on multiple data requests and participate in process More ❯
Leicester, Leicestershire, England, United Kingdom Hybrid / WFH Options
Oliver James
satisfaction.Key Responsibilities: Lead and manage the Application Support team in resolving incidents, service requests, and change requests. Serve as an escalation point for complex technical issues requiring in-depth analysis and resolution. Perform hands-on troubleshooting, rootcauseanalysis, and issue resolution using SQL and system diagnostics tools. Design and execute test cases for application upgrades More ❯
Helm charts, and pod definition. • Kubernetes Administration: Manage and configure Kubernetes clusters for high availability, scalability, and security. • Debugging and Defect Correction: Troubleshoot and resolve software defects with effective rootcauseanalysis and debugging techniques. • GPU Configuration and Support: Configure and optimize GPU resources using CUDA or other technologies for compute-intensive workloads. • Automated Testing and Deployment … Strong analytical and problem-solving mindset • Excellent verbal and written communication skills • Adaptability and a drive for continuous learning and improvement Bonus If You Have: • Understanding of RF signal analysis or satellite communications systems Engineer smarter, Build bolder. Apply now and learn more about our extensive benefits and customizable compensation packages More ❯
Burke, Virginia, United States Hybrid / WFH Options
ALTA IT Services
and ensure accurate data visualization. • Design and maintain dashboards for network health, application performance, and security insights. • Integrate LiveAction solutions with SIEMs, orchestration tools, and other monitoring platforms. • Conduct root-causeanalysis of performance bottlenecks, latency, and packet loss across large-scale networks. • Create custom reports and alerts for performance metrics, compliance monitoring, and capacity planning. • Assist … with an understanding of cybersecurity frameworks. Preferred Qualifications: • LiveAction product certifications (e.g., LiveAction Certified Professional). • Experience integrating LiveAction with Splunk, Elastic, or other SIEMs. • Familiarity with packet-level analysis tools like Wireshark or Riverbed. • Experience with scripting and automation (Python, Ansible, PowerShell). • Prior experience supporting agencies such as DoD, DHS, VA, or IC. More ❯
Loughborough, Leicestershire, East Midlands, United Kingdom Hybrid / WFH Options
Oscar Associates (UK) Limited
migration, working with project teams and senior stakeholders. Drive process improvement, automation, and security best practice using PowerShell, deployment tools (SCCM/Intune/PDQ), and cloud technologies. Conduct rootcauseanalysis for critical incidents and coordinate long-term resolutions. Develop and maintain comprehensive technical documentation, knowledge base articles, and asset records. Build strong relationships with senior … and best practice. Participate in an on-call rota (with enhanced allowance) to ensure 24/7 service continuity. Ideal Candidate: Significant 2nd/3rd line support or application analysis experience in a complex/regulated environment. Deep knowledge of Microsoft technologies: Windows Server, Active Directory, PowerShell, Exchange, SharePoint, Teams, Azure. Track record delivering major upgrades, migrations, or automation More ❯
test plans (ITPs), ensuring proper verification and validation of hardware components and systems against design requirements and customer specifications. Develop quality metrics and dashboards to monitor process health, drive rootcauseanalysis, and support strategic quality planning. Collaborate with Engineering and Production to evaluate design feasibility, manufacturability, and adherence to quality standards during reviews and milestone phases. … Skills: Demonstrated experience with ISO 9001 QMS, along with working knowledge of additional standards such as ISO 45001, 14001, 27001, 31000, and ITIL. Strong analytical skills with experience in rootcauseanalysis, corrective actions, and statistical process control (SPC) Proven leadership in guiding teams to adhere to internal procedures and government/industry regulations. Experience writing, revising More ❯
project progress, identifying and resolving issues and risks to ensure on-time and high-quality delivery. Responsible for prioritization of product backlog items based on input from stakeholders, market analysis, and user feedback Provide clarity and guidance to the development and run teams on product requirements, acceptance criteria, service level objectives and desired outcomes. Drive a culture of continuous … improvement by implementing best practices, fostering innovation, and promoting experimentation within the value stream. Lead and facilitate regular retrospectives, rootcauseanalysis, and problem-solving sessions to identify improvement opportunities and drive actionable outcomes. Build and maintain strong relationships with key senior RGM & TPM stakeholders, including business leaders, customers, and partners. Communicate value stream performance, challenges, and More ❯
optimising system performance based on key metrics Deploy updates, fixes, and perform regularmaintenance Provide second-line technical support Build tools and automation to reduce errors andenhance customer experience Perform rootcauseanalysis for production errorsand implement long-term solutions Troubleshoot and resolve technical issuesefficiently Automate tasks, including visualisation andreporting processes Design and implement troubleshooting andmaintenance procedures Collaborate More ❯
optimising system performance based on key metrics Deploy updates, fixes, and perform regularmaintenance Provide second-line technical support Build tools and automation to reduce errors andenhance customer experience Perform rootcauseanalysis for production errorsand implement long-term solutions Troubleshoot and resolve technical issuesefficiently Automate tasks, including visualisation andreporting processes Design and implement troubleshooting andmaintenance procedures Collaborate More ❯
talented team that enjoys exploring and designing new technologies. Aufgaben Lead problem management efforts, including external moderation and team improvements Handle P1/P2 incident management and conduct 8D rootcauseanalysis and lessons learned Oversee a specific cost center regarding forecasting, purchase orders, and bills Ensure proper cost center separation and adherence to SLAs Manage upgrades More ❯