back-office teams. You will act as the key escalation point for complex issues, working closely with development, infrastructure, and trading support groups to resolve problems quickly and effectively. Proactivemonitoring, incident management, and continuous improvement will sit at the heart of your role. Key Responsibilities Manage and mentor a team of Desktop Support Analysts, driving best practice … endpoint infrastructure challenges Collaborate with engineering teams on automation, scripting, and tooling improvements Oversee vendor relationships and guide technology strategy to align with business needs Drive root cause analysis, proactivemonitoring, and resilience planning to minimise disruption What You’ll Bring Proven experience in leading or mentoring technical support teams in high-pressure environments Expert knowledge of Windows More ❯
Cheltenham, Gloucestershire, South West, United Kingdom
LM RECRUITMENT SOLUTIONS LTD
maintaining alignment with evolving industry standards and best practices. Design and build secure user devices (PC/laptop builds, anti-malware, mirroring, device integrity). Lead the adoption of proactivemonitoring and automation tools to help transition the business from reactive support to predictive, streamlined operations. Lead on service management excellence ticket discipline, root cause analysis, and continuous … improvement. Ensure all backup strategies (on-premises and cloud) are fit for purpose, with robust monitoring and management to maintain data integrity and support business continuity. Consistently represent calm professionalism to clients while inspiring high standards within the team, clearly communicating technical issues and resolutions to both technical and non-technical audiences. Must-Have Experience Proven track record in … to-end project lifecycle ownership: from scoping through to rollout. Deep infrastructure expertise including device imaging, endpoint security, and network services. Strong hands-on experience with RMM platforms and proactivemonitoring tools. Ticketing systems and ITIL-style service delivery know-how. Backups, disaster recovery, and continuity planning fit for SME/school environments. Proven ability to install, configure More ❯
maintaining alignment with evolving industry standards and best practices. Design and build secure user devices (PC/laptop builds, anti-malware, mirroring, device integrity). Lead the adoption of proactivemonitoring and automation tools to help transition the business from reactive support to predictive, streamlined operations. Lead on service management excellence ticket discipline, root cause analysis, and continuous … improvement. Ensure all backup strategies (on-premises and cloud) are fit for purpose, with robust monitoring and management to maintain data integrity and support business continuity. Consistently represent calm professionalism to clients while inspiring high standards within the team, clearly communicating technical issues and resolutions to both technical and non-technical audiences. Must-Have Experience Proven track record in … to-end project lifecycle ownership: from scoping through to rollout. Deep infrastructure expertise including device imaging, endpoint security, and network services. Strong hands-on experience with RMM platforms and proactivemonitoring tools. Ticketing systems and ITIL-style service delivery know-how. Backups, disaster recovery, and continuity planning fit for SME/school environments. Proven ability to install, configure More ❯
Cheltenham, Gloucestershire, South West, United Kingdom
LM RECRUITMENT SOLUTIONS LTD
maintaining alignment with evolving industry standards and best practices. Design and build secure user devices (PC/laptop builds, anti-malware, mirroring, device integrity). Lead the adoption of proactivemonitoring and automation tools to help transition the business from reactive support to predictive, streamlined operations. Lead on service management excellence ticket discipline, root cause analysis, and continuous … improvement. Ensure all backup strategies (on-premises and cloud) are fit for purpose, with robust monitoring and management to maintain data integrity and support business continuity. Consistently represent calm professionalism to clients while inspiring high standards within the team, clearly communicating technical issues and resolutions to both technical and non-technical audiences. Must-Have Experience Proven track record in … to-end project lifecycle ownership: from scoping through to rollout. Deep infrastructure expertise including device imaging, endpoint security, and network services. Strong hands-on experience with RMM platforms and proactivemonitoring tools. Ticketing systems and ITIL-style service delivery know-how. Backups, disaster recovery, and continuity planning fit for SME/school environments. Proven ability to install, configure More ❯
project work, with some limited BAU work as well. They are looking for an Infrastructure Engineer with an interest in using automation tools to streamline infrastructure deployment, management and monitoring, combined with a background in administering and building new IT infrastructures and improvements in IT systems from previous projects is going to be vital for this role. Any background … Datacentre relocation or migration would be useful as well in this role Role Responsibilities: Infrastructure: Maintain and enhance IT infrastructure, including VMware ESXi, Microsoft Windows Server environments, and Network Monitoring and networking components. There is some minimal work supporting AWS/Linux server infrastructure as well. Ensure system reliability, scalability, and performance through proactivemonitoring and automation. … streamline infrastructure deployment, management, and monitoring. Implement and optimize automation tools such as Azure DevOps (or other CI/CD pipelines), Terraform, Node-Red, and Packer. Deploy and manage monitoring tools (Zabbix, SolarWinds SentryOne, and other network/database monitoring solutions). Ensure secure cloud infrastructure management across Azure and AWS environments. Experience & Skills: It will be useful More ❯
diverse, modern, and complex technology landscape, ensuring seamless support and smooth operations. You'll take charge of incident triage and resolution, lead system upgrades, and keep performance optimized through proactivemonitoring and alerting. Beyond day-to-day support, you'll drive continuous improvements in processes and tools, collaborating with stakeholders to elevate operational excellence and keep our applications … drive business value. Collaborate with engineering teams to troubleshoot and resolve application and infrastructure-related issues. Oversee major incidents and work alongside developers and operations teams to restore service. Proactive Application Support & Maintenance: Ensure the ongoing stability and health of all applications through regular maintenance tasks, including patching and updates. Plan and execute minor upgrades and configuration changes with … minimal impact to the production environment. Maintain and optimise system configurations across legacy and modern applications to ensure their continued performance and reliability. System Monitoring & Performance: Maintain and improve logging, monitoring, and alerting systems. Define service-level objectives and indicators for business applications. Continuously review performance metrics against SLO/SLIs and proactively address performance bottlenecks or underperforming More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid / WFH Options
The Channel Recruiter
requests, and ensuring customers receive an excellent level of service in line with SLAs. This is a varied role that covers End User Support, Infrastructure, Networking, Cloud Services, and proactive monitoring. While you don’t need to be an expert in every area, you should be eager to learn, adaptable, and passionate about delivering first-class technical support. Established … bring that positive impact home. What You’ll Do: L2 Service Desk Analyst Investigate and resolve incidents and service requests in line with ITSM processes. Respond to alerts from monitoring tools (e.g., Logic Monitor) and address underlying issues. Implement technical changes, preparing and presenting to CAB when required. Support patch management, event management, and proactivemonitoring activities. More ❯
Shefford, Bedfordshire, South East, United Kingdom
Intercity Technology Limited
work for this role are Monday - Sunday - 4 on 4 off - 7pm - 7am. Key Responsibilities as a Cloud Operations Engineer: Maintain and troubleshoot Azure and hybrid cloud environments. Perform proactivemonitoring, incident response, and root cause analysis of mission-critical systems. Configure, optimise, and secure servers, virtual machines, networking, and storage solutions. Create and maintain scripts (e.g., PowerShell … Experience and Knowledge: Knowledge of Azure Monitor and Microsoft 365 administration. Understanding of Windows Server, Active Directory, DNS, DHCP, and group policies. Experience using Tanium, Okta and Netskope. Experience monitoring and maintaining mission-critical infrastructures. Basic scripting knowledge (PowerShell). Excellent communication, troubleshooting, and documentation skills. Strong expertise in Azure administration Prior experience in a 3rd line technical infrastructure More ❯
Jam Management Consultancy Limited T/A JAM RECRUITMENT
on support for operational issues and ensure platform reliability. Continuous Improvement & Automation Automate workflows to reduce manual intervention and increase efficiency. Contribute to infrastructure-as-code and DevOps pipelines. Monitoring, Observability & Security Develop proactivemonitoring strategies and support security best practices. Participate in incident response, threat simulation, and operational runbooks. Troubleshooting & Collaboration Provide 3rd line support, collaborating … operating systems, cloud technologies, and hardware. Strong scripting/automation skills (PowerShell, Python). Understanding of containerisation/orchestration (Docker, Kubernetes). Excellent troubleshooting and communication skills. Security and monitoring knowledge desirable. More ❯
Reading, Berkshire, South East, United Kingdom Hybrid / WFH Options
Nextech Group Limited
Microsoft 365, networking, and on-prem virtualisation, collaborating with experienced 2nd & 3rd line engineers and security specialists. Key responsibilities * Manage and support servers (Hyper-V virtual and physical), ensuring proactivemonitoring, patching, and maintenance * Design, manage, and monitor Azure environments and resources * Support Office 365, OneDrive, SharePoint Online, Teams, Power Automate, Power BI, Intune, and MFA * Administer SQL … switches, routers, firewalls, load balancers, VPNs) * Handle certificate management (SSL), DNS, DHCP, and authentication (ADFS, SAML, WAP) * Keep CMDB and capacity planning records up to date * Collaborate on security monitoring and compliance activities, including audit support * Write and maintain PowerShell/Terraform scripts to automate tasks What you'll bring * Strong Microsoft Windows Server, Azure, and O365 administration experience … reliability in delivering results Desirable experience * IT certifications (Azure, MCP/MCSA/MCSE, CCNA) * SAN administration (Compellent, NAS/iSCSI), Hyper-V clusters, Microsoft Failover clustering * Backup management, monitoring systems (PRTG), build image maintenance, and infrastructure decommissioning * PowerShell scripting for automation and infrastructure management * ITIL foundation knowledge Why this role? * Competitive salary + discretionary bonus * 25 days holiday More ❯
including high-availability and disaster recovery solutions. Validate and advise customers on the suitability of their platforms for the installation of the Clinical Systems software solutions. Perform daily system monitoring checks to ensure integrity and availability of server resources, systems and key process and respond to any alerts generated by the proactivemonitoring and alerting system. Participate … in weekly support rota and Support out-of-hours implementation and service delivery activities as required. Conduct Backup monitoring and in line with supported backup strategies. Support the planning and execution of application and platform Service Pack upgrades as required. Troubleshoot and resolve issues within defined Service Level Agreements, which includes responding to, identify and resolve application and platform … problems and diagnose hardware or software faults in order to maintain supported environments. Support performance monitoring and tuning (Index maintenance and management, housekeeping, memory, CPU, Disk I/O etc.) as required. Assume responsibility for ownership of customer or internally reported issues through to conclusion where such ownership has been delegated. Adhere to change management procedures and ensure correct More ❯
Shefford, Bedfordshire, South East, United Kingdom
Intercity Technology Limited
manage governance, compliance, and security policies across cloud estates. Execute backup, disaster recovery, and business continuity procedures. Systems Management & Optimisation: Maintain and troubleshoot Azure and hybrid cloud environments. Perform proactivemonitoring, incident response, and root cause analysis of mission-critical systems. Configure, optimise, and secure servers, virtual machines, networking, and storage solutions. Create and maintain scripts (e.g., PowerShell … Windows Server environments and Active Directory (on-prem & Azure AD). Networking fundamentals (TCP/IP, DNS, VPN, firewalls, routing). Experience using Tanium, Okta and Netskope. Experience with monitoring tools (Azure Monitor, Log Analytics, Application Insights). Strong PowerShell scripting and automation skills. Proven ability to troubleshoot complex issues in multi-tenant, hybrid environments. Excellent communication, documentation, and More ❯
YAML pipelines, including Salesforce-specific pipelines. Build and maintain Infrastructure as Code (IaC) using Terraform and Ansible. Design highly reliable, scalable, and secure infrastructure supporting performance-critical workloads. Build proactivemonitoring, observability, and alerting with Prometheus, Grafana, Azure Monitor, DataDog, and Dynatrace. Troubleshoot complex system issues spanning applications, networks, and infrastructure. Define platform SLAs, SLOs, and governance standards … Code with Terraform and Ansible, along with scripting in PowerShell, Python, or Bash Experience implementing GitOps workflows and managing platform SLAs, SLOs, and governance standards Familiarity with observability and monitoring tools including Prometheus, Grafana, Azure Monitor, DataDog, or Dynatrace Preferred experience supporting Salesforce DevOps pipelines and working with Java, .NET, or Node.js application environments Exposure to AI/ML … Certified Kubernetes Administrator (CKA), Microsoft AZ-400, or HashiCorp Terraform Associate are advantageous Strong interpersonal skills including clear communication, collaboration across teams, adaptability in fast-paced environments, and a proactive mindset with a focus on reliability, performance, and developer enablement We make technology work so people can do great things. CDW is a leading multi-brand provider of information More ❯
Birmingham, West Midlands, United Kingdom Hybrid / WFH Options
Hays
resilience, cost-effectiveness, and future scalability. Oversee the integration of VOIP with core network, data centre, and security infrastructures. Ensure high availability and optimal performance of VOIP services through proactivemonitoring, capacity planning, and fault resolution. Develop and maintain disaster recovery strategies for voice systems, minimising downtime and ensuring business continuity. Produce and maintain high- and low-level … voice systems to modern SIP/Cloud solutions. Enterprise networking knowledge (Cisco, Aruba, Fortinet) and experience integrating voice across LAN/WAN and firewalls. Skilled in network and voice monitoring tools (SolarWinds, PRTG, Prime Infrastructure). Strong stakeholder management and vendor relationship skills. Ability to lead large-scale technical projects with minimal supervision. Qualifications & Certifications (Desirable): Cisco Certified Network More ❯
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
esure Group
Stay up to date with emerging technologies and industry trends, sharing knowledge across company communities to embed SRE best practice. Drive continual improvement by automating manual processes and optimising monitoring systems to achieve full estate coverage. Lead initiatives to improve availability, performance, and scalability through proactivemonitoring, capacity planning, and ongoing maintenance. Collaborate with development squads to … embed monitoring, reliability, and scalability best practices within the development lifecycle. Represent the SRE team in stakeholder engagements, providing progress updates, managing expectations, and addressing concerns. Operate as a primary contact for pressing issues, employing technical skills to solve complex problems rapidly in coordination with other teams. Participate in out-of-hours on-call or standby duties when required. … Qualifications What we’d love you to bring: Deep experience of AWS (particularly EC2, EKS, Lambda, S3, IAM, etc) Monitoring/alerting tools (for example we use Grafana, Prometheus, Loki, CloudWatch and Dynatrace) SME on monitoring best practices for a variety of different platforms and technologies Docker and Kubernetes Git/Gitlab Jenkins/CI/CD/ More ❯
Gateshead, Tyne and Wear, England, United Kingdom Hybrid / WFH Options
Simpson Judge Ltd
migrations, hybrid deployments and installations* Troubleshooting across desktop, server, cloud and virtual environments (Hyper-V, VMware)* Acting as a trusted advisor to clients, translating tech into business outcomes* Proactively monitoring systems, managing SLAs, and keeping documentation sharp* Collaborating with vendors, internal teams and stakeholders to resolve issues* Getting involved in maintenance, incident response and occasional 24/7 supportMust More ❯
Strong skills in scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). Expertise in setting up and maintaining monitoring systems (e.g., Prometheus, Grafana). Some other highly valued skills may include: Experience with cloud platforms (e.g., AWS, Azure, Google Cloud). Knowledge of containerization and orchestration tools (e.g. … practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactivemonitoring, maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts … to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure More ❯
Strong skills in scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). Expertise in setting up and maintaining monitoring systems (e.g., Prometheus, Grafana). Some other highly valued skills may include: Experience with cloud platforms (e.g., AWS, Azure, Google Cloud). Knowledge of containerization and orchestration tools (e.g. … practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactivemonitoring, maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts … to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure More ❯
customer relationships that promote retention and loyalty Strategic guidance is delivered to the customer to achieve increased adoption and value leveraging FI AI driven solutions and methodology Continuous and proactivemonitoring of customer initiatives & challenges Proactive, strategic and expanding utilization to support account growth FI solution use is expanded and adopted within and across customer functional teams … support strategy Ability to work in a dynamic environment and balance multiple responsibilities Outstanding communication skills to include listening, verbal, written and presentation skills Excellent organizational & time management skills Proactive and driven to provide optimal results with high personal drive, integrity, and a positive attitude Comfortable working in a team environment with Product, Sales, IT, and all levels of More ❯
intervals to share knowledge with others. Create and maintain site documentation and technical documentation. Develop knowledge in a specific technology area to become a trusted SME in that field. Proactivemonitoring of IT systems and preventative measures taken to reduce system downtime. Write post-incident review documents and implement recommendation plans to reduce or prevent further incidents. Test … applications, network, and server performance; provide performance statistics and reports; develop strategies for maintaining and improving core infrastructure. Adhere to all IT security policies and assist in enforcing and monitoring IT security policies; spot potential vulnerabilities and suggest resolutions. Evaluate, design, maintain infrastructure systems, including LANs, WANs, Internet, intranet, security, and incident and change management systems. Understand business requirements More ❯
Bromsgrove, Worcestershire, England, United Kingdom Hybrid / WFH Options
Klipboard
service requests raised through our Service Desk, working with technologies such as Active Directory, Azure Active Directory, Group Policy, Exchange Online, Windows Server, and Remote Desktop Services. Respond to monitoring alerts for Microsoft Azure IaaS/PaaS/SaaS services, network connectivity, and Microsoft 365 services to proactively address potential issues. Investigate and resolve security alerts for Microsoft … Review, install, and test security and application updates, leveraging automation to maintain and improve customer environments. Ensure the operational integrity, performance, and security of customer cloud-based services through proactivemonitoring and expertise. Collaborate with customers and internal teams to implement migrations and deliver solutions tailored to customer requirements. Maintain regular communication with customers via Service Desk tools More ❯