incident detection, notifications, triage, and resolution. Key Responsibilities: Pipeline Approach: Adopt a pipeline approach to enable observability of services deployed across multiple environments, balancing monitoring, logging, and tracing based on service classification. Intelligent Alerts: Design and build intelligent alerts using pipelines, onboarding automated runbooks triggered with clear audit/… logs in service management tools like Jira Service Management. Dashboards: Create and maintain dashboards for proactive monitoring of services to help teams resolve incidents quickly. Monitoring Capability: Continuously improve monitoring capabilities to identify key alerts and thresholds for early warnings before services fail. Automation: Enable intelligent … and commercial observability tools (e.g., Prometheus, Grafana, NewRelic). Expertise in cloud environments (e.g., AWS, Azure) and infrastructure as code (IaC) tools like Terraform. Monitoring and Observability: Experience in creating and maintaining dashboards for proactive monitoring of services. Ability to design and build intelligent alerts using pipelines …
Borehamwood, Hertfordshire, South East, United Kingdom Hybrid / WFH Options
Interaction - Letchworth
and provide technical guidance. You will act as the ultimate point of escalation for complex incidents, define and maintain the technical BAU stack, implement proactive monitoring, and ensure client security. You will also be responsible for providing technical support for our Technology's Managed Service customers with the … aim of fixing all incidents passed to them from 1st and 2nd line team. Additionally, you will perform a variety of proactive tasks which will help maximise customers' up-time, perform root-cause analysis and prevent reoccurring issues. This may include resolving more complex monitoring alerts, deploying the … Desk team and provide technical guidance. · Act as the ultimate point of escalation for complex incidents. · Define and maintain the technical BAU stack. · Implement proactive monitoring and ensure client security. · Deal with incoming incidents in a professional, courteous manner over the phone and via e-mail. · Take ownership …
for ensuring the platform is maintained in line with provider recommendations to ensure a highly resilient service is available to our community. Resolving incidents, proactive monitoring and associated actions to deliver such a service. It also ensures that policies are adhered to and that the necessary support processes … tuning methods. What you'll be doing: Provide expert-level support to customers using the SJP Data Platform technologies Incident resolution for the platform Proactive monitoring and maintenance to ensure high level of operational resilience. Application of SJP's IT policies and IT controls for the platform technologies … support requirements Who we're looking for: A methodical and analytical mindset, with a detail-oriented approach. Good communication and collaboration skills with a proactive approach to problem solving. The ideal candidate will also have a growth mindset, eager to stay updated on industry trends and share knowledge with …
on-premises and cloud environments. To be successful in this role, it's essential that you're able to demonstrate: Experience with performance tuning, proactive monitoring of cybersecurity solutions to identify and mitigate potential threats before they impact the organisation. Strong computer literacy, experience in using Microsoft Office … Product Suite Sentinel or Splunk Desirable skills for this role are: Experience of Agile work practices and tooling (Service Now, JIRA). Performance tuning, proactive monitoring of solutions. Knowledge of systems management procedures in a large commercial, mission-critical environment. About us We're one of the largest …
Bradford, Yorkshire, United Kingdom Hybrid / WFH Options
Freemans Grattan Holdings (fgh)
E-commerce DevOps Engineer role is responsible for managing and optimising software deployment processes for E-Commerce B2C websites and shopping Apps and proactively monitoring and reporting E-Commerce application and infrastructure performance. The role involves: Working collaboratively with software architects, software engineers and network, infrastructure and operations teams … to ensure smooth deployment, scalability and security of E-Commerce B2C websites and shopping apps using CI/CD pipelines and performance monitoring tools. Monitoring E-Commerce system performance, optimizing caching, ensuring uptime and responding to incidents. WHAT YOU'LL BE DOING Further developing and managing CI/… CD pipelines to automate deployment and reduce release cycle times. Ensuring website availability, performance and security through proactive monitoring and incident response and implementing website performance monitoring and optimisation strategies to improve page load times, identify, diagnose and resolve issues and enhance customer experience. Enhancing system observability …
in new projects. They are looking for an Infrastructure Engineering mindset with an interest in using automation tools to streamline infrastructure deployment, management and monitoring, combined with a background in administering and building new IT infrastructures and improvements in IT systems from previous projects is going to be vital … for this role. Role Responsibilities: Infrastructure & Automation: Maintain and enhance IT infrastructure, including VMware ESXi, Linux, Microsoft Windows Server environments, and Network Monitoring and networking components. Learn how to automate configuration management, infrastructure provisioning, and application deployment. Ensure system reliability, scalability, and performance through proactive monitoring and … monitoring. Implement and optimize automation tools such as Azure DevOps (or other CI/CD pipelines), Terraform, Node-Red, and Packer. Deploy and manage monitoring tools (Zabbix, SolarWinds SentryOne, and other network/database monitoring solutions). Ensure secure cloud infrastructure management across Azure and AWS environments. Operational …
Chorley, Lancashire, North West, United Kingdom Hybrid / WFH Options
Nextech Group Limited
Follow ITIL-aligned processes for escalation and management of incidents. Participate in an On-Call Rota for out-of-hours incident response. System Maintenance & Monitoring Perform regular system health checks on client infrastructure, including servers, networks, and backups. Implement preventive maintenance plans and updates to minimise downtime. Proactively monitor …
with a key focus on repeat problem analysis and prevention. You will maintain and develop operational, configuration and other procedures as well as providing proactive monitoring and alerting of key systems, ensuring any potential or actual core system availability issues are identified and rectified quickly whilst performing regular … security monitoring and daily system monitoring, verifying the integrity and availability of all hardware, server resources, systems, and key processes. You will actively participate in developing, maintaining, and testing Disaster Recovery whilst working on operational projects providing extended technical support to client server deployments, rebuilds, and upgrades and …
What you’ll be doing: Public Cloud Infrastructure Management which involves provisioning, configuration and maintaining various Cloud resources to ensure scalability, reliability and security. Monitoring and Performance Optimisation by implementing monitoring solutions to track performance and identify areas for optimisation to enhance user experience and automate improvements where … possible. System Availability and Reliability by ensuring high availability and data integrity through proactive monitoring, alerting, backups and DR planning and testing. Continuous Improvement by staying updated with the latest Cloud & Infrastructure technologies and continuously evaluating and proposing enhancements to existing systems, services and processes whilst also ensuring …
company’s IT infrastructure, ensuring the seamless operation of hardware, servers, storage, and network systems, both on-premise and in the cloud. This includes proactive monitoring, troubleshooting, and support across various IT functions such as hardware, security, and application management. The role also oversees IT service desk operations … service delivery, and continuous improvement of IT processes to meet IT needs and enhance user satisfaction. Duties and responsibilities Responsible for installing, configuring and monitoring IT Hardware Responsible for supporting and monitoring IT Servers Monitor and support storage appliances and IaaS solutions for performance, availability and security Ensure …
Cheltenham, Gloucestershire, South West, United Kingdom
Guidant Global
looking for an experienced IDAM Engineer to work as part of a dedicated Identity and Access Management (IdAM) Live Services team in maintaining and monitoring key components of the Identity Infrastructure. Ensure all supported components provide the level of availability and capacity as required. Provide subject matter expertise in … 2nd and 3rd line support for incidents and IT service requests; take ownership of problems and resolve calls logged with the team. Assist with proactive monitoring and daily checks to avert incidents. *Follow the corporate change process and ensure implementation of changes in accordance with processes. *Help to …
11-13 Church Street, Farnworth, Bolton, Greater Manchester, England
LIV UNIFIED COMMS LTD
We are looking for a proactive and customer-focused Apprentice Unified Comms Engineer to strengthen our technical support department. In this role, you will be the first point of contact for internal and external users experiencing technical issues and will provide timely and effective resolutions. Role First point of … other IT systems. Escalation: Escalate complex issues to higher level technical teams. Troubleshooting: Utilise available tools and knowledgebase to troubleshoot common technical issues. System monitoring: Assist with proactive monitoring of critical IT systems and escalate potential issues. Training The company may offer a full-time position at …
might arise. Liaise with third party partners, suppliers and other parties when required. Maintain the security, integrity and performance of our systems through regular, proactive monitoring and housekeeping. Keep colleagues informed regarding any issues which arise, take remedial action where necessary, using available tools where applicable. SKILLS, KNOWLEDGE …
service maintain a clear understanding of the status of the service and associated tasks at all times. Experience working within a service based team monitoring components such as web servers, applications servers, log files, disk space and databases Experience of line management and leadership Solid understanding of the entire …
chance of calls closing first time and without introducing new issues or unnecessary risk. Develop and maintain in-depth knowledge of supported customer applications. Proactive in managing support contracts. Working with the customer to identify critical periods, or changes in environment and customer resource. Recognise the importance of and … dedicate time to proactive monitoring and maintenance, identifying and managing potential issues prior to them becoming critical. Maintain up-to-date and robust customer documentation to support rapid and effective ticket handling and minimise single points of failure within the team. Demonstrate a mindset of adding customer value … beyond incoming tickets, including communicating relevant product upgrades and proactive maintenance. Follow the Simpson's Support Guide for ticket handling and time recording. Ensuring token-based support contracts are tracked and reports issued to the customer monthly, or otherwise as agreed. Work with the consulting team to ensure smooth …
Hook, Hampshire, United Kingdom Hybrid / WFH Options
Office Angels
downtime. Provisioning virtual PCs and servers. Maintaining our systems in line with ISO27001 and Cyber Essentials Plus. Installing and configuring Catalyst/Meraki switches. Proactive monitoring of the business systems. Recommending areas of improvement in the IT infrastructure to meet and maintain system growth. Occasional travel to other …
Experience in building relationships with vendors • Practical knowledge of enterprise communication platforms (MS Teams, Teams Rooms) • Good knowledge of best practices in controlling and monitoring network environment • Experience in managing Microsoft products: Azure, O365, Teams, Intune • Ability to work in a multicultural environment • Taking ownership of local and global …
systems (KeyCloak). - Have a strong foundation in networking principles. System Maintenance & Optimisation: - Administer and optimise Windows Server 2022 and Hyper-V environments. - Perform proactive monitoring and maintenance to ensure system performance and reliability. - Implement and support disaster recovery and business continuity strategies. Collaboration & Documentation: - Work closely with …
Bolton, Greater Manchester, North West, United Kingdom Hybrid / WFH Options
Warburtons Ltd
Your role will involve investigating and resolving problems, as well as providing essential information about our systems and infrastructure. You'll be responsible for monitoring and maintaining our network and server infrastructure, identifying any errors or faults that may arise. In addition, you'll provide advice and training to … crucial to your success in this role. You'll also be tasked with resolving incidents and problems, including documenting root causes and implementing remedies. Proactive monitoring and raising improvement suggestions will be part of your daily routine, as well as highlighting and implementing process and system improvements. Integrating …
by experience. You will have a strong background in security operations, threat detection and incident response. A critical role supporting defence infrastructure through proactive monitoring, analysis and improvement of cybersecurity. Responsibilities: Experience in a security operations centre (SOC) environment Experience with SIEM tools such as Microsoft Sentinel …
Reigate, Surrey, United Kingdom Hybrid / WFH Options
Willis Towers Watson
Bicep etc.) Experience of Microsoft Azure in areas such as networking, storage, integration, compute and analytics Experience of cloud observability concerns (logging, tracing, metrics, monitoring & alerting) Experience of Windows & Linux containers and orchestration platforms (Docker, Kubernetes) Strong interpersonal skills, with the ability to work effectively with many stakeholders Solid …
Chester, Cheshire, North West, United Kingdom Hybrid / WFH Options
NMS Recruit
training materials and deliver education to clients as and when required Share knowledge, experience and documentation across the team Infrastructure/Component Activities Proactively monitoring endpoints using various tools Provide administration services to admin centres such as Exchange, Teams and SharePoint Develop and implement compliance and configuration policies to …
Coalville, England, United Kingdom Hybrid / WFH Options
Mobius Networks Limited
major (internal and customer facing) Key Experience & Skills Windows Server: 2016, 2019, 2022+ Cisco Networking: Configuration of Cisco routers, switches, and firewalls Network Management & Monitoring VPN Implementation, maintenance, and troubleshooting: IPSec and SSL VPN setups Disaster Recovery: Implementation, maintenance, and management of backup and recovery solutions Operating Systems and … Windows Server, Virtualization platforms (Hyper-V, VMware) Microsoft Office 365 Scripting languages & Automation tools Backup and recovery solutions (Asigra DS-User, Veeam, Acronis) Network monitoring tools (PRTG) Antivirus & Endpoint Detection and Response Web & email filtering Firewalls (Cisco ASA, Firepower preferred, others beneficial) Managed Wi-Fi solutions (Cisco Meraki) Two … with a continuous desire to learn and expand knowledge. Team-oriented with the ability to collaborate effectively and integrate seamlessly within a team. Positive, proactive, and solution-focused mindset. Comfortable working both in an office environment and remotely. Why Come Work With Us?👋 Here’s why we’re confident …