Strong skills in scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). Expertise in setting up and maintaining monitoring systems (e.g., Prometheus, Grafana). Some other highly valued skills may include: Experience with cloud platforms (e.g., AWS, Azure, Google Cloud). Knowledge of containerization and orchestration tools (e.g. … practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactivemonitoring, maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts … to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure More ❯
customer relationships that promote retention and loyalty Strategic guidance is delivered to the customer to achieve increased adoption and value leveraging FI AI driven solutions and methodology Continuous and proactivemonitoring of customer initiatives & challenges Proactive, strategic and expanding utilization to support account growth FI solution use is expanded and adopted within and across customer functional teams … support strategy Ability to work in a dynamic environment and balance multiple responsibilities Outstanding communication skills to include listening, verbal, written and presentation skills Excellent organizational & time management skills Proactive and driven to provide optimal results with high personal drive, integrity, and a positive attitude Comfortable working in a team environment with Product, Sales, IT, and all levels of More ❯
intervals to share knowledge with others. Create and maintain site documentation and technical documentation. Develop knowledge in a specific technology area to become a trusted SME in that field. Proactivemonitoring of IT systems and preventative measures taken to reduce system downtime. Write post-incident review documents and implement recommendation plans to reduce or prevent further incidents. Test … applications, network, and server performance; provide performance statistics and reports; develop strategies for maintaining and improving core infrastructure. Adhere to all IT security policies and assist in enforcing and monitoring IT security policies; spot potential vulnerabilities and suggest resolutions. Evaluate, design, maintain infrastructure systems, including LANs, WANs, Internet, intranet, security, and incident and change management systems. Understand business requirements More ❯
Bromsgrove, Worcestershire, England, United Kingdom Hybrid / WFH Options
Klipboard
service requests raised through our Service Desk, working with technologies such as Active Directory, Azure Active Directory, Group Policy, Exchange Online, Windows Server, and Remote Desktop Services. Respond to monitoring alerts for Microsoft Azure IaaS/PaaS/SaaS services, network connectivity, and Microsoft 365 services to proactively address potential issues. Investigate and resolve security alerts for Microsoft … Review, install, and test security and application updates, leveraging automation to maintain and improve customer environments. Ensure the operational integrity, performance, and security of customer cloud-based services through proactivemonitoring and expertise. Collaborate with customers and internal teams to implement migrations and deliver solutions tailored to customer requirements. Maintain regular communication with customers via Service Desk tools More ❯
for the network connectivity and interaction of systems. The Contractor shall provide network operational support and maintenance, to include, but not limited to: Provide technical guidance for directing and monitoring information systems operations. Designs, builds, and implement network systems. Update and maintain routers, switches, and firewalls. Direct compilation of records and reports concerning network operations and maintenance. Troubleshoot network … Utilize software and hardware tools and identifies and diagnosis complex problems and factors affecting network performance. Troubleshoot network systems when necessary and makes improvements to the network. Maintain network monitoring software across NS3 networks. Aid in all aspects of network management from network design through implementation, maintenance, and upgrading of existing networks. Analyze design, specifications, and related documents. Implement … server requirements to include load- balancing, VPNs, firewall contexts, and network address translation (NAT) where appropriate. Minimize network latency and maximizes data throughput through design analysis and network performance monitoring tools. Validate new/existing dataflow and data formats. Setup proactivemonitoring of all network devices, services, and servers using a combination of tools to ensure high More ❯
Hull, North Humberside, North East, United Kingdom
AWD Online
will support the development and delivery of the IT strategy, driving innovation and ensuring technology underpins operational efficiency. With a strong focus on service quality, the role will require proactivemonitoring of systems, oversight of network performance, and the implementation of robust security and recovery measures to minimise downtime and risk. As the Senior IT Support Engineer/… Leadership : Manage the daily tasks of the IT Technician, offering support, feedback, and promoting professional development Infrastructure Management : Ensure optimal performance of servers, networks, and related systems while proactively monitoring to prevent downtime Security Oversight : Maintain and monitor firewalls, antivirus, and web filtering to safeguard systems Software & Licensing : Oversee software distribution, updates, and ensure licensing compliance Backup & Recovery : Implement More ❯
will support the development and delivery of the IT strategy, driving innovation and ensuring technology underpins operational efficiency. With a strong focus on service quality, the role will require proactivemonitoring of systems, oversight of network performance, and the implementation of robust security and recovery measures to minimise downtime and risk. As the Senior IT Support Engineer/… Leadership : Manage the daily tasks of the IT Technician, offering support, feedback, and promoting professional development Infrastructure Management : Ensure optimal performance of servers, networks, and related systems while proactively monitoring to prevent downtime Security Oversight : Maintain and monitor firewalls, antivirus, and web filtering to safeguard systems Software & Licensing : Oversee software distribution, updates, and ensure licensing compliance Backup & Recovery : Implement More ❯
platforms and orchestration (e.g., Kubernetes, OpenShift), databases, and applications Manage environments and support CI/CD pipelines using Infrastructure as Code Improve observability using tools such as Dynatrace, ensuring proactivemonitoring and alerting Lead and contribute to post-mortems to identify and implement long-term fixes aligning with organisations long term objectives Troubleshoot complex issues across the platform More ❯
Shefford, Bedfordshire, South East, United Kingdom
Intercity Technology Limited
scripts and workflows using PowerShell and Power Platform to reduce manual effort and improve service efficiency. Drive continuous improvement initiatives across infrastructure, systems, and network services. Develop and maintain proactivemonitoring and automated remediation strategies. Service Management & Governance Champion ITIL-aligned processes including Change, Major Incident, and Problem Management. Own post-incident reviews and ensure actionable recommendations are More ❯
Bedford, Bedfordshire, England, United Kingdom Hybrid / WFH Options
Service Care Solutions - Housing
IP) Familiarity with virtualisation, SANs, and secure gateways Proven troubleshooting skills – able to resolve issues independently and collaboratively Excellent communication skills with both technical and non-technical users Organised, proactive, and able to prioritise a varied workload Previous contribution to IT projects (desirable) If you are interested in this position and meet the above criteria, please send your CV More ❯
Location: Corsham & occasional visits to Portsmouth Employment Type: Contract Salary Range: Up to 600 per day via an approved umbrella company Role Overview We are seeking a skilled and proactive Level 2-3 Platform Engineer to support the installation, configuration, and maintenance of hardware and software systems used in test and reference environments. These environments directly support deployed live More ❯
Basingstoke, Hampshire, South East, United Kingdom
Anson Mccade
be part of a multi-skilled agile team supporting and maintaining secure Windows Server and virtualised environments at scale. This is a hands-on infrastructure role involving systems administration, proactivemonitoring, and implementation of approved changes to ensure optimal performance across critical platforms. Youll collaborate closely with internal teams and stakeholders, contributing to service delivery and operational excellence. More ❯
and implement technologies to enhance platform performance, security, and cost-efficiency. Develop and maintain CI/CD pipelines to streamline deployment and ensure software reliability. Optimize platform performance through proactivemonitoring and resolving bottlenecks. Manage cloud-based infrastructure and orchestration tools to ensure high availability and disaster recovery readiness. Apply robust security measures to safeguard data and platform More ❯
South East London, London, United Kingdom Hybrid / WFH Options
AJ BELL BUSINESS SOLUTIONS LIMITED
upcoming work items. Orchestrate Agile ceremonies such as backlog refinement and sprint planning, facilitating team discussions to drive clarity and consensus on deliverables. Own the squads delivery plan, proactively monitoring progress, identifying potential risks or blockers, and supporting the team in resolving issues. Act as a liaison to internal stakeholdersincluding customer service, operations, and marketingto prepare teams for the More ❯
Redis, GridGain, Apache Ignite Programming languages: Java, Python, Go Lang Container orchestration/Cloud platform: RedHat Openshift/AWS/Azure DevOps tools - Ansible, Chef, Kubernetes, GitLab SRE logging & Monitoring Tools - ELK stack, Grafana, Prometheus, Open Telemetry Other highly valued skills include: Strong understanding of Agile application development methodology. Strong knowledge of API development/principles Collaborating with the … practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactivemonitoring, maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts … to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure More ❯
Hampshire, South East, United Kingdom Hybrid / WFH Options
JLA Resourcing Ltd
managing the database management strategy and will play a key role in the transformation programme. You will work closely with the infrastructure team and some 3rd parties to drive proactivemonitoring, incident prevention and ITIL aligned Change Management. The team maintain operation availability and ensure database governance. You will be responsible for BAU tasks such as patching, updates … back-ups and performance monitoring tuning but key is that ability to look/implement new technologies and help the business move to that cloud based environment. The person: - Deep experience in enterprise grade databases with a real strength in SQLServer - Experience in performance testing, tuning and optimisation - Skills in enterprise level architecture, systems analysis and the development of More ❯
optimize hybrid cloud environments (Azure, AWS, GCC/GCC High) with a focus on automation, scalability, and performance. • Develop and implement automation strategies (PowerShell, Python, Ansible) to streamline provisioning, monitoring, and system management. • Maintain 99.99% uptime and high availability through proactive infrastructure monitoring, redundancy strategies, and disaster recovery planning. • Ensure compliance with NIST 800-171, CMMC, and … Compliance teams. Technical Leadership & Tier 3 Support • Act as the highest-level technical escalation point for engineering-related client and internal issues. • Lead technical troubleshooting, root cause analysis, and proactive problem resolution. • Ensure clear, client-friendly communication of technical solutions to enhance customer trust and satisfaction. • Network Architecture & Design: Plan, design, and document secure, scalable, and resilient network architectures … in macOS environments (MDM, Apple Business Manager, macOS Security and Policies) • Hands-on experience with automation/scripting (PowerShell, Python) to optimize deployments and system management. • Proficiency in infrastructure monitoring, performance tuning, and high-availability designs. • Strong understanding of MSP/MSSP tool stack management (RMM, PSA, SIEM, EDR, Email Security, Backup solutions). • Ability to collaborate with security More ❯
practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactivemonitoring, maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts … to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure More ❯
practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactivemonitoring, maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts … to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure More ❯
practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactivemonitoring, maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts … to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure More ❯
practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactivemonitoring, maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts … to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure More ❯
practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactivemonitoring, maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts … to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure More ❯
practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactivemonitoring, maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts … to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure More ❯