software installation on AWS instances. Security and Compliance: Collaborate with the security team to implement and maintain security best practices, monitoring, and compliance within the AWS environment. Backup and DisasterRecovery: Contribute to backup and disasterrecovery strategies to ensure data integrity and high availability of systems. Monitoring and Troubleshooting: Monitor system performance and troubleshoot issues More ❯
efficiency. Oversee scheduling, workload distribution, and performance management to maintain service excellence. Manage customer relationships and coordinate with vendors of third-party services when needed. Support business continuity and disasterrecovery strategies, especially in incident response and critical outage scenarios. Essential Requirements Proven experience managing a service desk team within an MSP environment. Strong background in ITIL-based … Cyber Essentials, NIST, GDPR. Experience with cloud platforms and modern IT stacks (e.g., Microsoft 365, Azure). Automation or scripting experience (e.g., PowerShell). Involvement in business continuity planning, disasterrecovery, or IT audits. Interest in AI-based support tools or self-service strategies. Why Join SRD Technology UK? Be part of a vibrant, forward-thinking team making More ❯
KeyCloak). -Have a foundational knowledge of network principles. System Maintenance: -Administer and maintain Windows Server 2022 and Hyper-V. -Monitor system performance and conduct routine maintenance. -Contribute to disasterrecovery and business continuity strategies. Collaboration & Documentation: -Work with senior engineers and cross-functional teams to enhance infrastructure. -Maintain thorough documentation for systems, configurations, and processes. On-Call More ❯
Bristol, England, United Kingdom Hybrid / WFH Options
PA Consulting
the architecture and best practices. Security and Compliance : Ensure that all solutions comply with internal security standards and relevant regulatory requirements (e.g., GDPR, HIPAA), including data privacy, security, and disasterrecovery considerations. Documentation : Develop and maintain detailed documentation of solution designs, architecture principles, and integration approaches for ongoing projects and future reference. Risk Management : Identify and mitigate technical More ❯
enhance fault tolerance. Diagnose and resolve infrastructure, networking, and application-related performance issues to ensure operational efficiency. Ensure business continuity by designing and maintaining robust backups, failover strategies, and disasterrecovery solutions. Identify, diagnose, and resolve infrastructure or application performance bottlenecks. Create real-time monitoring dashboards and alerting systems to track system health, capacity, and performance trends. Work … closely with development teams to fine-tune infrastructure for cost efficiency while maintaining high performance. Ensure business continuity by designing and maintaining robust backup, failover, and disasterrecovery solutions. #J-18808-Ljbffr More ❯
enhance fault tolerance. Diagnose and resolve infrastructure, networking, and application-related performance issues to ensure operational efficiency. Ensure business continuity by designing and maintaining robust backups, failover strategies, and disasterrecovery solutions. Identify, diagnose, and resolve infrastructure or application performance bottlenecks. Create real-time monitoring dashboards and alerting systems to track system health, capacity, and performance trends. Work … closely with development teams to fine-tune infrastructure for cost efficiency while maintaining high performance. Ensure business continuity by designing and maintaining robust backup, failover, and disasterrecovery solutions. #J-18808-Ljbffr More ❯
enhance fault tolerance. Diagnose and resolve infrastructure, networking, and application-related performance issues to ensure operational efficiency. Ensure business continuity by designing and maintaining robust backups, failover strategies, and disasterrecovery solutions. Identify, diagnose, and resolve infrastructure or application performance bottlenecks. Create real-time monitoring dashboards and alerting systems to track system health, capacity, and performance trends. Work … closely with development teams to fine-tune infrastructure for cost efficiency while maintaining high performance. Ensure business continuity by designing and maintaining robust backup, failover, and disasterrecovery solutions. #J-18808-Ljbffr More ❯
enhance fault tolerance. Diagnose and resolve infrastructure, networking, and application-related performance issues to ensure operational efficiency. Ensure business continuity by designing and maintaining robust backups, failover strategies, and disasterrecovery solutions. Identify, diagnose, and resolve infrastructure or application performance bottlenecks. Create real-time monitoring dashboards and alerting systems to track system health, capacity, and performance trends. Work … closely with development teams to fine-tune infrastructure for cost efficiency while maintaining high performance. Ensure business continuity by designing and maintaining robust backup, failover, and disasterrecovery solutions. #J-18808-Ljbffr More ❯
enhance fault tolerance. Diagnose and resolve infrastructure, networking, and application-related performance issues to ensure operational efficiency. Ensure business continuity by designing and maintaining robust backups, failover strategies, and disasterrecovery solutions. Identify, diagnose, and resolve infrastructure or application performance bottlenecks. Create real-time monitoring dashboards and alerting systems to track system health, capacity, and performance trends. Work … closely with development teams to fine-tune infrastructure for cost efficiency while maintaining high performance. Ensure business continuity by designing and maintaining robust backup, failover, and disasterrecovery solutions. #J-18808-Ljbffr More ❯
enhance fault tolerance. Diagnose and resolve infrastructure, networking, and application-related performance issues to ensure operational efficiency. Ensure business continuity by designing and maintaining robust backups, failover strategies, and disasterrecovery solutions. Identify, diagnose, and resolve infrastructure or application performance bottlenecks. Create real-time monitoring dashboards and alerting systems to track system health, capacity, and performance trends. Work … closely with development teams to fine-tune infrastructure for cost efficiency while maintaining high performance. Ensure business continuity by designing and maintaining robust backup, failover, and disasterrecovery solutions. #J-18808-Ljbffr More ❯
in compliance with security policies. • Oversee the installation, configuration, and maintenance of storage solutions, including SAN, NAS, and backup systems. • Ensure data integrity, availability, and security through regular backups, disasterrecovery planning, and storage optimization. • Implement and manage storage policies, quotas, and replication to ensure efficient and secure data management. • Install, configure, and maintain Windows Server environments, including More ❯
identify and document client-specific operational risks and drive improvements to mitigate them. Comfortable supporting client engagements, including onboarding sessions, service reviews, or audits. Familiarity with business continuity and disasterrecovery planning considerations within service designs. Willingness to travel occasionally for client workshops or service reviews, as required. What do we offer in return? We have a high More ❯
Bournemouth, England, United Kingdom Hybrid / WFH Options
Allianz Management Services Ltd
and data governance in the context of machine learning. Experience in insurance or financial services. Familiarity with OpenAI, GPT, or other LLM technologies. Infrastructure management, including monitoring, backup, and disaster recovery. What We Will Offer You Recognised and rewarded for a job well done, we have a range of flexible benefits for you to choose from- so you can More ❯
Bristol, England, United Kingdom Hybrid / WFH Options
Allianz
and data governance in the context of machine learning. Experience in insurance or financial services. Familiarity with OpenAI, GPT, or other LLM technologies. Infrastructure management, including monitoring, backup, and disaster recovery. What We Will Offer You Recognised and rewarded for a job well done, we have a range of flexible benefits for you to choose from- so you can More ❯
Bristol, England, United Kingdom Hybrid / WFH Options
Allianz UK
and data governance in the context of machine learning. Experience in insurance or financial services. Familiarity with OpenAI, GPT, or other LLM technologies. Infrastructure management, including monitoring, backup, and disaster recovery. What We Will Offer You Recognised and rewarded for a job well done, we have a range of flexible benefits for you to choose from- so you can More ❯
the organisation's Cloud, Storage & Data Centre infrastructure, adhering to group standards and audit requirements. Assist in the design and implementation of Data Centres, ensuring high availability and robust disasterrecovery provisions. Manage Linux servers and software installations, utilizing automation tools where possible. Configure and oversee Data Protection platforms, including Commvault, Dell EMC Networker, and Zerto. Collaborate with … DBA, Network, Application, Systems, and Service Desk teams on projects, migrations, upgrades, and architectural activities. Conduct regular DisasterRecovery testing, validating Recovery Point Objectives (RPO) and Recovery Time Objectives (RTO), and ensuring up-to-date DR procedures. Perform system maintenance, upgrades, performance monitoring, capacity planning, and compute/storage management with minimal disruption. Evaluate and implement More ❯
Location : Harewood, Leeds Position Title : MIS Junior Engineer Job Type : Full-Time About Us Assured Data Protection is a global leader in data backup and disasterrecovery managed services, specializing in safeguarding against data loss and downtime in the event of a disaster, cyber, or ransomware attack. Our fully managed services include immutable backup, disasterrecoveryMore ❯
clusters; familiarity with Docker and Helm is a plus. Developing automated tools and processes for building, deployment, monitoring, testing, and maintenance of infrastructure and applications. Maintaining security, compliance, and disasterrecovery policies and procedures. Optimizing infrastructure for high availability, fault tolerance, and cost efficiency. Monitoring and improving infrastructure and application performance. Troubleshooting and resolving infrastructure and application issues. More ❯
scalable infrastructure on GCP using Terraform. Platform Reliability & Automation: Enhance system reliability and uptime through automation, monitoring, and proactive incident response. Develop CI/CD pipelines, automated testing, and disasterrecovery strategies. Container & Orchestration Management: Manage container orchestration using Kubernetes (GKE) and deploy containerized applications with a focus on performance, scalability, and security. Work closely with development, security More ❯
scalable infrastructure on GCP using Terraform. Platform Reliability & Automation: Enhance system reliability and uptime through automation, monitoring, and proactive incident response. Develop CI/CD pipelines, automated testing, and disasterrecovery strategies. Container & Orchestration Management: Manage container orchestration using Kubernetes (GKE) and deploy containerized applications with a focus on performance, scalability, and security. Work closely with development, security More ❯
scalable infrastructure on GCP using Terraform. Platform Reliability & Automation: Enhance system reliability and uptime through automation, monitoring, and proactive incident response. Develop CI/CD pipelines, automated testing, and disasterrecovery strategies. Container & Orchestration Management: Manage container orchestration using Kubernetes (GKE) and deploy containerized applications with a focus on performance, scalability, and security. Work closely with development, security More ❯
scalable infrastructure on GCP using Terraform. Platform Reliability & Automation: Enhance system reliability and uptime through automation, monitoring, and proactive incident response. Develop CI/CD pipelines, automated testing, and disasterrecovery strategies. Container & Orchestration Management: Manage container orchestration using Kubernetes (GKE) and deploy containerized applications with a focus on performance, scalability, and security. Work closely with development, security More ❯
scalable infrastructure on GCP using Terraform. Platform Reliability & Automation: Enhance system reliability and uptime through automation, monitoring, and proactive incident response. Develop CI/CD pipelines, automated testing, and disasterrecovery strategies. Container & Orchestration Management: Manage container orchestration using Kubernetes (GKE) and deploy containerized applications with a focus on performance, scalability, and security. Work closely with development, security More ❯
scalable infrastructure on GCP using Terraform. Platform Reliability & Automation: Enhance system reliability and uptime through automation, monitoring, and proactive incident response. Develop CI/CD pipelines, automated testing, and disasterrecovery strategies. Container & Orchestration Management: Manage container orchestration using Kubernetes (GKE) and deploy containerized applications with a focus on performance, scalability, and security. Work closely with development, security More ❯
scalable infrastructure on GCP using Terraform. Platform Reliability & Automation: Enhance system reliability and uptime through automation, monitoring, and proactive incident response. Develop CI/CD pipelines, automated testing, and disasterrecovery strategies. Container & Orchestration Management: Manage container orchestration using Kubernetes (GKE) and deploy containerized applications with a focus on performance, scalability, and security. Work closely with development, security More ❯