analysis on incidents and escalate issues to the appropriate support level when necessary. Manage Application ID and provide cloud elasticity by auto-scaling resources based on business requirements. Ensure Disaster Recovery (DR) and manual redundancy failover capabilities. Provide regular service management reports to keep clients informed. Follow ITIL processes and Integrated Service Level Management best practices. This is …
Gloucester, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
once a month and for any unplanned maintenance). Deep understanding of Tableau Server architecture, security, performance tuning, and best practices (networking, firewall configuration, server maintenance, patching, backups, and disaster recovery). Scripting & Automation: Scripting/development experience (e.g. PowerShell, Python, TabCMD) for automation. Tableau Desktop & Prep: Familiarity with Tableau Desktop (for creating administrative reports and dashboards) and …
Gloucester, England, United Kingdom Hybrid / WFH Options
LM RECRUITMENT SOLUTIONS LTD
industry standards and regulatory requirements, including ISO 27001, GDPR, and Cyber Essentials. Drive the implementation of cybersecurity measures, risk assessments, and regular audits to maintain a secure IT environment. Disaster Recovery and Business Continuity: Own the development and execution of disaster recovery plans and business continuity strategies. Oversee failover testing and ensure that the organisation can …
technical security controls aligned to frameworks such as NIST 800-53, CIS Benchmarks, and JSP standards. Maintain technical documentation and contribute to the infrastructure knowledge base. Support backup and disaster recovery processes and ensure business continuity measures are in place. Work alongside IT team members to deliver infrastructure enhancements and project-based work. Skills and Experience: Strong hands …
Gloucester, England, United Kingdom Hybrid / WFH Options
DXC Technology Inc
Monitor, troubleshoot, and resolve infrastructure issues across storage, networking, and compute environments. Implement automation and Infrastructure-as-Code (IaC) practices using tools like Terraform, Ansible, or PowerShell. Maintain backup, disaster recovery, and security best practices across all infrastructure components. Document system configurations, policies, and procedures for future reference and audits. Provide technical guidance and support to other IT …
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
Artis Recruitment Ltd
documentation. Provide advanced (2nd and 3rd line) technical support on escalated infrastructure issues. Manage and optimise cloud resources, driving automation and efficiency. Ensure high levels of uptime and strong disaster recovery planning for Azure services. Enforce device and system security through Intune and Azure policy management. Spearhead the shift from on-premise infrastructure to a cloud-first environment …
Gloucester, England, United Kingdom Hybrid / WFH Options
Navtech, Inc
environment. As a member of our team, you will contribute to the full lifecycle of our data persistence layer, from schema design and performance tuning to ensuring robust replication, disaster recovery, and seamless integration within our cloud-native microservice ecosystem. Responsibilities: Design & Develop Database Solutions: Architect, design, and implement highly optimized relational (e.g., MySQL, PostgreSQL, AWS Aurora, SQL … microservices. Performance Optimization & Tuning: Proactively analyze and optimize complex queries, implement efficient indexing strategies, and manage partitioning/sharding to ensure peak database performance and handle high throughput. Reliability & Disaster Recovery: Design, implement, and maintain robust backup, disaster recovery, and high-availability solutions, including replication (master-slave/multi-master) and failover configurations, to ensure data …
networking, and security considerations. Evaluate, select, and implement VDI technologies (e.g., Citrix Virtual Apps and Desktops, VMware Horizon, Microsoft Azure Virtual Desktop). Ensure high availability, performance optimization, and disaster recovery of VDI environments. Collaborate with IT security, network, and application teams to ensure seamless integration and compliance. Provide technical leadership and mentorship to VDI engineers and support …
Cheltenham, England, United Kingdom Hybrid / WFH Options
CACI Ltd
managing Kubernetes clusters; familiarity with Docker and Helm is preferred. Developing automated tools for building, deploying, monitoring, testing, and maintaining infrastructure and applications. Establishing policies for security, compliance, and disaster recovery. Optimizing infrastructure for high availability, fault tolerance, and cost efficiency. Monitoring and improving infrastructure and application performance. Troubleshooting and resolving infrastructure and application issues. Managing and mentoring junior …
software installation on AWS instances. Security and Compliance: Collaborate with the security team to implement and maintain security best practices, monitoring, and compliance within the AWS environment. Backup and Disaster Recovery: Contribute to backup and disaster recovery strategies to ensure data integrity and high availability of systems. Monitoring and Troubleshooting: Monitor system performance and troubleshoot issues …
enhance fault tolerance. Diagnose and resolve infrastructure, networking, and application-related performance issues to ensure operational efficiency. Ensure business continuity by designing and maintaining robust backups, failover strategies, and disaster recovery solutions. Identify, diagnose, and resolve infrastructure or application performance bottlenecks. Create real-time monitoring dashboards and alerting systems to track system health, capacity, and performance trends. Work … closely with development teams to fine-tune infrastructure for cost efficiency while maintaining high performance. Ensure business continuity by designing and maintaining robust backup, failover, and disaster recovery solutions.
in compliance with security policies. • Oversee the installation, configuration, and maintenance of storage solutions, including SAN, NAS, and backup systems. • Ensure data integrity, availability, and security through regular backups, disaster recovery planning, and storage optimization. • Implement and manage storage policies, quotas, and replication to ensure efficient and secure data management. • Install, configure, and maintain Windows Server environments, including …
identify and document client-specific operational risks and drive improvements to mitigate them. Comfortable supporting client engagements, including onboarding sessions, service reviews, or audits. Familiarity with business continuity and disaster recovery planning considerations within service designs. Willingness to travel occasionally for client workshops or service reviews, as required. What do we offer in return? We have a high …
clusters; familiarity with Docker and Helm is a plus. Developing automated tools and processes for building, deployment, monitoring, testing, and maintenance of infrastructure and applications. Maintaining security, compliance, and disaster recovery policies and procedures. Optimizing infrastructure for high availability, fault tolerance, and cost efficiency. Monitoring and improving infrastructure and application performance. Troubleshooting and resolving infrastructure and application issues.
scalable infrastructure on GCP using Terraform. Platform Reliability & Automation: Enhance system reliability and uptime through automation, monitoring, and proactive incident response. Develop CI/CD pipelines, automated testing, and disaster recovery strategies. Container & Orchestration Management: Manage container orchestration using Kubernetes (GKE) and deploy containerized applications with a focus on performance, scalability, and security. Work closely with development, security …
laws. Ability to develop and deliver compliance training. Excellent project management and stakeholder engagement skills. Knowledge of cloud security frameworks (AWS, Azure, SaaS security). Experience in business continuity and disaster recovery planning. Understanding of government and other regulatory body security frameworks (MOD, FSQS, NSCS).
Cheltenham, Gloucestershire, United Kingdom Hybrid / WFH Options
Commercial
and present client documentation (Service reports, contracts, risk registers, service improvement plans, service investment plans etc). Conduct face-to-face service review meetings with service sponsors and stakeholders. Review Disaster Recovery and Business Continuity provision to ensure such services are fit for purpose. Lead the recruitment and management of technical staff including personal development and training. Provide an …
PCI DSS and other regulatory requirements. Implement security best practices, including user management, encryption and auditing. Design and implement database schemas, partitioning strategies and indexing solutions. Ensure high availability, disaster recovery and scalability of database systems. Configure strong monitoring and management processes and tooling to help troubleshoot and resolve complex database issues in real time. Hands-on experience … performance tuning. Hands-on experience with AWS services such as EC2, S3, VPC, and IAM. Solid understanding of database security principles and compliance frameworks. Experience implementing robust backup and recovery strategies. Proficiency with infrastructure automation tools like Terraform or CloudFormation. Familiarity with monitoring tools such as CloudWatch, Datadog, or similar. Experience of database replication tooling such as AWS DMS …
and services. Ensure customer satisfaction regularly exceeds targets. Manage the sharing of skills, knowledge, and experience to improve the service provided by the Service support team. Ensure backups and disaster recovery models are robust and fit for purpose. Additional Information: The job description is not intended to be exhaustive and duties may be altered from time to time …
Gloucester, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
ownership of database schema design, implementation, and evolution across all environments. You'll be responsible for query optimization and performance tuning, managing database capacity planning, and designing backup and recovery procedures to maintain our SLOs. Your work will include developing comprehensive monitoring strategies and exploring multi-tenant database approaches to support future growth requirements. You will champion database best … series data management, ideally using Timescale. Advanced SQL query optimization and performance tuning capabilities, including index design and usage analysis. Experience with database security, access control, compliance requirements, and disaster recovery. Experience with cloud database services and monitoring tools including Grafana and Datadog. Proficiency in Python for database automation and tooling development. Knowledge of CI/CD pipelines, infrastructure …
Cheltenham, Gloucestershire, United Kingdom Hybrid / WFH Options
TwinStream
communication skills and a collaborative spirit. Ability to work independently as a self-starter to identify and solve problems. Solid understanding of monitoring, auto-scaling, performance tuning, troubleshooting, and disaster recovery best practices. Experience building and maintaining robust CI/CD pipelines. Proven experience deploying full-stack solutions to cloud infrastructure. Comprehensive experience in implementing Service Reliability processes.
/patching with implementation and engineering teams. • Manage CMDB/ServiceNow product owner responsibilities, including cybersecurity assessments and data classification. • Support L2 and L3 operational activities related to CMMC, disaster recovery, cybersecurity, EVM, and other infrastructure support activities. • Demonstrate business acumen, stay updated on industry trends, and be the internal expert on the product and competitive landscape. • Conduct …