performance tuning, capacity planning, and proactive monitoring of infrastructure systems. Provide Level 2 and Level 3 support, troubleshooting complex technical issues, and collaborating with internal teams and vendors. Ensure disasterrecovery (DR) and business continuity planning (BCP) strategies are in place and tested regularly. Preferred Experience and Skills Minimum three years' experience in IT support and customer service … Veeam, storage solutions like NetApp, Cisco, Fortinet firewalls, Office Add-ins troubleshooting, Mitel telephone systems, Citrix Virtual Apps & Desktops, Market Data platforms (Bloomberg, Refinitiv, etc.), SQL Server management, and disasterrecovery planning. Additional Details Seniority level: Mid-Senior level Employment type: Full-time Job function: Information Technology Industries: Data Infrastructure and Analytics This job posting appears active, with More ❯
performance tuning, capacity planning, and proactive monitoring of infrastructure systems. Provide Level 2 and Level 3 support, troubleshooting complex technical issues, and collaborating with internal teams and vendors. Ensure disasterrecovery (DR) and business continuity planning (BCP) strategies are in place and tested regularly. Stay up to date with industry best practices, emerging technologies, and regulatory requirements in … similar. Experience with Office Add-ins troubleshooting and problem-solving. Telephone Systems, preferably Mitel. Citrix Virtual Apps & Desktops, NetScaler experience. Bloomberg, Refinitiv, FactSet, Fidessa, or other Market Data experience. Disasterrecovery/BCP. SQL Server database management & administration. #J-18808-Ljbffr More ❯
Manage security tools such as SIEM, endpoint protection, and access controls, including automated threat detection and incident response. Create documentation, procedures, and diagrams for infrastructure management. Plan and test disasterrecovery procedures. Collaborate with support and transformation teams to ensure security and performance. Produce clear documentation for teams and stakeholders. Embed infrastructure strategies into operational workflows. Act as … and storage. IAM: Microsoft Entra ID, RBAC, MFA. Cloud Security: Azure security tools, compliance frameworks. Threat and incident management: vulnerability assessment, threat protection. Patch and endpoint management. Backup and disasterrecovery tools. Monitoring tools experience. Cisco certifications (CCNA/CCNP/CCIE). Microsoft certifications (e.g., MCSA, AZ-104). Azure Security certifications (e.g., AZ-500) a plus. More ❯
and other support teams within agreed timescales. Assist in planning, designing, developing, and deploying new services and enhancements to existing services. Own and manage systems used for monitoring, backups, disasterrecovery, and security patching. Ensure compatibility and interoperability of computing systems. Review and analyze the effectiveness of existing systems and develop improvement strategies. Research hardware and software products More ❯
London, England, United Kingdom Hybrid / WFH Options
Simpson Thacher & Bartlett LLP
Identify and assess operational risks, including those related to IT infrastructure and security, and develop and implement mitigation strategies. Contribute to the development and implementation of business continuity and disasterrecovery plans. Reporting and Analysis: Develop and present regular reports on operational performance, key metrics, and project status to the Director of Operations and other stakeholders. Analyze data More ❯
on-premises infrastructure with Azure and other federated services in a secure and resilient manner. Proactively maintain systems, including security updates, patches, system backups and agreed business continuity and disasterrecovery arrangements. Ensure timely application of security updates and system patches, perform regular system and data backups to prevent data loss, and implement disasterrecovery procedures … with vendors and external support teams as needed. Working flexibly, outside of core hours and as part of a support rota as required Providing incident response, business continuity and disasterrecovery support as part of the on-call rota. Working as needed to provide proactive maintenance at dates and times that minimise disruption to Sadler’s Wells business … Power Automate. Understanding of networking systems, including TCP/IP, routers, firewalls, and VPNs. Experience with SQL Experience of supporting identity and access management solutions. Knowledge of backup solutions, disasterrecovery, and high-availability configurations. Familiarity with security best practices and compliance frameworks, including PCI-DSS and GDPR. Strong troubleshooting skills and ability to work independently or in More ❯
on-premises infrastructure with Azure and other federated services in a secure and resilient manner. Proactively maintain systems, including security updates, patches, system backups and agreed business continuity and disasterrecovery arrangements. Ensure timely application of security updates and system patches, perform regular system and data backups to prevent data loss, and implement disasterrecovery procedures … with vendors and external support teams as needed. Working flexibly, outside of core hours and as part of a support rota as required Providing incident response, business continuity and disasterrecovery support as part of the on-call rota. Working as needed to provide proactive maintenance at dates and times that minimise disruption to Sadler’s Wells business … Power Automate. Understanding of networking systems, including TCP/IP, routers, firewalls, and VPNs. Experience with SQL Experience of supporting identity and access management solutions. Knowledge of backup solutions, disasterrecovery, and high-availability configurations. Familiarity with security best practices and compliance frameworks, including PCI-DSS and GDPR. Strong troubleshooting skills and ability to work independently or in More ❯
Continuously evaluate and optimize cloud infrastructure to improve performance, reduce costs, and enhance scalability. This involves analysing usage patterns, identifying inefficiencies, and implementing changes to achieve better resource utilization. Disasterrecovery planning: Support and manage disasterrecovery plans to ensure business continuity in case of system failures or data loss, alongside the existing Operational Resilience Team. … This includes validating backup and recovery processes, testing recovery procedures, and ensuring that critical data is protected. Documentation and reporting: Maintain comprehensive documentation of cloud infrastructure, configurations, and processes. Assist with generating regular reports on cloud resource usage, performance metrics, and security compliance to keep stakeholders informed. Mentorship and Training: Provide mentorship and training to junior engineers and … related policy Skills & Competencies Extensive knowledge of Azure Platform services including but not limited to Compute Infrastructure, Storage, Networking (vNet, vWan, Peering, NSG’s, ASG’s), Azure Policy, RBAC, Recovery Services Extensive knowledge of Infrastructure as Code (IAC) – Terraform, Azure DevOps Extensive knowledge of scripting and automation skills and tools - Powershell, Terraform, Visual Studio Code Extensive knowledge of Microsoft More ❯
researchers , and internal IT teams to resolve complex, time-sensitive technical issues in a fast-paced environment. Ensure business continuity by maintaining backups, performing system health checks, and supporting disasterrecovery operations. Act as the point of contact for resolving incidents and service requests, escalating more complex issues to senior IT staff when needed. Maintain clear documentation of More ❯
Google Cloud Platform. WHAT YOU'LL BRING: Extensive experience in IT infrastructure support and administration. Strong technical knowledge in Microsoft 365, Google Workspace, Windows OS, Active Directory, Backup and Disaster Recovery. Proficient in Identity and Access Management platforms (e.g., Entra ID, Okta). In-depth understanding of server technologies, enterprise storage, network protocols, and infrastructure components. Skilled in managing More ❯
delivery. Operational & Leadership Skills: IT Operations & Service Continuity: Ability to ensure IT systems are highly available, resilient, and fit for purpose, with a strong focus on business continuity and disaster recovery. Supplier & Vendor Management: Experience managing third-party IT vendors, MSPs, and SaaS providers, ensuring service levels, performance, and cost-effectiveness. Project Leadership & Change Management: Ability to lead technology More ❯
delivery. Operational & Leadership Skills: • IT Operations & Service Continuity: Ability to ensure IT systems are highly available, resilient, and fit for purpose, with a strong focus on business continuity and disaster recovery. • Supplier & Vendor Management: Experience managing third-party IT vendors, MSPs, and SaaS providers, ensuring service levels, performance, and cost-effectiveness. • Project Leadership & Change Management: Ability to lead technology More ❯
and performance Proactive monitoring and actions : Establish proactive monitoring and alerting to maximize system uptime, performance, and cost management Responsible for service-levels : Own the cloud infrastructure availability, security, disasterrecovery, business continuity and SLA Compliance. Provide regular reports to the executive team Incident management: Take the lead in ongoing incidents, managing communications to customers and the executive … budgets and providing reporting to executive leadership Experience managing monitoring, alerting, observability, and dashboarding platforms (such as Azure Monitor, Grafana, and Azure Log Analytics) Experience with incidents, incident management, disasterrecovery planning, and business continuity practices Experience with CI/CD pipelines (Azure DevOps, or other tools) for infrastructure automation and continuous deployment Hands-on experience delivering cloud … Azure environments, including experience with: Windows Server, Linux, Web Application Gateways, Front Door, Virtual Machine scale sets, Firewalls, Azure Entra, Virtual Machines, Storage Accounts, Key Vaults, Log Analytics, SQL, Recovery Vaults, Network Resources, Security Resources and other Azure services Certified with AZ-104 Strong communication, documentation and organisation skills Nice to have: The following certifications: AZ-500, AZ More ❯
supporting other database services used throughout the business, such as Aurora, MariaDB and even some Oracle and DB2. The responsibilities of the role include: Maintain the high availability and disasterrecovery (HADR) systems including performing failover and failback. Ensure backups and recoverability of databases. Production monitoring of performance, capacity and scalability including raising archiving requirements. Patching and version … upgrades. Project work such as disasterrecovery site upgrades and database migrations. Maintain the integrity and performance of the various environments through maintenance routines. Provide on call DBA support on a rota. What we’re looking for: Must have skills/experience: Knowledge and/or experience with SQL Server and PostgreSQL in an IT support setting. An More ❯
with clients and internal stakeholders in a clear, supportive manner. • Certifications such as Microsoft Certified: Azure Administrator Associate or Microsoft 365 Certified: Enterprise Administrator Expert. • Knowledge of backup and disasterrecovery solutions. • Experience in managing hybrid cloud environments. • Familiarity with ITIL best practices. Should you have any questions or wish to apply please do not hesitate to contact More ❯
London, England, United Kingdom Hybrid / WFH Options
PHD Mail Limited
consider the Confidentiality, Integrity and Availability of all systems and wherever new requirements or changes are being requested/evaluated. The role requires the provisioning and maintenance of the DisasterRecovery (DR) solution for the business with regards to computer infrastructure, hardware, and software. Technologies Windows Server 2019,2022, Hyper V Manager, Failover Cluster Manager Microsoft Azure/ More ❯
London, England, United Kingdom Hybrid / WFH Options
Charles River Associates
selecting the most suitable tools for each stage. Managing different storage tiers and optimizing data handling throughout the data lifecycle. Acting as the escalation point for M365 administration. Ensuring DisasterRecovery plans are kept up to date and tested. Supporting and troubleshooting connectivity between cloud and on-premises networks. Building and executing proprietary workflows, custom automations tailored to More ❯
London, England, United Kingdom Hybrid / WFH Options
Charles River Associates
select the most suitable tools for each stage. Manage different storage tiers and optimize data handling throughout the data lifecycle. Act as the escalation point for M365 administration. Ensure DisasterRecovery plans are kept up to date and tested. Support and troubleshoot connectivity between cloud and on-premises networks Build and execute proprietary workflows, custom automations tailored to More ❯
Security (including the creation of policies). Advanced and significant knowledge of Cloud based backup configuration and management solutions e.g. Barracuda and Datto. Data management and premise and cloud disaster recovery. Advanced knowledge of Fast Ethernet & Gigabit Switches, Routers (Enterprise Cisco Routers & ADSL/Cable Routers). Advanced and significant knowledge of SAN’s (Fibre Channel & iSCSI – e.g. iXsystems More ❯
Security (including the creation of policies). Advanced and significant knowledge of Cloud based backup configuration and management solutions e.g. Barracuda and Datto. Data management and premise and cloud disaster recovery. Advanced knowledge of Fast Ethernet & Gigabit Switches, Routers (Enterprise Cisco Routers & ADSL/Cable Routers). Advanced and significant knowledge of SAN’s (Fibre Channel & iSCSI – e.g. iXsystems More ❯
configure, and maintain server hardware and software (Windows Server, Linux, etc.). Monitor server performance and troubleshoot issues proactively to minimize downtime. Perform regular system updates, patches, backups, and recovery operations. Manage user access, permissions, and security policies to protect sensitive data. Configure and maintain virtual environments using tools like VMware, Hyper-V, or similar. Monitor network connectivity related … Ensure compliance with company policies and industry best practices related to server management. Respond promptly to server-related incidents and coordinate with other teams for resolution. Implement and maintain disasterrecovery plans and business continuity procedures. Required Skills and Qualifications: Bachelors degree in Computer Science, Information Technology, or related field preferred. Proven experience as a Server Administrator or More ❯
and maintenance. Conduct capacity planning and resource management for all infrastructure components. Participate in on-call rotations to provide 24x7 support for all critical infrastructure issues. Design and implement disasterrecovery plans and business continuity strategies. Implement best practices for monitoring, logging, and alerting across the infrastructure. Foster a culture of continuous improvement and operational excellence. Analyze complex … tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disasterrecovery and business continuity planning. Experience with performance tuning and capacity planning. Understanding of chaos engineering principles and practices. Skills in cost optimization for cloud infrastructure. Specific Tools More ❯
tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disasterrecovery and business continuity planning. Experience with performance tuning and capacity planning. Understanding of chaos engineering principles and practices. Skills in cost optimization for cloud infrastructure. Specific Tools More ❯
Python, PowerShell, Terraform, and Ansible to automate configurations, monitoring, and troubleshooting. Monitoring & Observability – Maintain and improve system observability with Grafana, Splunk, OpsGenie, and PRTG to proactively address issues. Incident & DisasterRecovery – Manage incident response, root cause analysis, and DR plans to ensure business continuity. Security & Compliance – Enforce security best practices, access controls, and audit logs in line with More ❯
cloud deployments) Excellent written and verbal communication skills with a strong customer service focus Desired Skills & Experience Manage and troubleshoot firewalls and other security appliances Networking (Cisco, SonicWall, HP) DisasterRecovery & Online Backup Mail DNS (MX, DMARC, DKIM, SPF) Familiarity with monitoring and management tools (Solarwinds, PRTG) Relevant certifications (Microsoft, CCNA, CompTIA Security, ITIL) Benefits: Work equipment provided More ❯