environments. Core Responsibilities: Design and implement scalable, high-performance IT infrastructure across hybrid environments. Provide Level 3 support and troubleshooting across server, storage, and virtualisation platforms. Lead performance monitoring, capacityplanning, and system optimisation efforts and drive infrastructure automation Mentor junior engineers and guide best practices across the infrastructure estate. Collaborate with stakeholders and vendors to align infrastructure More ❯
Tamworth, Staffordshire, West Midlands, United Kingdom
Amtis Professional Ltd
role, you will be responsible for our network's day-to-day administration, support, and troubleshooting in a dynamic, high-volume environment with multiple configurations. You must have experience planning and installing new software or upgrades and resolving problems remotely and on-site. Your expertise in cloud-based solutions and virtualisation technologies will be critical in this role, as … implementation, and maintenance of complex solutions, coordinating activities with other technical personnel as appropriate. Develop and analyse highly complex system standards, thresholds, and recommendations to maximize system performance. Conduct capacityplanning reviews with management and approve capacity plans formulated by less experienced personnel. Develop strategies to manage the frequency of appropriate support packages/patch applications. Monitor More ❯
vendor management. Manage the SDLC for endpoint Azure Cloud Engineering including development of IaC and software deployment pipelines, associated features, and enhancements. Plan, develop and implement high availability strategies, capacityplanning, performance tuning, and site monitoring for endpoint systems. Guide the team in how to perform the full range of On Premise and Cloud Infrastructure engineering & operations including … Data Center systems, network, and security, and Azure Cloud IaC and deployment pipelines. Identify opportunities to streamline endpoint processes and provide tools to automate endpoint functions. Participate in the planning, implementation, and testing of the recovery procedures under the disaster recovery plan. 24x7 on-call production support as needed. Provide Systems Management support for production, staging, development and disaster … recovery requirements including incident and security response leadership within the Hosting Operations team. Participate in RFI process for endpoint describing technical processes and environmental conditions. Act in a leadership capacity within endpoint to facilitate technical solutions, problem solving, and development. Recruit, develop, mentor and coach Hosting Operations staff to meet personal and organizational goals and business priorities. Fosters a More ❯
environments. Core Responsibilities: Design and implement scalable, high-performance IT infrastructure across hybrid environments. Provide Level 3 support and troubleshooting across server, storage, and virtualisation platforms. Lead performance monitoring, capacityplanning, and system optimisation efforts and drive infrastructure automation Mentor junior engineers and guide best practices across the infrastructure estate. Collaborate with stakeholders and vendors to align infrastructure More ❯
language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disaster recovery and business continuity planning. Experience with performance tuning and capacity planning. Understanding of chaos engineering principles and practices. Skills in cost optimization for cloud infrastructure. Specific Tools and Techniques: Experience in using cloud native monitoring tools like AWS CloudWatch More ❯
and supporting our enterprise messaging infrastructure built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacityplanning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem … and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacityplanning , scaling, and tuning of Solace infrastructure to meet current and future demand. Automate routine maintenance tasks and support continuous improvement of operational processes. Implement and maintain monitoring … understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix systems and scripting (Bash, Python, etc.). Strong analytical and problem-solving skills, with attention to detail. Excellent More ❯
and supporting our enterprise messaging infrastructure built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacityplanning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem … and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacityplanning , scaling, and tuning of Solace infrastructure to meet current and future demand. Automate routine maintenance tasks and support continuous improvement of operational processes. Implement and maintain monitoring … understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix systems and scripting (Bash, Python, etc.). Strong analytical and problem-solving skills, with attention to detail. Excellent More ❯
Technical Operations Specialist with DevOps Scrum Master responsibilities to join our Integrations Team. This role bridges the gap between technical execution and operational governance, ensuring smooth delivery with effective capacityplanning and clear communication between stakeholders, robust change control, clear documentation, and driving agile DevOps practices. The successful candidate will possess a background in software development, a comprehensive … the Team Lead to prioritize, allocate, and monitor tasks across the development team, using Scrum methodologies, ensuring timely and efficient delivery of integration projects. Facilitate daily stand-ups, sprint planning, reviews, and retrospectives to maintain team alignment and continuous improvement. Administrative Oversight: track project milestones, and uphold streamlined team processes to support operational excellence, ensuring all Scrum artifacts (e.g. … and Experience: Bachelor's degree in computer science, information-technology, engineering, system analysis or a related study, or equivalent experience A minimum of three years in a technology-related capacity with direct exposure to software development or IT project environments. At least one year of experience as a Scrum Master or in a similar agile facilitation role, with a More ❯
to major enterprise IT projects. Key Responsibilities Install, configure, and maintain Linux systems (RHEL, SLES) across virtualised environments (VMware ESXi, IBM Power-VM). Monitor and optimise system performance, capacity, and availability. Troubleshoot and resolve 2nd/3rd line infrastructure issues, with a focus on root cause analysis and service restoration. Collaborate on the design and deployment of high … and system monitoring tools (Zabbix, Grafana, etc.). Automation tools such as Ansible, Terraform; exposure to Git, Docker, Kubernetes, or Azure DevOps is a plus. Understanding of performance tuning, capacityplanning, and high availability architectures. Nice to Have Exposure to AIX and cloud services (Azure/AWS). Familiarity with ITIL, SAFe Agile, or Prince2 methodologies. Experience supporting More ❯
the analysis, design and implementation of future infrastructure systems and services Create and maintain documentation on SOPs and System Design Research, evaluate, recommend and implement new systems technologies Perform capacityplanning, upgrades and expansion of Active Directory environment Maintain effective communications with vendors, peers and clients in support of assigned projects Effectively coordinates project efforts so that deliverables … communications with hotel and corporate teams and vendors are required (both verbal and email) Projects and meetings are multinational and this position requires attendance outside of normal working hours Planning and multi-tasking is a requirement of this position, since this position typically is lead for several concurrent projects Communicate with hotel teams regarding outstanding tasks and deliverables and More ❯
capabilities with organizational needs Configure and deploy VMware-based virtualization solutions, including vSphere, ESXi and vCenter Perform and manage VMware upgrades, including vSphere, ESXi and vCenter version upgrades Perform capacityplanning and tuning to ensure optimal performance Build out VMware environments in new data centers, including configuration, optimization, and validation Execute VMWare hardware refresh and data center migration … projects, including planning, execution, and validation Automate provisioning and management of virtual infrastructure utilizing PowerCLI and Infrastructure as Code (IaC) tools such as Packer, Terraform, and Ansible Secure the virtual environment by implementing best practices, including patch management, vulnerability remediation, and access control Develop and implement disaster recovery and business continuity plans using tools such as Zerto and VMware More ❯
to obtain the job specification and client details. Core Responsibilities Lead and modernize their VMware, Kubernetes, and storage platforms Manage data infrastructure, including backup systems and data analytics Drive capacityplanning and technical specifications Collaborate with project managers and subject matter experts Spearhead DevOps initiatives and automation strategies Required Skills Primary Expertise: VMware and virtualization technologies, Enterprise datacenter More ❯
and maintain Retail WAN networks using MPLS technology. Work closely with Security Operations (SecOps) team to ensure vulnerability management is maintained. Provide technical leadership in network-related projects, including capacityplanning, upgrades, and security enhancements. Monitor network performance, troubleshoot issues, and implement proactive solutions to minimize downtime. Collaborate with cross-functional teams and third-party vendors to support More ❯
current VMware, storage platforms, cloud, and associated underlying infrastructure hardware. Backup is also your domain including data management - the availability, structure, and analysis of data. You will also do capacityplanning as well as technical & functional specification writing whilst working alongside project managers and other SME's. Alongside the above, we are looking for an engineer who also More ❯
with the goal of automating response to all non-exceptional service conditions. Influence and create new designs, architectures, standards, and methods for large-scale distributed systems. Engage in service capacityplanning, service integration and geo-expansion, software performance analysis and system tuning. Candidate must be solutions-oriented oriented using rigorous logic and methods to solve difficult problems with More ❯
with the goal of automating response to all non-exceptional service conditions. Influence and create new designs, architectures, standards, and methods for large-scale distributed systems. Engage in service capacityplanning, service integration and geo-expansion, software performance analysis and system tuning. Candidate must be solutions-oriented using rigorous logic and methods to solve difficult problems with effective More ❯
protected and only accessible by the engineer with the required skillset. Identifying and mitigating network vulnerabilities. Ensure security patches/firmware are tested and applied to maintain system security. CapacityPlanning and Optimisation - Assessing network capacity and planning for future growth. Optimising network performance by analysing traffic patterns and making all necessary adjustments. Implementing network load More ❯
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
In Technology Group Limited
service availability Act on infrastructure alerts and monitoring tools to resolve issues efficiently Deliver enhancements to IT services via BAU, project workstreams, and internal initiatives Maintain and forecast infrastructure capacity and performance Perform regular housekeeping, patching, and system upgrades Core Technical Requirements (Essential): Strong experience supporting and maintaining Red Hat Linux (RHEL) environments Proven ability to perform in-place … Amazon Linux Imaging (AMI) Exposure to Windows Server environments (2016, 2019, 2022), including Active Directory, Group Policy, DNS, DHCP Experience managing VMware datastores and LUNs , including performance tuning and capacityplanning Knowledge of vRealize Operations, Log Insight, and Network Insight tools Interested? If you're a Senior Infrastructure Engineer with hands-on Red Hat and VMware experience - and More ❯
data from in-house applications Help with the onboarding of new products for Portfolio managers Provide level three support for OpenLink and processes developed by the group Participate in capacityplanning and performance/throughput analysis Consuming and publishing transaction data in AVRO over Kafka Automation of system maintenance tasks, end-of-day processing jobs, data integrity checks … and bulk data loads/extracts Release planning and deployment Build strong relationships with support and end-users/clients Focus on client service and delivery Required Skill/Experience Extensive knowledge and experience implementing Endur for European Power & Gas are essential. Bachelor's degree in Computer Science, Electrical Engineering or equivalent is essential Experience of working in a More ❯
IT systems and infrastructures are reliable, scalable, and secure. Key Responsibilities Leadership Environment Management: Deployment & Automation: Performance & Scalability: Security & Compliance: Collaboration & Stakeholder Management: Documentation & Reporting: Incident Management & Problem Resolution: CapacityPlanning: Escalate issues as appropriate. Manage assigned risks and issues. Adhere to change, project, and analysis standards Skills, Knowledge & Abilities Experience: At least 5-7 years of experience More ❯
Network, Server, and Storage systems using group standard tools Responding to notifications/alerts for failed hardware/software, and assign to appropriate Infrastructure Engineer as required Assisting in capacityplanning and monitoring of storage and systems at all times Develop and maintain communication, dependency, and reliance plans for system reboots and outages Escalate and own calls sent … to Infrastructure line support or outside vendors, returning the result and relaying resolution back to the Service Centre where needed Develop and implement disaster recovery plans Contribute to strategic planning and technology roadmap development Ideal Candidate Minimum of 8 years' experience in IT Minimum of 2 years' management experience Customer service-driven, understanding the importance of the person and More ❯
software. Provision and support Remote Site Networks (e.g., LAN, WAN connection) and related operations (e.g., procure, design, build, systems monitoring, incident diagnostics, troubleshooting, resolution and escalation, security management, and capacityplanning/analysis) Provide Break/Fix Level 2 support for in-scope end-user hardware and software as coordinated through the Service Desk. Manage and maintain inventory More ❯
to meet the service levels of the products using the platform. Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacityplanning, and launch reviews. Maintain services once they are live by measuring and monitoring availability, latency, and overall system health. Scale systems sustainably through mechanisms like automation and More ❯
Cost Ops and Fin Ops Automating through use of DevOps capability Conducting systems risk management assessments Performance audits Designing performance test plans to address performance risk Demand forecasting/planning Development of capacity and performance models Proactive system monitoring to identify performance risks A day in the life of a Capacitas Senior Consultant includes: Leading small teams on … complex projects Managing, planning and executing delivery of complex projects Leading or participating in daily stand-up meetings with your project team Chairing team meetings Assisting, mentoring and line managing junior staff, following our people management processes Creating and executing thorough plans including budgeting, resourcing and risk management, taking into account dependencies and assumptions. Exercising your strong technical knowledge … detail Desirable Performance testing skills using HP, LoadRunner or Jmeter Performance test design Performance test automation skills Monitoring, including basic and complex metric understanding, and familiarity with monitoring agents Capacityplanning and modelling skills Experience delivering cloud cost optimisation, identifying opportunities for reduced spend Financial modelling Proven experience of financial numeracy, e.g. calculating savings, working out percentage growth More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Capacitas
Cost Ops and Fin Ops Automating through use of DevOps capability Conducting systems risk management assessments Performance audits Designing performance test plans to address performance risk Demand forecasting/planning Development of capacity and performance models Proactive system monitoring to identify performance risks A day in the life of a Capacitas Senior Consultant includes: Leading small teams on … complex projects Managing, planning and executing delivery of complex projects Leading or participating in daily stand-up meetings with your project team Chairing team meetings Assisting, mentoring and line managing junior staff, following our people management processes Creating and executing thorough plans including budgeting, resourcing and risk management, taking into account dependencies and assumptions. Exercising your strong technical knowledge … detail Desirable Performance testing skills using HP, LoadRunner or Jmeter Performance test design Performance test automation skills Monitoring, including basic and complex metric understanding, and familiarity with monitoring agents Capacityplanning and modelling skills Experience delivering cloud cost optimisation, identifying opportunities for reduced spend Financial modelling Proven experience of financial numeracy, e.g. calculating savings, working out percentage growth More ❯