product delivery, ensuring initiatives meet customer needs, regulatory requirements, and commercial goals. Translate high-level business objectives into actionable product OKRs and measurable success metrics. Manage backlog prioritisation, sprint planning, and capacityplanning across product squads. Identify and mitigate delivery risks, ensuring clear communication and coordination across functions. Customer & Platform Focus Ensure the organisation's digital platform More ❯
Havant, Hampshire, England, United Kingdom Hybrid / WFH Options
Hirexa Solutions UK
Required to work in Datacenter in Havant and will also be required to travel to other remote location around UK if required. Job Description: Plan, coordinate, and forecast datacenter capacity (power, cooling, space, network) across one or more facilities. Collaborate with IT and facilities teams to define and manage hardware deployment roadmaps. Develop and maintain datacenter layout designs, rack … datacenter inventory lifecycle, including tracking of assets from procurement to decommission. Ensure datacenter plans comply with safety, security, and regulatory standards. Create documentation and reports for management regarding datacenter capacity, asset tracking, and planning initiatives. Support project management for infrastructure expansions, migrations, and hardware refreshes. Utilize DCIM (Datacenter Infrastructure Management) tools to maintain accurate records and visualizations. Evaluate … efficiency and Maintain and support enterprise Wintel environments (Windows Server 2016/2019/2022) and RHEL infrastructure. Monitor and manage storage infrastructure (SAN/NAS), including provisioning, tiering, capacityplanning, and performance optimization. Support backup, recovery, and high-availability strategies across platforms. More ❯
Shefford, Bedfordshire, South East, United Kingdom
Infoplus Technologies UK Ltd
support for critical application databases Technical Skills (Core & Top-Up) ?Strong database administration ?Database upgrades on multiple versions and releases ?High availability (RAC) setup and administration ?Disaster recovery design, planning, setup and administer the Dataguard (standby) ?Install, configure, maintain and troubleshooting on Golden Gate ?Knowledge on ExaData ?OEM Configuration and management ?Automation through scripting Strong performance analysis on AWR … views Experience on OS, Storage, Network and Virtualization activities Certification on Primary Technology Handled database estate of various technologies Process & Tools ?Service Request and Incident, Problem, Change managements ?Configuration, Capacityplanning and Knowledge management ?Experience on ITIL Tools like Remedy, Service Now or any other ticketing tool ?Identify the service improvement plan ?ITIL Foundation certified Behavioral Skill ?Strong More ❯
Reading, Berkshire, United Kingdom Hybrid / WFH Options
CBSbutler Holdings Limited trading as CBSbutler
Finesse. Manage integrations with third-party systems (WFM, CRM, CTI, SIP gateways, etc.). Monitor and troubleshoot voice infrastructure, ensuring high availability and performance. Perform regular upgrades, patching, and capacity planning. Support end-to-end call flows, routing scripts, and reporting solutions. Create and maintain technical documentation, diagrams, and knowledge base articles. Collaborate with business and IT stakeholders to More ❯
Reading, Oxfordshire, United Kingdom Hybrid / WFH Options
CBSbutler Holdings Limited trading as CBSbutler
Finesse. Manage integrations with third-party systems (WFM, CRM, CTI, SIP gateways, etc.). Monitor and troubleshoot voice infrastructure, ensuring high availability and performance. Perform regular upgrades, patching, and capacity planning. Support end-to-end call flows, routing scripts, and reporting solutions. Create and maintain technical documentation, diagrams, and knowledge base articles. Collaborate with business and IT stakeholders to More ❯
support low latency connectivity and monitoring solutions for the Global Markets business, aligned with front-office trading and regulatory needs. Apply SRE principles to improve availability, latency, performance, and capacityplanning across trading infrastructure. Collaborate with network and platform engineers to design reliable, self-healing systems and reduce manual intervention through automation. Own and execute the Global Markets … project delivery through to operational handoff and lifecycle management. Partner with business stakeholders and platform owners to ensure infrastructure and observability tooling meets evolving trading requirements. Monitor and manage capacity and performance of global connectivity systems, working with regional teams to aggregate local intelligence. Conduct deep-dive post-incident analysis and forensic reviews during high-impact market events (e.g. More ❯
Proven ability to manage demand, plan resources, and develop budgets for digital initiatives Strong track record in developing SoWs, technical proposals, and project scoping documentation Experience leading pre-project planning through to delivery execution in fast-paced environments Ability to manage multiple projects/products simultaneously with strategic alignment and value-based outcomes Comfortable communicating across senior stakeholders, internal … modern delivery frameworks Strong written and verbal communication skills Key Responsibilities: Oversee the delivery of a portfolio of digital services and products aligned to business goals Drive demand and capacityplanning across technology teams Lead the creation of technical proposals, SoWs, and planning documentation Collaborate with business, UX, and engineering teams to ensure user-focused outcomes Track More ❯
Managing the Nvidia GPU cluster.? High availability and resilience: Implement failover strategies and manage maintenance events to minimize downtime. Resource allocation and optimization: Resource partitioning (GPU resources), workload scheduling, capacityplanning Performance monitoring and troubleshooting: Performance analysis, monitoring ( Realtime) with available Nvidia and HPE tools? Incident response: node failure management, network issues, driver issues, troubleshooting common issues and More ❯
and ad-hoc meetings across multiple stakeholders and time zones. Capture and distribute concise meeting minutes, highlighting key decisions and action items. Track resource allocation and utilisation to support capacity planning. Provide administrative support, including resource onboarding/offboarding and maintaining programme documentation. Complete assigned tasks within agreed timeframes with minimal supervision. Contribute to the continuous improvement of programme More ❯
Gloucester, Gloucestershire, South West, United Kingdom
Data Careers
support services and customer satisfaction. The role oversees incident and request handling, drives service improvements, and maintains documentation, policies, and procedures. The manager will also support service transition, financial planning, and liaise with internal teams and third-party suppliers. Key Responsibilities Lead, support, mentor, and develop the Service Desk team to deliver effective customer support. Oversee daily operations, manage … patterns and trends to prevent recurrence. Manage service catalogues and ensure all supported services are documented and current. Monitor performance metrics and recommend improvements to enhance service efficiency. Conduct capacityplanning, forecasting, and system performance analysis. Provide coaching, performance feedback, and on-the-job training to staff. Facilitate communication across stakeholders and act as a single point of More ❯
Managing the Nvidia GPU cluster High availability and resilience: Implement failover strategies and manage maintenance events to minimise downtime Resource allocation and optimisation: Resource partitioning (GPU resources), workload scheduling, capacityplanning Performance monitoring and troubleshooting: Performance analysis, monitoring (realtime) with available Nvidia and HPE tools Incident response: node failure management, network issues, driver issues, troubleshooting common issues and More ❯
in post: Lead the design, implementation, and support of core data networking and wireless infrastructure , ensuring systems meet user needs and align with the University’s IT strategy. Drive capacityplanning and technical development for network services, improving processes, tools, and documentation to enhance support and service delivery. Develop and test network solutions , focusing on data, voice, and … wireless technologies; contribute to strategic planning and service improvements based on feedback and research. Coordinate with external vendors to ensure effective delivery and support of third-party network and telecom services. Lead project work using the Information Services framework , managing your own tasks and supporting the team to meet deliverables and customer expectations. Provide training and documentation for Information More ❯
Newbury, Berkshire, South East, United Kingdom Hybrid / WFH Options
MECS Communications Ltd
in next-generation network technologies to build the future of connectivity across the nation. Responsibilities: Help implement, & optimise transmission networks (SDH, DWDM, OTN, IP/MPLS) to support high-capacity voice & data services Plan & deliver transport solutions for access, aggregation, & core layers, ensuring scalability, redundancy, & low latency Perform network integration, acceptance testing, & troubleshooting across multi-vendor transmission & optical platforms … Monitor, analyse, & optimise network performance using NMS/OSS tools to maintain SLA & QoS targets Support capacityplanning & upgrades, including spectrum management, wavelength allocation, & bandwidth forecasting Manage fault resolution & escalation for transmission/transport issues to minimise downtime & improve reliability. Collaborate with IP, RAN, & Core teams to ensure seamless end-to-end service delivery Develop & maintain technical documentation … or academic experience is likely to include some of the following: Transmission Engineer or Transport Network Engineer SDH, DWDM, OTN, MPLS, IP Transport Optical Transport or Transmission Networks Network Planning, Optimisation, Troubleshooting Optical DWDM & Photonic systems Ciena OME6500 equipment knowledge & basic configuration skills Ciena One Control or MCP network management platform experience Nokia 7750 port configuration (comprehensive training given More ❯
support low latency connectivity and monitoring solutions for the Global Markets business, aligned with front-office trading and regulatory needs. Apply SRE principles to improve availability, latency, performance, and capacityplanning across trading infrastructure. Collaborate with network and platform engineers to design reliable, self-healing systems and reduce manual intervention through automation. Own and execute the Global Markets … project delivery through to operational handoff and lifecycle management. Partner with business stakeholders and platform owners to ensure infrastructure and observability tooling meets evolving trading requirements. Monitor and manage capacity and performance of global connectivity systems, working with regional teams to aggregate local intelligence. Develop and maintain automated alerting, health checks, and dashboards, supporting proactive detection of issues and More ❯
part of our team, you will contribute towards the future strategy and technology direction of our products, implement and maintain high quality code to best practices, take part in capacityplanning and design activities, and work with colleagues within FI & Derivatives team and beyond to help deliver best-in-Class systems. Qualifications: Undergraduate Degree Strong Java Developer needs More ❯
infrastructure using Infrastructure as Code tools. Define, implement, and maintain observability solutions to ensure proactive system monitoring and issue diagnosis. Diagnose and resolve production issues, including performance tuning and capacity planning. Essential Skills Proficiency in reliability engineering and Python. Good understanding of observability, including inputting probes to detect production issues. Experiencewith Infrastructure as Code in cloud environments such as More ❯
Ensure high availability, scalability, and fault tolerance of Airflow clusters. Implement monitoring, alerting, and logging for Airflow and related components. Perform upgrades and patches for platform-related components. Oversee capacityplanning, resource allocation, and optimization of Airflow workers. Maintain and configure integrations with source control systems (e.g., GitHub, GitLab) for version control. Collaborate with cloud providers (e.g., AWS More ❯
Ensure high availability, scalability, and fault tolerance of Airflow clusters. Implement monitoring, alerting, and logging for Airflow and related components. Perform upgrades and patches for platform-related components. Oversee capacityplanning, resource allocation, and optimization of Airflow workers. Maintain and configure integrations with source control systems (e.g., GitHub, GitLab) for version control. Collaborate with cloud providers (e.g., AWS More ❯
teams on integrations and performance improvements Maintain SLAs and ensure customer satisfaction Technical Focus Manage Nvidia GPU clusters and related infrastructure Implement failover, resilience, and resource optimization strategies Oversee capacityplanning and workload scheduling Monitor performance using Nvidia and HPE tools Manage incident response, node failures, and access/security controls Required Skills & Experience Strong understanding of L1 More ❯
teams on integrations and performance improvements Maintain SLAs and ensure customer satisfaction Technical Focus Manage Nvidia GPU clusters and related infrastructure Implement failover, resilience, and resource optimization strategies Oversee capacityplanning and workload scheduling Monitor performance using Nvidia and HPE tools Manage incident response, node failures, and access/security controls Required Skills & Experience Strong understanding of L1 More ❯