Manchester, North West, United Kingdom Hybrid / WFH Options
Hays
Strong understanding of networking, virtualisation, and cloud security principles. Operate, maintain, and enhance the Azure Virtual Desktop (AVD) environment. Experience with monitoring and logging tools (e.g., Azure Monitor, CloudWatch, Prometheus). Expert in setting up and managing host pools, session hosts, user access, application layers, and FSLogix profiles. Strong knowledge of cloud architecture, design, and implementation principles and practices. Proficiency More ❯
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Hays Specialist Recruitment Limited
Strong understanding of networking, virtualisation, and cloud security principles. Operate, maintain, and enhance the Azure Virtual Desktop (AVD) environment. Experience with monitoring and logging tools (e.g., Azure Monitor, CloudWatch, Prometheus). Expert in setting up and managing host pools, session hosts, user access, application layers, and FSLogix profiles. Strong knowledge of cloud architecture, design, and implementation principles and practices. Proficiency More ❯
Newcastle Upon Tyne, Tyne and Wear, England, United Kingdom
Nigel Wright Group
stakeholders, end-users and technologists ITIL (or similar) certification (or experience working within an ITIL framework) Strong understanding of application design, relational databases (SQL Server), monitoring and alerting tools (Grafana, Prometheus, Victoria Metrics), scheduling tools (Control-M), operating systems (Windows/Linux), Kubernetes, cloud platforms (Azure), issue tracking and source control (JIRA, Git, Bitbucket). Interview Process: Coding Challenge – We would More ❯
writing code that is intuitive, clear, and easy to test Developing observability for new and existing ML applications and GenAI/LLM integrations, making use of the Grafana Stack (Prometheus, Loki, Tempo) Working closely with Data Scientists and ML Engineers throughout the lifecycle of productionising their models Being responsive to incidents regarding ML applications - including an understanding of how to More ❯
team. Some things that may be a bonus (but learning these on the job will be fine): Experience in some of the other key technologies we use (Linux, Docker, Prometheus, Express, MongoDB, Redis, PostgreSQL). Experience integrating large scale pub-sub systems such as Google Cloud Pub/Sub. We expect you to: Work with us full time. Have strong More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Eutopia Solutions
working with or supporting Microsoft technology components such as Windows Server OS, IIS, AD, DNS, SQL Server, and Exchange Experience with containerization technologies such as Docker, Kubernetes Experience with Prometheus may be beneficial Apply now for immediate consideration More ❯
Warrington, Cheshire, North West England, United Kingdom Hybrid / WFH Options
Eutopia Solutions
working with or supporting Microsoft technology components such as Windows Server OS, IIS, AD, DNS, SQL Server, and Exchange Experience with containerization technologies such as Docker, Kubernetes Experience with Prometheus may be beneficial Apply now for immediate consideration More ❯
Bolton, Greater Manchester, North West England, United Kingdom Hybrid / WFH Options
Eutopia Solutions
working with or supporting Microsoft technology components such as Windows Server OS, IIS, AD, DNS, SQL Server, and Exchange Experience with containerization technologies such as Docker, Kubernetes Experience with Prometheus may be beneficial Apply now for immediate consideration More ❯
selection. Collaboration: Work closely with development, testing, and business teams to ensure architectures are implementable, secure, and aligned with business needs. Monitoring & Observability: Integrate observability tools such as CloudWatch, Prometheus, and Datadog, defining KPIs and dashboards for operational excellence. Your Profile Essential Skills & Experience: AWS Certified Solutions Architect (Associate/Professional). Strong expertise in the AWS Well-Architected Framework. More ❯
evaluate and implement new technologies, and oversee their integration. Collaborate with external vendors and partners to ensure high-quality service delivery. Utilise and develop monitoring systems (e.g., Zabbix, Grafana, Prometheus) and oversee client reporting systems. Skills and Qualifications Essential: Highly experienced in a technical leadership role within a 24/7 broadcast, network operations centre (NOC), or Master Control Room More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Christy Media Solutions
evaluate and implement new technologies, and oversee their integration. Collaborate with external vendors and partners to ensure high-quality service delivery. Utilise and develop monitoring systems (e.g., Zabbix, Grafana, Prometheus) and oversee client reporting systems. Skills and Qualifications Essential: Highly experienced in a technical leadership role within a 24/7 broadcast, network operations centre (NOC), or Master Control Room More ❯
London, South East England, United Kingdom Hybrid / WFH Options
Christy Media Solutions
evaluate and implement new technologies, and oversee their integration. Collaborate with external vendors and partners to ensure high-quality service delivery. Utilise and develop monitoring systems (e.g., Zabbix, Grafana, Prometheus) and oversee client reporting systems. Skills and Qualifications Essential: Highly experienced in a technical leadership role within a 24/7 broadcast, network operations centre (NOC), or Master Control Room More ❯
Slough, South East England, United Kingdom Hybrid / WFH Options
Christy Media Solutions
evaluate and implement new technologies, and oversee their integration. Collaborate with external vendors and partners to ensure high-quality service delivery. Utilise and develop monitoring systems (e.g., Zabbix, Grafana, Prometheus) and oversee client reporting systems. Skills and Qualifications Essential: Highly experienced in a technical leadership role within a 24/7 broadcast, network operations centre (NOC), or Master Control Room More ❯
London (City of London), South East England, United Kingdom Hybrid / WFH Options
Christy Media Solutions
evaluate and implement new technologies, and oversee their integration. Collaborate with external vendors and partners to ensure high-quality service delivery. Utilise and develop monitoring systems (e.g., Zabbix, Grafana, Prometheus) and oversee client reporting systems. Skills and Qualifications Essential: Highly experienced in a technical leadership role within a 24/7 broadcast, network operations centre (NOC), or Master Control Room More ❯
ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root … cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana; proactively identify and address anomalies. Configure and optimize Solace across WAN environments, ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning, scaling, and tuning of Solace infrastructure to meet current and … background in production support, preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN, with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management, performance tuning, and system scaling. Familiarity with Linux/Unix More ❯
London (City of London), South East England, United Kingdom
BGC Group
ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root … cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana; proactively identify and address anomalies. Configure and optimize Solace across WAN environments, ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning, scaling, and tuning of Solace infrastructure to meet current and … background in production support, preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN, with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management, performance tuning, and system scaling. Familiarity with Linux/Unix More ❯
and comfort with technical details are must-haves. You’ll be on the technical side too, having experience with containerised platforms using Kubernetes, databases, and observability tools such as Prometheus and OpenTelemetry. This is a chance to shape the future of observability and security, build products people count on, and do it all with curiosity and creativity. More ❯
London (City of London), South East England, United Kingdom
LinuxRecruit
and comfort with technical details are must-haves. You’ll be on the technical side too, having experience with containerised platforms using Kubernetes, databases, and observability tools such as Prometheus and OpenTelemetry. This is a chance to shape the future of observability and security, build products people count on, and do it all with curiosity and creativity. More ❯
Wokingham, Berkshire, England, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions Ltd
and smart architecture are key. What You’ll Be Working With: MySQL, Vitess, and Linux in production (don't worry if you haven't worked with Vitess) Monitoring tools like Prometheus and Grafana Shard allocation, replication tuning, disk performance Backup, restore, and DR testing Data migrations and custom table loads for NHS tenants Zero-downtime patching and performance baselining What You More ❯