Prometheus Jobs in the UK

201 to 225 of 256 Prometheus Jobs in the UK

Cloud Services Engineer

Manchester, North West, United Kingdom
Hybrid / WFH Options
Hays
Strong understanding of networking, virtualisation, and cloud security principles. Operate, maintain, and enhance the Azure Virtual Desktop (AVD) environment. Experience with monitoring and logging tools (e.g., Azure Monitor, CloudWatch, Prometheus). Expert in setting up and managing host pools, session hosts, user access, application layers, and FSLogix profiles. Strong knowledge of cloud architecture, design, and implementation principles and practices. Proficiency More ❯
Employment Type: Permanent, Work From Home
Salary: £65,000
Posted:

Cloud Services Engineer

Manchester, Lancashire, England, United Kingdom
Hybrid / WFH Options
Hays Specialist Recruitment Limited
Strong understanding of networking, virtualisation, and cloud security principles. Operate, maintain, and enhance the Azure Virtual Desktop (AVD) environment. Experience with monitoring and logging tools (e.g., Azure Monitor, CloudWatch, Prometheus). Expert in setting up and managing host pools, session hosts, user access, application layers, and FSLogix profiles. Strong knowledge of cloud architecture, design, and implementation principles and practices. Proficiency More ❯
Employment Type: Full-Time
Salary: £55,000 - £65,000 per annum
Posted:

Application Support Analyst

Newcastle Upon Tyne, Tyne and Wear, England, United Kingdom
Nigel Wright Group
stakeholders, end0users and technologists ITIL (or similar) certification (or experience working within an ITIL framework) Strong understanding of application design, rational databases (SQL Server), monitoring and alerting tools (Grafana, Prometheus, Victoria Metrics), scheduling tools (Control-M), operating systems (Windows/Linux), Kubernetes, cloud platforms (Azure), issue tracking and source control (JIRA, Git, Bitbucket). Interview Process: Coding Challenge – We would More ❯
Employment Type: Full-Time
Salary: Salary negotiable
Posted:

Senior/Lead Python Engineer (MLOps)

London, United Kingdom
Hybrid / WFH Options
Mimica
writing code that is intuitive, clear, and easy to test Developing observability for new and existing ML applications and GenAI/LLM integrations, making use of the Grafana Stack (Prometheus, Loki, Tempo) Working closely with Data Scientists and ML Engineers throughout the lifecycle of productionising their models Being responsive to incidents regarding ML applications - including an understanding of how to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer - Overleaf

United Kingdom
Hybrid / WFH Options
Overleaf Enterprise
team. Some things that may be a bonus (but learning these on the job will be fine): Experience in some of the other key technologies we use (Linux, Docker, Prometheus, Express, MongoDB, Redis, PostgreSQL). Experience integrating large scale pub-sub systems such as Google Cloud Pub/Sub. We expect you to: Work with us full time. Have strong More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Infrastructure Monitoring Engineer

Manchester, England, United Kingdom
Hybrid / WFH Options
Eutopia Solutions
working with or supporting Microsoft technology components such as Windows Server OS, IIS, AD, DNS, SQL Server, and Exchange Experience with containerization technologies such as Docker, Kubernetes Experience with Prometheus may be beneficial Apply now for immediate consideration More ❯
Posted:

Infrastructure Monitoring Engineer

warrington, cheshire, north west england, united kingdom
Hybrid / WFH Options
Eutopia Solutions
working with or supporting Microsoft technology components such as Windows Server OS, IIS, AD, DNS, SQL Server, and Exchange Experience with containerization technologies such as Docker, Kubernetes Experience with Prometheus may be beneficial Apply now for immediate consideration More ❯
Posted:

Infrastructure Monitoring Engineer

bolton, greater manchester, north west england, united kingdom
Hybrid / WFH Options
Eutopia Solutions
working with or supporting Microsoft technology components such as Windows Server OS, IIS, AD, DNS, SQL Server, and Exchange Experience with containerization technologies such as Docker, Kubernetes Experience with Prometheus may be beneficial Apply now for immediate consideration More ❯
Posted:

Digital Solution Architect

Redhill, Surrey, South East, United Kingdom
Stott & May Professional Search Limited
selection. Collaboration: Work closely with development, testing, and business teams to ensure architectures are implementable, secure, and aligned with business needs. Monitoring & Observability: Integrate observability tools such as CloudWatch, Prometheus, and Datadog, defining KPIs and dashboards for operational excellence. Your Profile Essential Skills & Experience: AWS Certified Solutions Architect (Associate/Professional). Strong expertise in the AWS Well-Architected Framework. More ❯
Employment Type: Contract
Posted:

Head of Broadcast Network Operations Centre

London Area, United Kingdom
Hybrid / WFH Options
Christy Media Solutions
evaluate and implement new technologies, and oversee their integration. Collaborate with external vendors and partners to ensure high-quality service delivery. Utilise and develop monitoring systems (e.g., Zabbix, Grafana, Prometheus) and oversee client reporting systems. Skills and Qualifications Essential: Highly experienced in a technical leadership role within a 24/7 broadcast, network operations centre (NOC), or Master Control Room More ❯
Posted:

Head of Broadcast Network Operations Centre

City of London, London, United Kingdom
Hybrid / WFH Options
Christy Media Solutions
evaluate and implement new technologies, and oversee their integration. Collaborate with external vendors and partners to ensure high-quality service delivery. Utilise and develop monitoring systems (e.g., Zabbix, Grafana, Prometheus) and oversee client reporting systems. Skills and Qualifications Essential: Highly experienced in a technical leadership role within a 24/7 broadcast, network operations centre (NOC), or Master Control Room More ❯
Posted:

Head of Broadcast Network Operations Centre

london, south east england, united kingdom
Hybrid / WFH Options
Christy Media Solutions
evaluate and implement new technologies, and oversee their integration. Collaborate with external vendors and partners to ensure high-quality service delivery. Utilise and develop monitoring systems (e.g., Zabbix, Grafana, Prometheus) and oversee client reporting systems. Skills and Qualifications Essential: Highly experienced in a technical leadership role within a 24/7 broadcast, network operations centre (NOC), or Master Control Room More ❯
Posted:

Head of Broadcast Network Operations Centre

slough, south east england, united kingdom
Hybrid / WFH Options
Christy Media Solutions
evaluate and implement new technologies, and oversee their integration. Collaborate with external vendors and partners to ensure high-quality service delivery. Utilise and develop monitoring systems (e.g., Zabbix, Grafana, Prometheus) and oversee client reporting systems. Skills and Qualifications Essential: Highly experienced in a technical leadership role within a 24/7 broadcast, network operations centre (NOC), or Master Control Room More ❯
Posted:

Head of Broadcast Network Operations Centre

london (city of london), south east england, united kingdom
Hybrid / WFH Options
Christy Media Solutions
evaluate and implement new technologies, and oversee their integration. Collaborate with external vendors and partners to ensure high-quality service delivery. Utilise and develop monitoring systems (e.g., Zabbix, Grafana, Prometheus) and oversee client reporting systems. Skills and Qualifications Essential: Highly experienced in a technical leadership role within a 24/7 broadcast, network operations centre (NOC), or Master Control Room More ❯
Posted:

Reliability Engineer

City of London, London, United Kingdom
BGC Group
ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root … cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning , scaling, and tuning of Solace infrastructure to meet current and … background in production support , preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN , with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix More ❯
Posted:

Reliability Engineer

London Area, United Kingdom
BGC Group
ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root … cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning , scaling, and tuning of Solace infrastructure to meet current and … background in production support , preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN , with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix More ❯
Posted:

Reliability Engineer

slough, south east england, united kingdom
BGC Group
ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root … cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning , scaling, and tuning of Solace infrastructure to meet current and … background in production support , preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN , with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix More ❯
Posted:

Reliability Engineer

london, south east england, united kingdom
BGC Group
ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root … cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning , scaling, and tuning of Solace infrastructure to meet current and … background in production support , preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN , with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix More ❯
Posted:

Reliability Engineer

london (city of london), south east england, united kingdom
BGC Group
ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root … cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning , scaling, and tuning of Solace infrastructure to meet current and … background in production support , preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN , with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix More ❯
Posted:

Technical Product Manager (Observability)

London Area, United Kingdom
LinuxRecruit
and comfort with technical details are must-haves. You’ll be on the technical side too, having experience with containerised platforms using Kubernetes, databases, and observability tools such as Prometheus and OpenTelemetry. This is a chance to shape the future of observability and security, build products people count on, and do it all with curiosity and creativity. More ❯
Posted:

Technical Product Manager (Observability)

City of London, London, United Kingdom
LinuxRecruit
and comfort with technical details are must-haves. You’ll be on the technical side too, having experience with containerised platforms using Kubernetes, databases, and observability tools such as Prometheus and OpenTelemetry. This is a chance to shape the future of observability and security, build products people count on, and do it all with curiosity and creativity. More ❯
Posted:

Technical Product Manager (Observability)

slough, south east england, united kingdom
LinuxRecruit
and comfort with technical details are must-haves. You’ll be on the technical side too, having experience with containerised platforms using Kubernetes, databases, and observability tools such as Prometheus and OpenTelemetry. This is a chance to shape the future of observability and security, build products people count on, and do it all with curiosity and creativity. More ❯
Posted:

Technical Product Manager (Observability)

london, south east england, united kingdom
LinuxRecruit
and comfort with technical details are must-haves. You’ll be on the technical side too, having experience with containerised platforms using Kubernetes, databases, and observability tools such as Prometheus and OpenTelemetry. This is a chance to shape the future of observability and security, build products people count on, and do it all with curiosity and creativity. More ❯
Posted:

Technical Product Manager (Observability)

london (city of london), south east england, united kingdom
LinuxRecruit
and comfort with technical details are must-haves. You’ll be on the technical side too, having experience with containerised platforms using Kubernetes, databases, and observability tools such as Prometheus and OpenTelemetry. This is a chance to shape the future of observability and security, build products people count on, and do it all with curiosity and creativity. More ❯
Posted:

DBA SaaS Platform Supporting NHS £50k to £75k Bonus Hybrid

Wokingham, Berkshire, England, United Kingdom
Hybrid / WFH Options
Opus Recruitment Solutions Ltd
and smart architecture are key. What You’ll Be Working With: MySQL , Vitess , and Linux in production (Dont worry if you haven't worked with Vitess) Monitoring tools like Prometheus and Grafana Shard allocation, replication tuning, disk performance Backup, restore, and DR testing Data migrations and custom table loads for NHS tenants Zero-downtime patching and performance baselining What You More ❯
Employment Type: Full-Time
Salary: £50,000 - £75,000 per annum
Posted:
Prometheus
10th Percentile
£55,000
25th Percentile
£63,750
Median
£72,500
75th Percentile
£85,000
90th Percentile
£118,000