Prometheus Jobs in the UK

76 to 100 of 1,246 Prometheus Jobs in the UK

Lead DevOps Engineer – SC Cleared or SC Eligible

London, England, United Kingdom
Hybrid / WFH Options
Whitehall Resources Ltd
least privilege IAM policies, role-based access controls (RBAC), automated compliance checks, and zero-trust security principles. • Monitoring, Logging & Alerting: Expertise in building centralized logging solutions, integrating ELK Stack, Prometheus, Grafana, Splunk, and AWS-native security monitoring tools such as CloudWatch, Security Hub, SIEM integrations. • CI/CD Security & Automation: Proficient in Jenkins, Git, GitHub Actions, ensuring secure CI/ More ❯
Posted:

Site Reliability Engineer, Lead

London, United Kingdom
Hybrid / WFH Options
Mistral AI
call rotations ) • Experience working against reliability KPIs (observability, alerting, SLAs) • Hands-on experience with CI/CD, containerization and orchestration tools (Docker, Kubernetes ), monitoring, logging, alerting and observability tools (Prometheus, Grafana, ELK Stack, Datadog ), infrastructure-as-code tools (Terraform, CloudFormation ) • Proficiency in scripting languages (Python, Go, Bash ) and knowledge of software development best practices • Understanding of networking, security, and system More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Junior Delivery Engineer

United Kingdom
Hybrid / WFH Options
Sportserve
activities Awareness of any cloud infrastructure principles (like AWS, GCP or OCI), understanding basic principles of secure software delivery is a plus Familiar with Observability tools like Grafana or Prometheus, understanding the importance of giving the correct visibility to our platforms and environments We highly value ownership and initiative with capabilities to drive projects independently with an organized and mindful More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Cloud Technical Architect / Data DevOps Engineer

Bristol, United Kingdom
Hewlett Packard Enterprise Development LP
such as: Docker, OpenShift, Kubernetes etc. Infrastructure as Code and CI/CD paradigms and systems such as: Ansible, Terraform, Jenkins, Bamboo, Concourse etc. Monitoring utilising products such as: Prometheus, Grafana, ELK, filebeat etc. Observability - SRE Big Data solutions (ecosystems) and technologies such as: Apache Spark and the Hadoop Ecosystem Edge technologies e.g. NGINX, HAProxy etc. Excellent knowledge of YAML More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Risk Engineering Manager

London, England, United Kingdom
Man Group
long running services and analytics in C#. We use Airflow for workflow management, Kafka for data pipelines, Bitbucket for source control, Jenkins for continuous integration, ELK for logs, Grafana, Prometheus & InfluxDb for metrics, Docker and Kubernetes for containerisation, OpenStack for our private cloud, Ansible and Terraform for architecture automation, and Slack for internal communication. We heavily utilise ArcticDB () our in More ❯
Posted:

Manual Tester (DV Security Clearance)

Basingstoke, Hampshire, South East
CGI
collaborate with DevOps to optimize build times, parallelize tests, and reduce pipeline flakiness. Result Analysis & Root Cause • Analyze test outputs, system logs, and metrics (e.g., via ELK Stack or Prometheus/Grafana) to pinpoint failures and performance regressions. • Lead root-cause investigations for infrastructure incidents, producing clear post-mortem reports and remediation recommendations. Defect Management • Log, triage, and track defects More ❯
Employment Type: Permanent
Posted:

Quant Engineer - Man Group plc

London, England, United Kingdom
Jobs via eFinancialCareers
long running services and analytics in C#. We use Airflow for workflow management, Kafka for data pipelines, Bitbucket for source control, Jenkins for continuous integration, ELK for logs, Grafana, Prometheus & InfluxDb for metrics, Docker and Kubernetes for containerisation, OpenStack for our private cloud, Ansible and Terraform for architecture automation, and Slack for internal communication. We heavily utilise ArcticDB ( https:/ More ❯
Posted:

SAP Sovereign Cloud DevOps Engineer Azure

London, England, United Kingdom
Hybrid / WFH Options
SAP
We help the world run better At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building More ❯
Posted:

SAP Sovereign Cloud DevOps Engineer Azure

London, United Kingdom
SAP SE
At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Principal DevOps Engineer

Milton Keynes, Buckinghamshire, United Kingdom
Workforce Software
About Us WorkForce Software, an ADP Company, is the first global provider of workforce management solutions with integrated employee experience capabilities. The company's WorkForce Suite adapts to each organization's needs-no matter how unique their pay rules, labor More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Solace Messaging Administrator

City of London, London, United Kingdom
BGC Group
ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root … cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning , scaling, and tuning of Solace infrastructure to meet current and … background in production support , preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN , with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix More ❯
Posted:

Solace Messaging Administrator

London Area, United Kingdom
BGC Group
ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root … cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning , scaling, and tuning of Solace infrastructure to meet current and … background in production support , preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN , with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix More ❯
Posted:

Solace Messaging Administrator

london, south east england, united kingdom
BGC Group
ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root … cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning , scaling, and tuning of Solace infrastructure to meet current and … background in production support , preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN , with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix More ❯
Posted:

Solace Messaging Administrator

london (city of london), south east england, united kingdom
BGC Group
ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root … cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning , scaling, and tuning of Solace infrastructure to meet current and … background in production support , preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN , with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix More ❯
Posted:

Solace Messaging Administrator

slough, south east england, united kingdom
BGC Group
ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root … cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning , scaling, and tuning of Solace infrastructure to meet current and … background in production support , preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN , with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix More ❯
Posted:

Mid/Senior DevOps Engineer

London, England, United Kingdom
Intelmatix
enforce security best practices across cloud and network environments. Troubleshoot deployment and performance issues across multiple environments. Set up and maintain observability tools for logging, monitoring, and alerting (e.g., Prometheus, Grafana, Loki). Contribute to internal tooling to streamline development, testing, and operations workflows. Stay current with DevOps trends and recommend improvements to tools and processes. Required Qualifications: Bachelor's … agile startup environments. Exposure to multi-cloud or hybrid cloud architectures. Tech Stack: Cloud: AWS, OCI ZTN: Cloudflare Application: Kong (API Gateway), Java Spring Boot, Python, Go, TypeScript Monitoring: Prometheus Stack (Prometheus, Grafana, Loki) Compute: ECS, EC2, Lambda Frontend: S3, CloudFront Data: Glue, S3, PostgreSQL CI/CD: GitHub Actions IaC: Terraform, AWS SAM Why Join Us? At Intelmatix, you More ❯
Posted:

DevOps GCP Engineer

london, south east england, united kingdom
HCLTech
tools; Maven, Gradle or other build tools; Ansible or other IT Automation/software provisioning tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus etc * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, NewRelic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude; A creative approach towards solving technical problems More ❯
Posted:

DevOps GCP Engineer

london (city of london), south east england, united kingdom
HCLTech
tools; Maven, Gradle or other build tools; Ansible or other IT Automation/software provisioning tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus etc * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, NewRelic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude; A creative approach towards solving technical problems More ❯
Posted:

Cloud DevOps Engineer

London, England, United Kingdom
Hybrid / WFH Options
Nivoda
Develop and deploy serverless applications using AWS Lambda and related services to enable cost-efficient and highly responsive systems. Monitoring & Incident Response : Set up proactive monitoring using AWS CloudWatch, Prometheus, or Grafana. Troubleshoot and resolve infrastructure or application issues promptly to ensure high availability. Security & Compliance : Enforce AWS security best practices, including IAM policies, VPC configurations, and security group management … tools like ArgoCD and application packaging with Helm . Strong scripting abilities (e.g., Python , Bash ) to automate workflows. Familiarity with monitoring and logging tools (e.g., AWS CloudWatch, ELK stack, Prometheus). Solid understanding of IAM , networking (VPCs, subnets, routing), and security best practices. Preferred : Experience with serverless architectures on AWS (e.g., AWS Lambda, API Gateway, DynamoDB). Knowledge of cloud More ❯
Posted:

Infrastructure Engineer

London, England, United Kingdom
Hybrid / WFH Options
Keyrock
high availability and security. Automation & CI/CD: Implement and manage CI/CD pipelines for efficient deployment, testing, and monitoring. Observability & Monitoring: Develop monitoring solutions with tools like Prometheus, Grafana, ELK stack to enhance system reliability. Security & Compliance: Apply best practices for cloud security, IAM policies, and compliance standards (SOC2, ISO 27001). Incident Response & Performance Optimization: Troubleshoot issues … Kubernetes experience (EKS, K3s, or self-managed). Proficiency in scripting with Python, Bash, or Go. Experience with Infrastructure as Code (Terraform, CloudFormation, Ansible). Familiarity with observability tools (Prometheus, Grafana, Datadog, ELK). Solid understanding of networking (VPC, Load Balancers, DNS, Firewalls). Experience with DevOps, CI/CD, and GitOps practices. Experience with high-performance, low-latency systems. More ❯
Posted:

DevOps Engineer

Manchester, England, United Kingdom
Lorien
experience of building and maintaining CI/CD pipelines using the likes of GitLab, Jenkins, CircleCI, CodeBuild etc. Familiarity with scripting (Bash or Python). Monitoring and alerting tools - Prometheus, Grafana or Splunk, ELK. We're looking for someone who wants to progress their career into the DevOps arena. Submit your CV now to be considered. IND_PC1 Carbon60, Lorien More ❯
Posted:

DevOps Engineer

bolton, greater manchester, north west england, united kingdom
Lorien
experience of building and maintaining CI/CD pipelines using the likes of GitLab, Jenkins, CircleCI, CodeBuild etc. Familiarity with scripting (Bash or Python). Monitoring and alerting tools - Prometheus, Grafana or Splunk, ELK. We're looking for someone who wants to progress their career into the DevOps arena. Submit your CV now to be considered. IND_PC1 Carbon60, Lorien More ❯
Posted:

DevOps Engineer

warrington, cheshire, north west england, united kingdom
Lorien
experience of building and maintaining CI/CD pipelines using the likes of GitLab, Jenkins, CircleCI, CodeBuild etc. Familiarity with scripting (Bash or Python). Monitoring and alerting tools - Prometheus, Grafana or Splunk, ELK. We're looking for someone who wants to progress their career into the DevOps arena. Submit your CV now to be considered. IND_PC1 Carbon60, Lorien More ❯
Posted:

DevOps Engineer

Portsmouth, England, United Kingdom
Hybrid / WFH Options
Trust In SODA
development life cycle. Infrastructure-as-code Bash Delivery methods and techniques, including agile scrum experience. Desirable Skills: RedHat OpenShift Hashicorp (such as Terraform, Packer, Vault) Ansible Observability (such as Prometheus, Grafana, Splunk) Containerised services (such as Postgres, Redis, Kafka, Keycloak, Elk) Experience of doing all the above at OS or S level YAML based pipelines. Immutable infrastructure Experience with MOD More ❯
Posted:

DevOps Engineer

Portsmouth, yorkshire and the humber, united kingdom
Hybrid / WFH Options
Trust In SODA
development life cycle. Infrastructure-as-code Bash Delivery methods and techniques, including agile scrum experience. Desirable Skills: RedHat OpenShift Hashicorp (such as Terraform, Packer, Vault) Ansible Observability (such as Prometheus, Grafana, Splunk) Containerised services (such as Postgres, Redis, Kafka, Keycloak, Elk) Experience of doing all the above at OS or S level YAML based pipelines. Immutable infrastructure Experience with MOD More ❯
Posted:
Prometheus
10th Percentile
£57,500
25th Percentile
£63,750
Median
£72,500
75th Percentile
£91,000
90th Percentile
£116,750