126 to 150 of 168 Prometheus Jobs in the UK

Infrastructure Engineers (DV Security Clearance)

Hiring Organisation
CGI
Location
Gloucestershire, United Kingdom
Employment Type
Full Time
best practices. •Troubleshoot & Resolve: Analyse and solve complex infrastructure issues with a problem-solving mindset. •Monitor & Optimise: Use monitoring and observability tools such as Prometheus, Grafana, and ELK. •Collaborate & Deliver: Work across diverse stakeholders and agile teams to drive continuous improvement. Required qualifications to be successful in this role … Argo CD •Scripting (Python, Shell) •Container orchestration tools (e.g., Docker Swarm, Apache Mesos) •Service Mesh technologies (e.g., Istio, Linkerd) •Monitoring and observability tools (Prometheus, Grafana, ELK) •Applying infrastructure security, compliance, and governance practices •Working within agile teams and using version control tools such as Git #LI-JS2 Together, as owners ...

Senior Software Engineer in Glasgow - Spire

Hiring Organisation
Jobleads-UK
Location
Glasgow, Scotland, United Kingdom
Proficiency using and developing containers for development and production Experience with Typescript and React Experience implementing monitoring and alerting system using systems like Grafana, Prometheus, or Nagios Experience with Infrastructure as Code tools such as Terraform and Ansible Familiarity with Python data visualization libraries Spire operates a... #J-18808-Ljbffr ...

Senior Backend Engineer (Python | AI | 3D Environments | £130,000)

Hiring Organisation
Paradigm Talent
Location
City of London, Greater London, UK
Familiarity with auth, billing, or subscription systems . Background in 3D graphics, creative tooling, or ML pipelines . Knowledge of observability tools like Grafana, Prometheus, or OpenTelemetry. This is a rare opportunity to join an early-stage team backed by leading deep-tech investors, building the foundation of a platform ...

Technical Lead - Platform Engineering - Linux Heavy - to £85000+ (ID49370)

Hiring Organisation
Humand Talent
Location
United Kingdom
based platform environments Kubernetes and containerised workloads Secure over-the-air update approaches Automation, image build tooling and deployment pipelines Observability tools such as Prometheus, Grafana, Loki and OpenTelemetry Platform reliability, resilience and security Cloud-native engineering practices Go, scripting and modern development tooling Distributed systems operating across multiple customer ...

Solutions Architect

Hiring Organisation
Queen Square Recruitment
Location
Wokingham, England, United Kingdom
Rancher experience is advantageous. Knowledge of CI/CD pipelines and GitOps methodologies. Monitoring & Emerging Technologies Experience with monitoring and observability tools such as: Prometheus Grafana Syslog Awareness of emerging technologies including: IoT AI Edge Computing Knowledge or experience with Powertech DSA Tools is highly desirable. What We're Looking ...

Java Developer

Hiring Organisation
Global
Location
Greater London, United Kingdom
Employment Type
Full Time
components. Operations , AI & Continuous Improvement (20%): Contribute to CI/CD pipelines, run services in Kubernetes (EKS on AWS), and support monitoring and alerting (Prometheus/Grafana) . He lp maintain a stable production environment. Use AI coding tools to accelerate delivery while maintaining rigorous code quality through reviews ...

Site Reliability Engineer, iCloud

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
platforms with Splunk, Grafana, Prometheus. Demonstrable fluency in at least one of the following languages: Java, Python, or Go. Experience with Kubernetes, Nginx, Envoy, Prometheus, and/or Docker. Preferred Qualifications Understanding of standard networking protocols and components such as: HTTP, DNS, ECMP, TCP/IP, ICMP, the OSI Model ...

Site Reliability Engineer - SRE

Hiring Organisation
Sanderson Recruitment
Location
City of London, London, United Kingdom
Employment Type
Permanent
root cause analysis programming experience Kubernetes and Docker Deploy and release services experience Experience with Greenfield projects ideally 6+ years relevant experience Grafana/Prometheus ideal Strong communication skills with the ability to proactively engage with a wide range of stakeholders If this sounds of interest to you, please ring ...

Site Reliability Engineer - SRE

Hiring Organisation
Sanderson
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £105,000 per annum
root cause analysis programming experience Kubernetes and Docker Deploy and release services experience Experience with Greenfield projects ideally 6+ years relevant experience Grafana/Prometheus ideal Strong communication skills with the ability to proactively engage with a wide range of stakeholders If this sounds of interest to you, please ring ...

Senior Site Reliability Engineer

Hiring Organisation
17918
Location
United Kingdom
findings and share learnings to prevent recurrence. Implement preventive measures and continuous improvement processes. Observability Champion monitoring, logging, and alerting strategies using tools like Prometheus, Grafana, ELK, and AWS CloudWatch. Build real-time dashboards to visualize system health and reliability metrics. Configure intelligent alerting based on anomaly detection and thresholds. ...

Senior Site Reliability Engineer

Hiring Organisation
Experian Ltd
Location
Nottingham, Nottinghamshire, East Midlands, United Kingdom
Employment Type
Permanent, Work From Home
findings and share learnings to prevent recurrence. Implement preventive measures and continuous improvement processes. Observability Champion monitoring, logging, and alerting strategies using tools like Prometheus, Grafana, ELK, and AWS CloudWatch. Build real-time dashboards to visualize system health and reliability metrics. Configure intelligent alerting based on anomaly detection and thresholds. ...

SaaS Monitoring Engineer

Hiring Organisation
eTeam
Location
London Area, United Kingdom
latency, error rates, and resource utilization across distributed systems. Continuously improve observability through logs, metrics, and traces using modern monitoring tools such as Datadog, Prometheus, Grafana, Azure Monitor, or similar platforms. Console Dashboard Development: Design and create a centralized console dashboard that provides a real-time overview of the health … understanding of distributed systems, microservices architecture, and cloud platforms (AWS, Azure, or GCP). Hands-on experience with monitoring and visualization tools (e.g., Grafana, Prometheus, ELK stack, Splunk, Datadog). Experience building interactive dashboards and console-based monitoring systems. Proficiency in scripting or programming languages such as Python, Bash ...

Infrastructure Automation Engineer

Hiring Organisation
Searchability NS&D
Location
Glasgow, Lanarkshire, Scotland, United Kingdom
Employment Type
Full-Time
Salary
£45,000 - £55,000 per annum
post-change validation, enhance observability, and support the continuous improvement of automation capabilities. Technology Stack Linux/UNIX Python Ansible Apache Airflow Prometheus Grafana Loki VMware F5 What We're Looking For We're looking for engineers with: Strong Linux/UNIX administration fundamentals Experience building infrastructure automation using Python ...

Senior Network Engineer, Cingularity

Hiring Organisation
IMG
Location
London Area, United Kingdom
customers" are our internal TOC and Event Engineering teams. Observability Design: Utilise and assist in the development of modern monitoring and logging systems (e.g., Prometheus, Grafana, ELK/OpenSearch) and the Netbox source of truth. Automation Development: Implement automation strategies for infrastructure management using tools such as Ansible, Python ...

DevOps Engineer

Hiring Organisation
Harvey Nash
Location
Edinburgh, Midlothian, Scotland, United Kingdom
Employment Type
Contract
Contract Rate
£450 - £500 per day
event-driven and horizontal scaling Design and support workflow orchestration capabilities Deploy and manage AI and GPU-based workloads Implement monitoring and observability using Prometheus, Grafana, Azure Monitor and Application Insights Design secure cloud networking (private endpoints, DNS, hub-and-spoke architecture) Manage secrets, identity, and certificates using modern authentication … similar workflows Strong knowledge of containerisation and application lifecycle management Experience with service mesh, distributed systems, and event-driven architectures Observability tooling experience (Prometheus, Grafana, Azure Monitor, App Insights) Strong knowledge of cloud networking and secure architecture design Identity and access management experience (OIDC, federation) Experience managing production environments, incident ...

Senior Database Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
United Kingdom
organization Own and continuously improve our Datadog database observability by building actionable dashboards, alerts, and service-level views using an observability stack (e.g., Prometheus, Grafana, New Relic, or equivalent). Familiarity with PGAnalyze or Percona a plus. Automate system maintenance tasks using Bash, Powershell, Python, or Ansible . Manage infrastructure … containerization solutions (Azure & Kubernetes preferred) Proficiency with operating PostgreSQL in a Linux environment is a plus Expertise with an observability/monitoring platform (e.g., Prometheus/Grafana, New Relic, Datadog, or equivalent); Datadog experience is a plus. Experience working in Agile/DevOps environments and operating production services with ITSM ...

Mid/Senior Backend Engineer (Java)

Hiring Organisation
Revolut
Location
South West London, London, United Kingdom
Employment Type
Permanent
matters: clean, maintainable code, shipped fast with TDD, DDD, and continuous integration and delivery. Our stack includes Java 17/21, GCP, Kubernetes, Grafana, Prometheus, NewRelic, PostgreSQL, Redis, Spock, jOOQ, and Flyway. Up to shape what's next in finance? Let's get in touch. What youll be doing Building ...

Mid/Senior Backend Engineer (Java)

Hiring Organisation
We Love Alfa
Location
SW1A, City of Westminster, Greater London, United Kingdom
Employment Type
Permanent
matters: clean, maintainable code, shipped fast with TDD, DDD, and continuous integration and delivery. Our stack includes Java 17/21, GCP, Kubernetes, Grafana, Prometheus, NewRelic, PostgreSQL, Redis, Spock, jOOQ, and Flyway. Up to shape what's next in finance? Let's get in touch. What you’ll be doing ...

Backend Engineer

Hiring Organisation
Revolut
Location
City of London, London, United Kingdom
Employment Type
Permanent, Work From Home
matters: clean, maintainable code, shipped fast with TDD, DDD, and continuous integration and delivery. Our stack includes Java 17/21, GCP, Kubernetes, Grafana, Prometheus, NewRelic, PostgreSQL, Redis, Spock, jOOQ, and Flyway. Up to shape what's next in finance? Let's get in touch. What youll be doing Building ...

Senior Engineering Manager, Developer Experience

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
track record of owning a technical domain end-to-end. You bring strong technical foundation across the DevEx stack: CI/CD, observability (Prometheus, Grafana, or equivalent), Kubernetes-based platforms - sufficient to make sound architectural decisions and earn engineer trust. You know how to lead through ambiguity and organisational change ...

Semantic Graph & Ontology Architect

Hiring Organisation
Adecco
Location
London, United Kingdom
Employment Type
Contract
graphs supporting workflows and audit trails. Exposure to vector retrieval and how graph context informs data re-ranking. Knowledge of observability tools like OpenTelemetry, Prometheus, and Grafana. Why Join Us? This is your opportunity to be at the forefront of data innovation in the energy sector! If you are eager ...

Senior Backend Engineer

Hiring Organisation
M-XR
Location
City of London, Greater London, UK
storage, retrieval, and management systems (AWS S3) Build job queue management for async ML workflows (SNS, SQS) Setup application monitoring and logging (CloudWatch, Grafana, Prometheus) Implement CI/CD for application deployment (Bitbucket Pipelines) Create API documentation and developer tools What we are looking for 5+ years backend development experience ...

Devops

Hiring Organisation
Hirexa Solutions UK
Location
Manchester, Lancashire, England, United Kingdom
Employment Type
Contractor
Contract Rate
Salary negotiable
Registry, Streams, ZooKeeper/KRaft DevOps/CI-CD: Jenkins, GitHub Actions, Azure DevOps Container Platform: Kubernetes, Helm, ArgoCD Cloud: AWS, Azure, GCP Monitoring: Prometheus, Grafana, Dynatrace, Confluent Control Centre Supporting: Terraform, Ansible" Skillsets required: 1. Core Kafka Expertise (Must-Have) • Kafka architecture: brokers, topics, partitions, replication • Kafka components: Kafka … Engineering (Must-Have) • Understanding of any one of the Cloud platforms: AWS/Azure/GCP 5. Monitoring, Observability & Reliability (Must-Have) • Monitoring tools: Prometheus, Grafana, Dynatrace, Confluent Control Centre • Alerting & SRE practices ...

Administrator

Hiring Organisation
Infinity Quest
Location
Manchester Area, United Kingdom
Registry, Streams, ZooKeeper/KRaft DevOps/CI-CD: Jenkins, GitHub Actions, Azure DevOps Container Platform: Kubernetes, Helm, ArgoCD Cloud: AWS, Azure, GCP Monitoring: Prometheus, Grafana, Dynatrace, Confluent Control Centre Supporting: Terraform, Ansible Core Kafka Expertise (Must-Have) • Kafka architecture: brokers, topics, partitions, replication • Kafka components: Kafka Connect, Schema Registry … Engineering (Must-Have) • Understanding of any one of the Cloud platforms: AWS/Azure/GCP 5. Monitoring, Observability & Reliability (Must-Have) • Monitoring tools: Prometheus, Grafana, Dynatrace, Confluent Control Centre • Alerting & SRE practices Scope of Services: Track400 within the Streaming CoE lab is focused on delivering and hardening enterprise Kafka ...

Site Reliability Engineer

Hiring Organisation
Digital Gurus
Location
United Kingdom
error budgets for critical data services and platform components. You will build and maintain observability dashboards and monitoring frameworks using tools such as Dynatrace, Prometheus and associated monitoring/logging/tracing platforms. You will implement end-to-end monitoring across metrics, logs and traces, helping the team detect issues … experience, ideally within production-scale environments. Hands-on experience with Kubernetes, ideally Amazon EKS. Experience with observability and monitoring tools such as Dynatrace, Prometheus, Grafana, CloudWatch, OpenTelemetry, ELK or similar. Understanding of SLIs, SLOs, error budgets and golden signals. Experience supporting incident management, root cause analysis and post-incident improvement ...