126 to 150 of 168 Prometheus Jobs in the UK

Infrastructure Engineers (DV Security Clearance)

Hiring Organisation: CGI
Location: Gloucestershire, United Kingdom
Employment Type: Full Time

best practices. •Troubleshoot & Resolve: Analyse and solve complex infrastructure issues with a problem-solving mindset. •Monitor & Optimise: Use monitoring and observability tools such as Prometheus, Grafana, and ELK. •Collaborate & Deliver: Work across diverse stakeholders and agile teams to drive continuous improvement. Required qualifications to be successful in this role … Argo CD •Scripting (Python, Shell) •Container orchestration tools (e.g., Docker Swarm, Apache Mesos) •Service Mesh technologies (e.g., Istio, Linkerd) •Monitoring and observability tools (Prometheus, Grafana, ELK) •Applying infrastructure security, compliance, and governance practices •Working within agile teams and using version control tools such as Git #LI-JS2 Together, as owners ...

Senior Software Engineer in Glasgow - Spire

Hiring Organisation: Jobleads-UK
Location: Glasgow, Scotland, United Kingdom

Proficiency using and developing containers for development and production Experience with Typescript and React Experience implementing monitoring and alerting system using systems like Grafana, Prometheus, or Nagios Experience with Infrastructure as Code tools such as Terraform and Ansible Familiarity with Python data visualization libraries Spire operates a... #J-18808-Ljbffr ...

Senior Backend Engineer (Python | AI | 3D Environments | £130,000)

Hiring Organisation: Paradigm Talent
Location: City of London, Greater London, UK

Familiarity with auth, billing, or subscription systems . Background in 3D graphics, creative tooling, or ML pipelines . Knowledge of observability tools like Grafana, Prometheus, or OpenTelemetry. This is a rare opportunity to join an early-stage team backed by leading deep-tech investors, building the foundation of a platform ...

Technical Lead - Platform Engineering - Linux Heavy - to £85000+ (ID49370)

Hiring Organisation: Humand Talent
Location: United Kingdom

based platform environments Kubernetes and containerised workloads Secure over-the-air update approaches Automation, image build tooling and deployment pipelines Observability tools such as Prometheus, Grafana, Loki and OpenTelemetry Platform reliability, resilience and security Cloud-native engineering practices Go, scripting and modern development tooling Distributed systems operating across multiple customer ...

Solutions Architect

Hiring Organisation: Queen Square Recruitment
Location: Wokingham, England, United Kingdom

Rancher experience is advantageous. Knowledge of CI/CD pipelines and GitOps methodologies. Monitoring & Emerging Technologies Experience with monitoring and observability tools such as: Prometheus Grafana Syslog Awareness of emerging technologies including: IoT AI Edge Computing Knowledge or experience with Powertech DSA Tools is highly desirable. What We're Looking ...

Java Developer

Hiring Organisation: Global
Location: Greater London, United Kingdom
Employment Type: Full Time

components. Operations , AI & Continuous Improvement (20%): Contribute to CI/CD pipelines, run services in Kubernetes (EKS on AWS), and support monitoring and alerting (Prometheus/Grafana) . He lp maintain a stable production environment. Use AI coding tools to accelerate delivery while maintaining rigorous code quality through reviews ...

Site Reliability Engineer, iCloud

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

platforms with Splunk, Grafana, Prometheus. Demonstrable fluency in at least one of the following languages: Java, Python, or Go. Experience with Kubernetes, Nginx, Envoy, Prometheus, and/or Docker. Preferred Qualifications Understanding of standard networking protocols and components such as: HTTP, DNS, ECMP, TCP/IP, ICMP, the OSI Model ...

Site Reliability Engineer - SRE

Hiring Organisation: Sanderson Recruitment
Location: City of London, London, United Kingdom
Employment Type: Permanent

root cause analysis programming experience Kubernetes and Docker Deploy and release services experience Experience with Greenfield projects ideally 6+ years relevant experience Grafana/Prometheus ideal Strong communication skills with the ability to proactively engage with a wide range of stakeholders If this sounds of interest to you, please ring ...

Site Reliability Engineer - SRE

Hiring Organisation: Sanderson
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £80,000 - £105,000 per annum

Senior Site Reliability Engineer

Hiring Organisation: 17918
Location: United Kingdom

findings and share learnings to prevent recurrence. Implement preventive measures and continuous improvement processes. Observability Champion monitoring, logging, and alerting strategies using tools like Prometheus, Grafana, ELK, and AWS CloudWatch. Build real-time dashboards to visualize system health and reliability metrics. Configure intelligent alerting based on anomaly detection and thresholds. ...

Senior Site Reliability Engineer

Hiring Organisation: Experian Ltd
Location: Nottingham, Nottinghamshire, East Midlands, United Kingdom
Employment Type: Permanent, Work From Home

SaaS Monitoring Engineer

Hiring Organisation: eTeam
Location: London Area, United Kingdom

latency, error rates, and resource utilization across distributed systems. Continuously improve observability through logs, metrics, and traces using modern monitoring tools such as Datadog, Prometheus, Grafana, Azure Monitor, or similar platforms. Console Dashboard Development: Design and create a centralized console dashboard that provides a real-time overview of the health … understanding of distributed systems, microservices architecture, and cloud platforms (AWS, Azure, or GCP). Hands-on experience with monitoring and visualization tools (e.g., Grafana, Prometheus, ELK stack, Splunk, Datadog). Experience building interactive dashboards and console-based monitoring systems. Proficiency in scripting or programming languages such as Python, Bash ...

Infrastructure Automation Engineer

Hiring Organisation: Searchability NS&D
Location: Glasgow, Lanarkshire, Scotland, United Kingdom
Employment Type: Full-Time
Salary: £45,000 - £55,000 per annum

post-change validation, enhance observability, and support the continuous improvement of automation capabilities. Technology Stack Linux/UNIX Python Ansible Apache Airflow Prometheus Grafana Loki VMware F5 What We're Looking For We're looking for engineers with: Strong Linux/UNIX administration fundamentals Experience building infrastructure automation using Python ...

Senior Network Engineer, Cingularity

Hiring Organisation: IMG
Location: London Area, United Kingdom

customers" are our internal TOC and Event Engineering teams. Observability Design: Utilise and assist in the development of modern monitoring and logging systems (e.g., Prometheus, Grafana, ELK/OpenSearch) and the Netbox source of truth. Automation Development: Implement automation strategies for infrastructure management using tools such as Ansible, Python ...

DevOps Engineer

Hiring Organisation: Harvey Nash
Location: Edinburgh, Midlothian, Scotland, United Kingdom
Employment Type: Contract
Contract Rate: £450 - £500 per day

event-driven and horizontal scaling Design and support workflow orchestration capabilities Deploy and manage AI and GPU-based workloads Implement monitoring and observability using Prometheus, Grafana, Azure Monitor and Application Insights Design secure cloud networking (private endpoints, DNS, hub-and-spoke architecture) Manage secrets, identity, and certificates using modern authentication … similar workflows Strong knowledge of containerisation and application lifecycle management Experience with service mesh, distributed systems, and event-driven architectures Observability tooling experience (Prometheus, Grafana, Azure Monitor, App Insights) Strong knowledge of cloud networking and secure architecture design Identity and access management experience (OIDC, federation) Experience managing production environments, incident ...

Senior Database Site Reliability Engineer

Hiring Organisation: Jobleads-UK
Location: United Kingdom

organization Own and continuously improve our Datadog database observability by building actionable dashboards, alerts, and service-level views using an observability stack (e.g., Prometheus, Grafana, New Relic, or equivalent). Familiarity with PGAnalyze or Percona a plus. Automate system maintenance tasks using Bash, Powershell, Python, or Ansible . Manage infrastructure … containerization solutions (Azure & Kubernetes preferred) Proficiency with operating PostgreSQL in a Linux environment is a plus Expertise with an observability/monitoring platform (e.g., Prometheus/Grafana, New Relic, Datadog, or equivalent); Datadog experience is a plus. Experience working in Agile/DevOps environments and operating production services with ITSM ...

Mid/Senior Backend Engineer (Java)

Hiring Organisation: Revolut
Location: South West London, London, United Kingdom
Employment Type: Permanent

matters: clean, maintainable code, shipped fast with TDD, DDD, and continuous integration and delivery. Our stack includes Java 17/21, GCP, Kubernetes, Grafana, Prometheus, NewRelic, PostgreSQL, Redis, Spock, jOOQ, and Flyway. Up to shape what's next in finance? Let's get in touch. What youll be doing Building ...

Mid/Senior Backend Engineer (Java)

Hiring Organisation: We Love Alfa
Location: SW1A, City of Westminster, Greater London, United Kingdom
Employment Type: Permanent

Backend Engineer

Hiring Organisation: Revolut
Location: City of London, London, United Kingdom
Employment Type: Permanent, Work From Home

Senior Engineering Manager, Developer Experience

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

track record of owning a technical domain end-to-end. You bring strong technical foundation across the DevEx stack: CI/CD, observability (Prometheus, Grafana, or equivalent), Kubernetes-based platforms - sufficient to make sound architectural decisions and earn engineer trust. You know how to lead through ambiguity and organisational change ...

Semantic Graph & Ontology Architect

Hiring Organisation: Adecco
Location: London, United Kingdom
Employment Type: Contract

graphs supporting workflows and audit trails. Exposure to vector retrieval and how graph context informs data re-ranking. Knowledge of observability tools like OpenTelemetry, Prometheus, and Grafana. Why Join Us? This is your opportunity to be at the forefront of data innovation in the energy sector! If you are eager ...

Senior Backend Engineer

Hiring Organisation: M-XR
Location: City of London, Greater London, UK

storage, retrieval, and management systems (AWS S3) Build job queue management for async ML workflows (SNS, SQS) Setup application monitoring and logging (CloudWatch, Grafana, Prometheus) Implement CI/CD for application deployment (Bitbucket Pipelines) Create API documentation and developer tools What we are looking for 5+ years backend development experience ...

Devops

Hiring Organisation: Hirexa Solutions UK
Location: Manchester, Lancashire, England, United Kingdom
Employment Type: Contractor
Contract Rate: Salary negotiable

Registry, Streams, ZooKeeper/KRaft DevOps/CI-CD: Jenkins, GitHub Actions, Azure DevOps Container Platform: Kubernetes, Helm, ArgoCD Cloud: AWS, Azure, GCP Monitoring: Prometheus, Grafana, Dynatrace, Confluent Control Centre Supporting: Terraform, Ansible" Skillsets required: 1. Core Kafka Expertise (Must-Have) • Kafka architecture: brokers, topics, partitions, replication • Kafka components: Kafka … Engineering (Must-Have) • Understanding of any one of the Cloud platforms: AWS/Azure/GCP 5. Monitoring, Observability & Reliability (Must-Have) • Monitoring tools: Prometheus, Grafana, Dynatrace, Confluent Control Centre • Alerting & SRE practices ...

Administrator

Hiring Organisation: Infinity Quest
Location: Manchester Area, United Kingdom

Registry, Streams, ZooKeeper/KRaft DevOps/CI-CD: Jenkins, GitHub Actions, Azure DevOps Container Platform: Kubernetes, Helm, ArgoCD Cloud: AWS, Azure, GCP Monitoring: Prometheus, Grafana, Dynatrace, Confluent Control Centre Supporting: Terraform, Ansible Core Kafka Expertise (Must-Have) • Kafka architecture: brokers, topics, partitions, replication • Kafka components: Kafka Connect, Schema Registry … Engineering (Must-Have) • Understanding of any one of the Cloud platforms: AWS/Azure/GCP 5. Monitoring, Observability & Reliability (Must-Have) • Monitoring tools: Prometheus, Grafana, Dynatrace, Confluent Control Centre • Alerting & SRE practices Scope of Services: Track400 within the Streaming CoE lab is focused on delivering and hardening enterprise Kafka ...

Site Reliability Engineer

Hiring Organisation: Digital Gurus
Location: United Kingdom

error budgets for critical data services and platform components. You will build and maintain observability dashboards and monitoring frameworks using tools such as Dynatrace, Prometheus and associated monitoring/logging/tracing platforms. You will implement end-to-end monitoring across metrics, logs and traces, helping the team detect issues … experience, ideally within production-scale environments. Hands-on experience with Kubernetes, ideally Amazon EKS. Experience with observability and monitoring tools such as Dynatrace, Prometheus, Grafana, CloudWatch, OpenTelemetry, ELK or similar. Understanding of SLIs, SLOs, error budgets and golden signals. Experience supporting incident management, root cause analysis and post-incident improvement ...