26 to 50 of 59 Prometheus Jobs in London

Platform Engineer

Hiring Organisation
SR2 | Socially Responsible Recruitment | Certified B Corporation™
Location
City of London, London, United Kingdom
patterns Eligibility for SC clearance Nice to have AWS Landing Zone accelerator or framework experience Kubernetes/EKS and container platforms Observability tooling (Grafana, Prometheus, ELK, OpenTelemetry) Python, Go or similar for automation Experience working directly with AWS on architecture or delivery The nitty and gritty ...

Architect

Hiring Organisation
Hellowork Consultants
Location
City of London, London, United Kingdom
security, network policies Helm chart authoring, deployment strategies, custom charts, container registries Ingress controllers, API gateways, service mesh, and traffic policy enforcement Observability (Prometheus, Grafana), log pipelines, distributed tracing High availability, cluster upgrades, autoscaling strategy, performance tuning Lead advanced troubleshooting (pods, networking, DNS, controllers, storage, ingress). 5. Azure ...

API Platform Architect

Hiring Organisation
Hellowork Consultants
Location
London Area, United Kingdom
security, network policies Helm chart authoring, deployment strategies, custom charts, container registries Ingress controllers, API gateways, service mesh, and traffic policy enforcement Observability (Prometheus, Grafana), log pipelines, distributed tracing High availability, cluster upgrades, autoscaling strategy, performance tuning Lead advanced troubleshooting (pods, networking, DNS, controllers, storage, ingress). 5. Azure ...

Senior DevOps Consultant

Hiring Organisation
Exponential-e
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Salary negotiable
limited to; Elasticsearch, NiFi, Rabbit, Kafka, MongoDB, Hadoop, Ansible, Git and Kubernetes) Development of dashboards for monitoring and alerting through Grafana, Splunk, Prometheus and OpenText Om and Ops Bridge On prem to cloud application migration Full, current UK Driving license and provision of vehicle for business purposes Our People ...

Software Engineering Manager - UCX (Web Frontend)

Hiring Organisation
Hargreaves Lansdown
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
high-quality user interfaces. Experience working in cloud-native environments, including AWS, Docker, and Kubernetes , with familiarity in observability and monitoring tools such as Prometheus and Grafana. Strong advocate for quality and security , embedding automated testing, code quality checks, and security scanning into development pipelines. Passionate about mentoring and developing ...

Senior DevOps Engineer

Hiring Organisation
Xact Placements Limited
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£90,000 per annum
looking for Significant experience building and operating distributed systems at scale Strong cloud background (AWS/Azure), Terraform and Kubernetes Experience with observability tooling (Prometheus, Grafana, EFK) and messaging systems (Kafka) Solid understanding of networking fundamentals and global architecture Comfortable operating at Principal level and influencing technical direction ...

Observability Architect

Hiring Organisation
Fairfield Consultancy Services Ltd
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£600 - £625 per day
cloud-native monitoring tools, and hybrid observability. Experience with: APM platforms: Dynatrace, AppDynamics, Datadog Logging platforms: Splunk, ELK/Opensearch, CloudWatch Logs Metrics & telemetry: Prometheus, Grafana, OpenTelemetry Event management: ServiceNow, PagerDuty, Moogsoft, BigPanda Strong knowledge of instrumentation for distributed systems, microservices, containers (EKS, ECS), serverless workloads, and legacy systems. Migration ...

AWS Solutions Architect

Hiring Organisation
Henderson Scott
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£550 - £575 per day
Experience: We value architects who can adapt quickly to unfamiliar tools. Experience in any of the following is highly advantageous: Observability: Elasticsearch Stack, Dynatrace, Prometheus, or Grafana. Security & Identity: Hashicorp Vault, LDAP, Redhat SSO, OIDC, and Firewalling (Fortigate/AWS Network Firewall). Infrastructure/DevOps: Terraform, Concourse ...

SC Technical Architect CGEMJP

Hiring Organisation
Experis
Location
Croydon, London, United Kingdom
Employment Type
Contract
Contract Rate
GBP Annual
able to quickly learn and adapt to unfamiliar tools. Whilst not essential, it is desirable architects have expertise in the following; Elasticsearch Stack Dynatrace Prometheus & Grafana AWS EKS & KOPS Hashicorp Consul/Vault VPNs - e.g. OpenVPN, Fortigate site to site VPNs Atlassian tooling - e.g. Bitbucket, confluence, JIRA, Crowd AWS Workspaces ...

SC Technical Architect CGEMJP00326902

Hiring Organisation
Experis
Location
Croydon, London, United Kingdom
Employment Type
Contract
able to quickly learn and adapt to unfamiliar tools. Whilst not essential, it is desirable architects have expertise in the following; Elasticsearch Stack Dynatrace * Prometheus & Grafana AWS EKS & KOPS Hashicorp Consul/Vault VPNs - e.g. OpenVPN, Fortigate site to site VPNs Atlassian tooling - e.g. Bitbucket, confluence, JIRA, Crowd AWS Workspaces ...

Network SRE

Hiring Organisation
KBC Technologies Group
Location
London Area, United Kingdom
remediation Reduce manual toil through self-healing and event-driven automation Observability & Monitoring Implement comprehensive network observability using tools such as: Grafana Splunk Prometheus and related telemetry systems Develop dashboards, alerts, and reports that provide actionable insights Correlate network telemetry with application and system metrics Proactively identify performance degradation ...

Software Engineer - Core Wealth

Hiring Organisation
Hargreaves Lansdown
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
with Java and Spring Boot. Comfort working in a cloud-native environment - Kubernetes (EKS), containers, scaling etc. An understanding of observability, using tools like Prometheus and Grafana to keep services healthy and understand usage patterns. Familiarity with some AWS services and how to integrate them into modern applications. A keen ...

Site Reliability Engineer

Hiring Organisation
Revybe IT Recruitment Ltd
Location
City of London, London, England, United Kingdom
Employment Type
Full-Time
Salary
£65,000 - £90,000 per annum
Tech Stack Cloud: AWS (EC2, RDS, S3, IAM, Lambda, CloudWatch) Containerisation & Orchestration: Docker, Kubernetes (EKS) Infrastructure as Code: Terraform Configuration Management: Ansible Monitoring & Observability: Prometheus, Grafana, ELK Stack CI/CD: GitHub Actions Scripting & Automation: Python, Bash or Go What You’ll Be Doing Designing and maintaining reliable, scalable … cloud infrastructure (AWS preferred) in production. Proven background in Kubernetes operations (EKS, Helm, or similar). Solid knowledge of monitoring, alerting, and logging (Grafana, Prometheus, ELK). Hands-on experience with Terraform and CI/CD tooling. Strong scripting or development background (Python, Go, or similar). Excellent troubleshooting skills ...

Principal Platform Engineer

Hiring Organisation
XACT PLACEMENTS LIMITED
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£90,000
looking for Significant experience building and operating distributed systems at scale Strong cloud background (AWS/Azure), Terraform and Kubernetes Experience with observability tooling (Prometheus, Grafana, EFK) and messaging systems (Kafka) Solid understanding of networking fundamentals and global architecture Comfortable operating at Principal level and influencing technical direction ...

Senior Linux Infrastructure Engineer

Hiring Organisation
Levy Global
Location
London Area, United Kingdom
latency environments Strong background in automation and configuration management (e.g. Ansible, Puppet, Chef) Hands-on experience with monitoring and observability tools (e.g. Prometheus, Grafana, Elastic, Splunk) Exposure to build and release management pipelines Programming skills in Python and Bash (C/C++ advantageous) Solid understanding of networking fundamentals and protocols ...

Site Reliability Engineer - SRE

Hiring Organisation
Sanderson Recruitment
Location
City of London, London, United Kingdom
Employment Type
Permanent
root cause analysis programming experience Kubernetes and Docker Deploy and release services experience Experience with Greenfield projects ideally 6+ years relevant experience Grafana/Prometheus ideal Strong communication skills with the ability to proactively engage with a wide range of stakeholders If this sounds of interest to you, please ring ...

Site Reliability Engineer - SRE

Hiring Organisation
Sanderson
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £105,000 per annum
root cause analysis programming experience Kubernetes and Docker Deploy and release services experience Experience with Greenfield projects ideally 6+ years relevant experience Grafana/Prometheus ideal Strong communication skills with the ability to proactively engage with a wide range of stakeholders If this sounds of interest to you, please ring ...

Principal Engineer

Hiring Organisation
Motive Group
Location
City of London, London, United Kingdom
orchestration. A strong grasp of Infrastructure-as-Code (Terraform) and configuration management tools (Ansible, Puppet, or similar). Strong observability experience using tools like Prometheus/Mimir, Loki, Tempo, Grafana, Alertmanager. Experience deploying and operating large-scale GPU clusters or HPC systems (Ideally). Working knowledge of ML infrastructure ...

Hardware Infrastructure Engineer

Hiring Organisation
Levy Global
Location
City of London, London, United Kingdom
platforms: HPE, Dell, Supermicro, Lenovo Strong ownership mindset and documentation discipline Nice-to-Have Experience in low-latency, HPC, or financial environments Observability tooling (Prometheus, Grafana, VictoriaLogs) Firmware lifecycle programs at scale Python/Bash scripting Capacity planning or refresh program exposure ...

Platform Engineer

Hiring Organisation
Ncounter LTD
Location
East London, London, United Kingdom
Employment Type
Permanent
analyse issues across application, network, and infrastructure layers Clear communication skills and the ability to collaborate across engineering teams Useful Extras Experience with Prometheus or Grafana Knowledge of Terraform, Ansible, or similar infrastructure as code tools If you are a practical engineer who enjoys owning and improving Solace-based messaging ...

Platform Engineering – Solace

Hiring Organisation
Ncounter
Location
East London, London, England, United Kingdom
Employment Type
Full-Time
Salary
£13,000 - £150,000 per annum
analyse issues across application, network, and infrastructure layers • Clear communication skills and the ability to collaborate across engineering teams Useful Extras • Experience with Prometheus or Grafana • Knowledge of Terraform, Ansible, or similar infrastructure as code tools If you are a practical engineer who enjoys owning and improving Solace-based messaging ...

Site Reliability Engineer

Hiring Organisation
Networking People (UK) Limited
Location
London, United Kingdom
Employment Type
Contract, Work From Home
Contract Rate
£400 - £405 per day + + travel expenses
physical infrastructure (Service Mesh) - traffic management, security, and observability Nice to Have Windows (Advanced) - in-depth administration and troubleshooting Ansible - configuration management and automation Prometheus - monitoring and alerting solutions Optional/Beneficial Experience Storage Technologies & Hardware - enterprise storage systems and concepts Firewall Hardware - deployment and operational experience Cilium - eBPF-based ...

OpenShift Telemetry Engineer

Hiring Organisation
Stackstudio Digital Ltd
Location
London, United Kingdom
Employment Type
Contract, Work From Home
Contract Rate
From £450 to £500 per day
data pipelines with Kafka (producers/consumers, schema registry, Kafka Connect/KSQL/Stream). Proficiency with OpenShift/Kubernetes telemetry (Open Telemetry, Prometheus) and CLI tooling. Experience integrating telemetry into Splunk (HEC, UF, source types, CIM), building dashboards and alerting. Strong data engineering skills in Python (or similar ...

Solace Administrator

Hiring Organisation
BGC Group
Location
City of London, London, United Kingdom
reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging … related incidents, including root cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration ...

Senior Site Reliability Engineer

Hiring Organisation
Stratospherec Ltd
Location
London, United Kingdom
Employment Type
Permanent
Salary
£80000 - £100000/annum Excellent Benefits package
looking for a DevOps Engineer with a strong understanding of C# code combined with experience of monitoring tools like DataDog, Grafana and Prometheus to join a growing global Cloud Infrastructure team supporting SaaS products. Our client are a Global Digital SaaS Software Company have a fantastic fully remote opportunity … Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar. Proven track record of maintaining highly-available and performant production environments. Ability to identify and implement effective mitigation strategies and operational playbooks. ...