1 to 25 of 79 Prometheus Jobs in the City of London
City of London, London, United Kingdom GlobalLogic
Deploy, UrbanCode etc. • Containers – Docker, Kubernetes, Mesosphere etc. • Configuration Management – Ansible, Chef, Puppet etc. • Cloud – AWS preferred; multi-cloud experience, i.e. with Azure, GCP etc., highly desirable • Monitoring – ELK, Prometheus, Splunk etc. • Experience in one of the following scripting languages: Java, Bash, Python, PowerShell, Golang, etc. • Experience working with Linux and/or Windows systems About you (ideally): • Demonstrate a …
City of London, London, United Kingdom Hybrid / WFH Options LHH
Kubernetes. • Strong scripting skills in Python, Bash, or PowerShell for automation. • Understanding of AWS networking concepts, including VPCs, subnets, security groups. • Experience with monitoring and logging solutions, such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. • Familiarity with Zero Trust security models and best practices for securing cloud workloads. • Ability to troubleshoot complex infrastructure issues and optimize cloud deployments. Your …
City of London, England, United Kingdom Hybrid / WFH Options VE3
networking (DNS, TCP/IP, VPN, firewalls). Knowledge of containerization technologies (Docker, ECS, EKS, or Kubernetes). Experience with monitoring/logging tools such as CloudWatch, ELK Stack, Prometheus/Grafana. Excellent problem-solving skills and the ability to work independently. Preferred Qualifications AWS Certified SysOps Administrator/DevOps Engineer – Professional. Experience with hybrid cloud/on-prem environments. …
City of London, London, United Kingdom Hybrid / WFH Options Amber Labs
teams using tools like Git, Jira, and Confluence Eligible for SC and NPPV3 clearance Desirable: Container orchestration with Kubernetes HashiCorp tools: Vault, Consul, Packer Monitoring and observability with Grafana, Prometheus, or similar Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for technology Strong problem-solving mindset Collaborative team player …
City of London, London, United Kingdom BGC Group
ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root … cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana; proactively identify and address anomalies. Configure and optimize Solace across WAN environments, ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning, scaling, and tuning of Solace infrastructure to meet current and … background in production support, preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN, with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management, performance tuning, and system scaling. Familiarity with Linux/Unix …
City of London, London, United Kingdom HCLTech
tools; Maven, Gradle or other build tools; Ansible or other IT Automation/software provisioning tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus etc. * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, New Relic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude; A creative approach towards solving technical problems …
City of London, London, United Kingdom Mastek
canary releases). Monitoring, Logging & Alerting: Implement comprehensive monitoring, logging, and alerting systems to proactively identify and address performance issues, errors, and security threats. Use tools like Azure Monitor, Prometheus, Grafana, or similar to collect and analyse metrics, logs, and traces. Configure alerts and notifications to ensure timely responses to critical events. Security & Compliance: Implement security best practices and controls …
City of London, London, United Kingdom ITR Partners
knowledge and hands-on practice in Observability, specifically experience working with one or more of the following tools - Kibana, OpenSearch, Grafana, Datadog, Sumo Logic, New Relic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFx, Instana, Splunk, Honeycomb, Jaeger. Hands-on experience with Infrastructure as Code (Terraform/Ansible). Hands-on experience in technical integrations (OpenTelemetry/fluentd/fluentbit/filebeat …
City of London, London, United Kingdom Hybrid / WFH Options Explore Group
on production support Tech Stack Cloud: AWS (EKS, ECS, RDS, IAM, Lambda, etc.) IaC: Terraform, Terragrunt Containerisation: Docker, Kubernetes (EKS) CI/CD: GitHub Actions, Argo CD, Helm Monitoring: Prometheus, Grafana, CloudWatch, OpenTelemetry Languages: Python, Bash, Go (bonus) What We're Looking For Strong experience in SRE, DevOps, or Production Engineering roles Proven hands-on skills with AWS, Terraform, and …
City of London, London, United Kingdom Trireme
Production experience with Kubernetes and cloud-native deployment strategies. Hands-on with AWS, GCP, and Azure for compute, networking, and storage configurations. Familiarity with monitoring/logging tools (e.g., Prometheus, Grafana, ELK stack). Trading Systems & Finance: Solid understanding of trading infrastructure, latency optimization, execution systems, and market data feeds. Experience working in or with quantitative research, HFT, or hedge …
City of London, England, United Kingdom JAM Recruitment
advantageous: Software development in web technologies or OOP (e.g., Python, Java, etc.) Database tech: Oracle SQL, PostgreSQL, MongoDB Proficient with Linux/Windows command line (Bash, PowerShell) Monitoring: Grafana, Prometheus, ELK, Splunk Agile working and tooling (e.g., Jira, Confluence) Diagnosing and resolving complex system issues ITIL knowledge or exposure to IT service operations Containerisation: Docker, Kubernetes, OpenShift Awareness of modern …
City of London, England, United Kingdom Hybrid / WFH Options Parser Limited
relevant tools. Security Best Practices: IAM, MFA, data encryption, firewall configurations. Programming/Scripting: Python, Terraform, or similar languages. Event-Driven Architectures: Kafka. Monitoring and Logging: Datadog, ELK Stack, Prometheus, etc. Experience in agile methodologies and DevOps practices. Location: Hybrid. Office located in London (Hayes area). Office presence required: Yes. Frequency: 2-3 times a week at the office. …
City of London, London, United Kingdom Marlin Selection Recruitment
and software brokers across cloud and on-prem platforms Responding to production incidents and working on root cause analysis and long-term fixes Monitoring system health and performance with Prometheus, Grafana, and custom dashboards Optimising Solace across WAN environments for secure, low-latency message delivery Partnering with development and support teams to troubleshoot integration and message flow issues Driving capacity … What We’re Looking For: 3+ years’ hands-on experience with Solace PubSub+ in a production environment Strong knowledge of WAN-based distributed systems and networking fundamentals Experience with Prometheus and Grafana for observability and alerting Confident in Linux/Unix systems and scripting (Bash, Python, etc.) Excellent problem-solving instincts and attention to detail Strong communicator who works well …
City of London, England, United Kingdom Whitehall Resources Ltd
high system availability, enabling rapid delivery through CI/CD, and supporting development teams with robust infrastructure and tooling. A key part of the role includes proactive monitoring using Prometheus, Grafana, and Splunk, as well as participating in on-call rotations to respond to live incidents. Collaboration across engineering, security, and product teams is essential to build scalable and resilient … cause analysis and preventive measures. 3. Handle change requests, track recurring issues, and work on long-term fixes to improve system stability. 4. Implement and maintain observability solutions using Prometheus, Grafana, and Splunk. 5. Write PromQL queries for custom monitoring dashboards, alerting, and diagnostics. 6. Manage and optimize CI/CD pipelines for automated testing, deployment, and rollback strategies. 7. … Engineer level 2. Incident, change & problem management experience. This role is heavily operations-oriented, including on-call requirements. 3. Strong background in setup & operation of enterprise observability tooling, specifically Prometheus, Grafana and Splunk, including usage of PromQL. 4. Proficient in one or more of Python, Go, Bash, SQL. 5. Familiar with GitHub/GitOps/container orchestration/Kubernetes …
City of London, London, United Kingdom NJF Global Holdings Ltd
workloads Implement and maintain DevOps tooling (Terraform, Ansible, GitLab CI/CD, Jenkins) Lead PoCs for new storage technologies and present results to technical leadership Support observability via Grafana, Prometheus, Splunk, and related platforms Contribute to containerization efforts with Docker and Kubernetes (preferred) What We’re Looking For: 8+ years of experience in storage systems administration and infrastructure/platform … Linux performance tuning, particularly in HPC or ML/AI contexts Programming/scripting experience in Python, Golang, or similar languages Familiarity with modern observability and monitoring tools (Grafana, Prometheus, Splunk) Experience supporting AI/ML modelling environments is highly desirable Knowledge of container and orchestration technologies (Docker, Kubernetes) is a plus Proactive, collaborative, and passionate about building world-class …
City of London, London, United Kingdom Caspian One
engineering experience in performance-critical environments Proficiency in Python and Bash scripting, with hands-on Ansible experience Solid networking fundamentals: IP addressing, VLANs, etc. Familiarity with observability tools like Prometheus, Grafana, and ELK Infrastructure-as-code experience with Terraform and CI/CD pipelines Proven ability to resolve complex system-level issues and performance challenges Knowledge of container orchestration tools …
City of London, London, United Kingdom Oliver Bernard
infrastructure across on-prem and AWS Administer and optimise Kubernetes clusters and containerised pipelines Implement and maintain Infrastructure as Code using Terraform Improve observability and resilience using tools like Prometheus Manage and monitor GitLab CI/CD pipelines for multi-platform builds (Linux, Windows, macOS) Collaborate with engineering teams to optimise developer workflows and apply DevOps best practices Set clear …
City of London, London, United Kingdom Ultralytics
by thought leaders like Martin Fowler. Hands-on experience building and maintaining complex CI/CD pipelines, preferably with GitHub Actions. Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana, Google Cloud's operations suite). A solid understanding of networking principles and cloud security best practices. Experience with other cloud platforms like Amazon Web Services (AWS) or Microsoft …
City of London, London, United Kingdom Hybrid / WFH Options Monument Technology
is a plus Experience of working on a microservice architecture hosted in Kubernetes; should also be familiar with the tooling used to observe/monitor these environments, such as Prometheus, Grafana, Zipkin, and Jaeger Proven experience of managing and securing AWS cloud environments with configuration of key services like AWS WAF, CloudWatch, CloudTrail, Kubernetes, GuardDuty, X-Ray, Control Tower, Security …
City of London, London, United Kingdom H&P Executive Search
reliability across production and non-production environments. You will be working on incident response, capacity planning, WAN optimization, and system observability so should have experience with tools such as Prometheus and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers Provide production support for messaging-related incidents, including root cause analysis and resolution. Monitor system performance … and health using Prometheus and Grafana; proactively identify and address anomalies. Configure and optimize Solace across WAN environments, ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning, scaling, and tuning of Solace infrastructure to meet current and future demand. Automate routine maintenance tasks and … years of experience administering Solace PubSub+ messaging systems. Strong background in production support, preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN Solid experience with Prometheus and Grafana Proficiency in troubleshooting Experience with capacity management, performance tuning, and system scaling. Familiarity with Linux/Unix systems and scripting Beneficial skills: Experience with containerized environments such as …
City of London, London, United Kingdom Hybrid / WFH Options Arcus Search
strong emphasis on automation, self-service, and operational excellence. Tech You'll Use: Azure & AWS (production experience) Kubernetes (EKS preferred) Terraform & GitHub Actions CI/CD, observability tooling (Grafana, Prometheus), containerisation (Docker) What You'll Be Doing: Designing and implementing secure, resilient AWS infrastructure Building CI/CD pipelines and reusable deployment patterns Advising on cloud-native app transformation and …
City of London, London, United Kingdom InfyStrat Software Services
Python, or Perl) and automation tools (Ansible, Puppet, or Chef). Solid understanding of networking, DNS, DHCP, firewalls, and troubleshooting tools. Experience with system monitoring tools (e.g., Nagios, Zabbix, Prometheus). Familiarity with backup and recovery processes. Desirable: Linux certifications (e.g., RHCSA, RHCE, LPIC) are highly desirable. Experience with containerization tools like Docker and orchestration with Kubernetes. Familiarity with ITIL …
City of London, London, United Kingdom Radley James
Strong technical skills in Linux/Unix systems, SQL, and scripting Strong experience with a programming language such as Python, Java, etc. Strong experience with monitoring and observability tools (Prometheus, Grafana, Splunk, Geneos, OpenTelemetry, Corvil) Familiarity with cloud platforms, containerization (e.g., Kubernetes, Docker), and CI (Continuous Integration)/CD (Continuous Delivery) pipelines Strong understanding of the trade lifecycle and fundamental …
City of London, London, United Kingdom Hybrid / WFH Options Annapurna
with CI/CD pipelines and container technologies like Docker and Kubernetes. Deep understanding of networking, distributed systems, and databases. Expertise in monitoring and observability tools such as Datadog, Prometheus, Grafana, ELK stack, or Splunk. Excellent communication skills and a meticulous approach to problem-solving. Desirable Experience: Familiarity with Azure. Experience working in the autonomous vehicle sector. Exposure to AI …
City Of London, England, United Kingdom Vallum Associates
tools. Proficient in Linux operating systems and shell scripting Strong understanding of CI/CD pipelines and tools (e.g., Jenkins, GitLab). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana). Soft Skills: Excellent problem-solving and troubleshooting abilities. Strong communication and collaboration skills, with the ability to work effectively with cross-functional teams. Ability to manage multiple priorities …
Salary Guide: Prometheus, the City of London
- 10th Percentile: £63,500
- 25th Percentile: £69,375
- Median: £82,500
- 75th Percentile: £96,875
- 90th Percentile: £99,750