Employment Type

Any 29
Permanent 29

Remote Jobs

Hybrid/WFH 9

Sort By

Relevance
Date

Locations

London 238

Job Titles

1 to 25 of 29 Permanent Prometheus Jobs in East London

DevOps Engineer

South East London, England, United Kingdom

GlobalLogic

Deploy, UrbanCode etc. • Containers – Docker, Kubernetes, Mesosphere etc. • Configuration Management – Ansible, Chef, Puppet etc. • Cloud – AWS preferred; multi clould experience ie with Azure, GCP etc. highly desirable • Monitoring – ELK, Prometheus, Splunk etc. • Experience in one of the following scripting language: Java, Bash, Python, Powershell, Golang, etc. • Experience working with Linux and/or Windows systems About you (ideally): • Demonstrate a More ❯

Posted: Today

Cloud Native DevOps Engineer (AWS)

South East London, England, United Kingdom
Hybrid / WFH Options

LHH

Kubernetes. • Strong scripting skills in Python, Bash, or PowerShell for automation. • Understanding of AWS networking concepts, including VPCs, subnets, security groups. • Experience with monitoring and logging solutions, such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. • Familiarity with Zero Trust security models and best practices for securing cloud workloads. • Ability to troubleshoot complex infrastructure issues and optimize cloud deployments. Your More ❯

Posted: Yesterday

Senior DevOps Engineer (SC Cleared)

South East London, England, United Kingdom
Hybrid / WFH Options

Amber Labs

teams using tools like Git , Jira , and Confluence Eligible for SC and NPPV3 clearance Desirable: Container orchestration with Kubernetes HashiCorp tools: Vault , Consul , Packer Monitoring and observability with Grafana , Prometheus , or similar Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for technology Strong problem-solving mindset Collaborative team player More ❯

Posted: 3 days ago

Site Reliability Engineer

South East London, England, United Kingdom
Hybrid / WFH Options

Explore Group

on production support Tech Stack Cloud: AWS (EKS, ECS, RDS, IAM, Lambda, etc.) IaC: Terraform, Terragrunt Containerisation: Docker, Kubernetes (EKS) CI/CD: GitHub Actions, Argo CD, Helm Monitoring: Prometheus, Grafana, CloudWatch, OpenTelemetry Languages: Python, Bash, Go (bonus) What We're Looking For Strong experience in SRE, DevOps, or Production Engineering roles Proven hands-on skills with AWS , Terraform , and More ❯

Posted: 2 days ago

Senior Software Engineer – Quant Full Stack & Infrastructure (Team Lead)

South East London, England, United Kingdom

Trireme

Production experience with Kubernetes and cloud-native deployment strategies. Hands-on with AWS, GCP, and Azure for compute, networking, and storage configurations. Familiarity with monitoring/logging tools (e.g., Prometheus, Grafana, ELK stack). Trading Systems & Finance: Solid understanding of trading infrastructure, latency optimization, execution systems, and market data feeds. Experience working in or with quantitative research, HFT, or hedge More ❯

Posted: 4 days ago

Senior Site Reliability Engineer - AWS Kubernetes

South East London, England, United Kingdom

SGI

Experience with load balancers (F5, HAProxy, Nginx) and network monitoring tools. Experience in DNS management and troubleshooting. Experience in network security best practices. Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disaster recovery and … dstat for monitoring storage and system performance. Experience in tools like df, du, lsblk, and fdisk for managing and troubleshooting file systems and disk partitions. Familiarity with tools like Prometheus and Grafana for monitoring and observability More ❯

Posted: Yesterday

Messaging Administator - Solace

East London, London, United Kingdom

Marlin Selection

and software brokers across cloud and on-prem platforms Responding to production incidents and working on root cause analysis and long-term fixes Monitoring system health and performance with Prometheus, Grafana, and custom dashboards Optimising Solace across WAN environments for secure, low-latency message delivery Partnering with development and support teams to troubleshoot integration and message flow issues Driving capacity … continuity What Were Looking For: 3+ years hands-on experience with Solace PubSub+ in a production environment Strong knowledge of WAN-based distributed systems and networking fundamentals Experience with Prometheus and Grafana for observability and alerting Confident in Linux/Unix systems and scripting (Bash, Python, etc.) Excellent problem-solving instincts and attention to detail Strong communicator who works well More ❯

Employment Type: Permanent

Posted: 10 days ago

Messaging Administrator - Solace

South East London, England, United Kingdom

Marlin Selection Recruitment

and software brokers across cloud and on-prem platforms Responding to production incidents and working on root cause analysis and long-term fixes Monitoring system health and performance with Prometheus, Grafana, and custom dashboards Optimising Solace across WAN environments for secure, low-latency message delivery Partnering with development and support teams to troubleshoot integration and message flow issues Driving capacity … What We’re Looking For: 3+ years’ hands-on experience with Solace PubSub+ in a production environment Strong knowledge of WAN-based distributed systems and networking fundamentals Experience with Prometheus and Grafana for observability and alerting Confident in Linux/Unix systems and scripting (Bash, Python, etc.) Excellent problem-solving instincts and attention to detail Strong communicator who works well More ❯

Posted: 4 days ago

Solace Messaging Administrator

South East London, England, United Kingdom

H&P Executive Search

reliability across production and non-production environments. You will be working on incident response, capacity planning, WAN optimization, and system observability so should have experience with tools such as Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers Provide production support for messaging-related incidents, including root cause analysis and resolution. Monitor system performance … and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning , scaling, and tuning of Solace infrastructure to meet current and future demand. Automate routine maintenance tasks and … years of experience administering Solace PubSub+ messaging systems. Strong background in production support , preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN Solid experience with Prometheus and Grafana Proficiency in troubleshooting Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix systems and scripting Beneficial skills: Experience with containerized environments such as More ❯

Posted: Yesterday

IT Operations/Infrastructure Engineer

South East London, England, United Kingdom

Tencent

support, incident management, and ITSM processes (ServiceNow/Jira/Slack) Collaborate on DevOps initiatives: CI/CD (Jenkins/Terraform), containers (Docker/K8s), and monitoring (Grafana/Prometheus) Employee Lifecycle & Security Automate onboarding/offboarding workflows (account provisioning, access controls) Partner with HR/InfoSec to enforce IAM policies and compliance standards Continuous Improvement Develop IT operational playbooks More ❯

Posted: 3 days ago

DevOps Engineer - Crypto - Fully Remote

East London, London, United Kingdom
Hybrid / WFH Options

Oliver Bernard

/technologies as possible: AWS Cloud and AWS Services Containerisation with Docker and/or Kubernetes Terraform Strong CI/CD (GitOps, ArgoCD, CircleCI etc) knowledge Monitoring experience with Prometheus and Grafana Linux and Network concepts Front Office trading exerience is a must The role can offer remote working anywhere in the UK. More ❯

Posted: Yesterday

DevOps Engineer - Crypto - Fully Remote

South East London, England, United Kingdom
Hybrid / WFH Options

Oliver Bernard

Posted: Yesterday

Linux Platform Engineer

South East London, England, United Kingdom

Stanford Black Limited

skills (or Go/Rust) Experience with configuration management (Chef/Ansible) Distributed storage expertise (NFS, GPFS, WEKA) Cloud platform experience (AWS/GCP) Observability tools knowledge (ELK stack, Prometheus, Grafana, Datadog) Modern development practices (version control, CI/CD, agile) Engineering degree preferred Excellent communication skills Financial Services experience - Investment Banking or Hedge Fund preferred Fast-paced, collaborative environment More ❯

Posted: Yesterday

Senior Linux System Administrator

South East London, England, United Kingdom

NineTech

for CI/CD processes. Operate and maintain Kafka clusters for real-time data pipelines. Diagnose and resolve issues across systems, networks, containers, and applications. Use observability tools (Grafana, Prometheus, Kibana, Elasticsearch) to monitor system health. Automate system management tasks using Ansible. Participate in an on-call rotation to support global operations. Required Skills & Experience: Strong hands-on Linux (RHEL … managing Kubernetes clusters. Proficiency with GitLab for version control and CI/CD workflows. Solid understanding of Kafka in high-throughput environments. Experience with observability tools such as Grafana, Prometheus, Kibana, and Elasticsearch. Expertise in Ansible for automation and configuration management. Strong problem-solving skills across infrastructure layers (compute, network, OS, containers). More ❯

Posted: 4 days ago

Cloud Platform Architect

South East London, England, United Kingdom

Ubique Systems

CI/CD pipelines and DevOps methodologies. Solid understanding of networking concepts (TCP/IP, DNS, VPNs, firewalls). Experience with monitoring and logging tools (e.g., GCP Operations Suite, Prometheus, Grafana). Strong problem-solving, debugging, and analytical skills. More ❯

Posted: Yesterday

Cloud Platforms & Infra Architect

South East London, England, United Kingdom

PURVIEW

Posted: Yesterday

Cloud Platforms & Infra Architect (Retail domain / Teleco Domain must)

South East London, England, United Kingdom

Ubique Systems

Posted: Yesterday

Platform Engineer

South East London, England, United Kingdom

Insight Global

common patterns and implementing best practices Exposure to secrets management platforms (e.g., HashiCorp Vault) Familiarity with infrastructure as code using Terraform Experience with monitoring, logging, and security tools (e.g., Prometheus, Grafana, and BQL) Expertise in containerization and orchestration using Kubernetes for deployments Experience working with high-availability systems architecture and the ability to support critical scalable and robust systems Bachelor More ❯

Posted: Yesterday

Advance Node JS Developer

South East London, England, United Kingdom
Hybrid / WFH Options

HOK Consulting - Technical Recruitment Consultancy

Ansible Strong debugging, testing, and performance tuning skills Nice to Have: Experience with event-driven architecture and message queues (e.g., Pub/Sub, Kafka) Familiarity with observability tools (e.g., Prometheus, Grafana, Stackdriver) Understanding of security best practices in microservices and API development Experience working in Agile/Scrum environments More ❯

Posted: Today

Site Reliability Engineer

South East London, England, United Kingdom
Hybrid / WFH Options

Unitary

you if: Have worked with visualisation tools such as Grafana for creating and maintaining dashboards that provide meaningful insights into system performance Are proficient with metrics platforms such as Prometheus, InfluxDB, or OpenTelemetry for collecting and analysing system data Have experience with incident management tools such as Incident.io for coordinating response efforts and recording follow-up learnings and actions Can More ❯

Posted: Today

Software Developer - Storage Developer

South East London, England, United Kingdom

Squarepoint

more programming languages (Go, Rust, C++, Java) Proven experience in troubleshooting and resolving complex issues in large scale backend system Experience with observability stack (ex. Elasticsearch, Logstash, Kibana, Datadog, Prometheus, Grafana) and Infrastructure-as-code (ex. Terraform) Experience with building platform solutions/services on top of major cloud providers (GCP, AWS) is a plus Experience with building and operating More ❯

Posted: Yesterday

Senior Production Engineer - Fintech/Digital Assets

South East London, England, United Kingdom

Tempest Vane Partners

roles. Strong experience with Java (Spring) and cloud platforms (ideally Azure ). Proven track record in building and maintaining mission-critical systems. Deep understanding of Kubernetes, observability tooling (Grafana, Prometheus, ELK, etc.), and Infrastructure as Code (Terraform, Bicep). Ability to lead technical conversations across Engineering and Product. Bonus points if you bring: Experience in fintech, crypto, or regulated digital More ❯

Posted: Today

Software Engineer - Infrastructure / Observability

South East London, England, United Kingdom

SGI

suit a software engineer who cares about clean, testable code and good software practices, but prefers working in the infra/tooling space. What you’ll be doing: Writing Prometheus exporters and integrations for infrastructure systems Building out dashboards and monitoring pipelines in Grafana and Prometheus Developing infrastructure-as-code tooling (Terraform, Ansible) Designing well-structured, testable software that improves … system visibility What they’re looking for: Strong software engineering skills (Go or Python preferred) Experience working in or alongside platform engineering teams Familiarity with modern observability tools (Grafana, Prometheus, etc.) Comfort working across both code and infrastructure – but this is not a pure ops/SRE role If you've worked in finance that would be great but not More ❯

Posted: 2 days ago

Senior System Engineer

South East London, England, United Kingdom

Capital Markets Recruitment

System Engineer within financial services Know how to write good code (Go, Python, Bash, etc.). Know how to use virtualization (Docker, KVM, etc.). Familiar with monitoring systems (Prometheus, Grafana, etc.). Know about networking hardware (switches, routers). If this opportunity is of interest, please reach out to Daniel O'Connell directly on LinkedIn or email at daniel.oconnell More ❯

Posted: 4 days ago

Cloud Database Administrator -PostgreSQL / MS SQL / Oracle

South East London, England, United Kingdom

Vallum Associates

cloud-native tools and scripting (e.g., Terraform, Ansible, AWS RDS/Aurora tools, Azure SQL automation). Monitoring & Health Checks: Utilize tools such as CloudWatch, Azure Monitor, OEM, or Prometheus to monitor performance and availability. Troubleshooting & Root Cause Analysis: Diagnose and resolve database incidents; conduct RCAs for critical incidents and outages. Collaboration: Work closely with DevOps, Application, and Security teams More ❯

Posted: 4 days ago