20 of 20 Prometheus Jobs in the City of London

DevOps Engineer

Hiring Organisation
intro
Location
City of London, London, United Kingdom
Terraform, Ansible, and/or CloudFormation. Strong background in Docker and Kubernetes. Good understanding of networking fundamentals. Experience with monitoring and logging stacks (Datadog, Prometheus, Grafana, ELK, etc.). Strong communication skills and the ability to collaborate across engineering teams. Knowledge of compliance/security frameworks (PCI DSS, SOC2 ...

DevOps / Platform Engineer

Hiring Organisation
Locai Labs
Location
City of London, London, United Kingdom
with relational databases in production environments (e.g., Postgres, MySQL), including basic performance troubleshooting, migrations, backups, and access control. Familiarity with observability tools such as Prometheus, Grafana, ELK stack, or OpenTelemetry. Experience with container orchestration platforms, particularly Kubernetes. Ability to systematically troubleshoot and debug distributed systems. Comfortable reading, modifying, and writing ...

Principal DevOps Engineer

Hiring Organisation
TEC Partners - Technical Recruitment Specialists
Location
City of London, London, United Kingdom
CodePipeline. Strong scripting skills (e.g., Bash, Python, or PowerShell) for automation and tooling. Familiarity with monitoring and log management tools (e.g., Prometheus, Grafana, ELK stack). Knowledge of networking concepts and security best practices. Excellent problem-solving and troubleshooting skills. Strong communication and collaboration abilities, with a passion for working ...

Senior DevOps Engineer - ArgoCD/GitOps

Hiring Organisation
Tec Partners
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£75,000 - £85,000 per annum
tooling (GitHub Actions, GitLab CI, or similar). Solid Linux and scripting skills. Nice to Have: EKS at scale, Helm, multi-account AWS; observability tools (Prometheus, Grafana, CloudWatch); AWS or Kubernetes certifications ...

Platform Engineer

Hiring Organisation
Block MB
Location
City of London, London, United Kingdom
/CD tools and workflows (e.g., GitLab, Jenkins). Familiarity with container image management and security scanning. Knowledge of monitoring and logging stacks (e.g., Prometheus, Grafana, ELK). Scripting skills (e.g., Python, Bash) for automation and tooling. Experience working in an agile, collaborative environment. Desirable: Experience in the financial services ...

Site Reliability Engineer

Hiring Organisation
Revybe IT Recruitment Ltd
Location
City of London, London, England, United Kingdom
Employment Type
Full-Time
Salary
£70,000 - £85,000 per annum
platform engineering is done as the team continues to scale. Tech stack AWS (Core services - EC2, RDS, S3, IAM, etc.) Monitoring and Observability Grafana, Prometheus Kubernetes (building and managing production clusters) Terraform (IaC provisioning) Python, Bash or Go (scripting, automation) GitHub Actions (CI/CD pipelines) What They’re Looking ...

Software Engineer

Hiring Organisation
Opus Enterprise Ltd T/A Real Recruitment
Location
City of London, London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£80,000
Server. Understanding of cloud infrastructure (preferably AWS) and containerisation (Docker, Kubernetes). Familiarity with DevOps automation (CI/CD, Helm, Terraform) and monitoring tools (Prometheus, Grafana, AWS CloudWatch). Experience in production support, debugging, and troubleshooting applications. Knowledge of agile methodologies (scrum, Jira, Kanban, Confluence). Effective communication skills ...

Senior Full-Stack Engineer (Java & Python)

Hiring Organisation
McCabe & Barton
Location
City of London, London, United Kingdom
Employment Type
Permanent
ability to collaborate in agile, cross-functional teams. Desirable: Experience with infrastructure as code (Terraform, Helm). Familiarity with monitoring and logging tools (Prometheus, Grafana, ELK). Exposure to regulated environments and associated data controls. ...

Systems/SRE Engineer

Hiring Organisation
Thurn Partners
Location
City of London, London, United Kingdom
more programming languages such as Python, Go, Ruby, or Perl. Strong experience with Linux system administration. Hands-on experience with observability tools like Prometheus, Grafana, Thanos, and the ELK stack. Familiarity with Kubernetes, Docker, AWS, and GCP. ...

Golang Engineer

Hiring Organisation
Oliver Bernard
Location
City of London, London, United Kingdom
Azure) Comfortable with CI/CD pipelines and infrastructure-as-code A proactive, collaborative mindset Nice to Have Experience with Helm, Terraform, Prometheus, or Grafana Knowledge of service meshes, event-driven systems, or Kafka Previous experience in high-scale or SaaS environments What We Offer £100-£120k base + equity ...

Platform Engineer

Hiring Organisation
SR2 | Socially Responsible Recruitment | Certified B Corporation™
Location
City of London, London, United Kingdom
patterns Eligibility for SC clearance Nice to have AWS Landing Zone accelerator or framework experience Kubernetes/EKS and container platforms Observability tooling (Grafana, Prometheus, ELK, OpenTelemetry) Python, Go or similar for automation Experience working directly with AWS on architecture or delivery The nitty and gritty ...

Architect

Hiring Organisation
Hellowork Consultants
Location
City of London, London, United Kingdom
security, network policies Helm chart authoring, deployment strategies, custom charts, container registries Ingress controllers, API gateways, service mesh, and traffic policy enforcement Observability (Prometheus, Grafana), log pipelines, distributed tracing High availability, cluster upgrades, autoscaling strategy, performance tuning Lead advanced troubleshooting (pods, networking, DNS, controllers, storage, ingress). 5. Azure ...

Site Reliability Engineer

Hiring Organisation
Revybe IT Recruitment Ltd
Location
City of London, London, England, United Kingdom
Employment Type
Full-Time
Salary
£65,000 - £90,000 per annum
Tech Stack Cloud: AWS (EC2, RDS, S3, IAM, Lambda, CloudWatch) Containerisation & Orchestration: Docker, Kubernetes (EKS) Infrastructure as Code: Terraform Configuration Management: Ansible Monitoring & Observability: Prometheus, Grafana, ELK Stack CI/CD: GitHub Actions Scripting & Automation: Python, Bash or Go What You’ll Be Doing Designing and maintaining reliable, scalable … cloud infrastructure (AWS preferred) in production. Proven background in Kubernetes operations (EKS, Helm, or similar). Solid knowledge of monitoring, alerting, and logging (Grafana, Prometheus, ELK). Hands-on experience with Terraform and CI/CD tooling. Strong scripting or development background (Python, Go, or similar). Excellent troubleshooting skills ...

Site Reliability Engineer - SRE

Hiring Organisation
Sanderson Recruitment
Location
City of London, London, United Kingdom
Employment Type
Permanent
root cause analysis programming experience Kubernetes and Docker Deploy and release services experience Experience with Greenfield projects ideally 6+ years relevant experience Grafana/Prometheus ideal Strong communication skills with the ability to proactively engage with a wide range of stakeholders If this sounds of interest to you, please ring ...

Principal Engineer

Hiring Organisation
Motive Group
Location
City of London, London, United Kingdom
orchestration. A strong grasp of Infrastructure-as-Code (Terraform) and configuration management tools (Ansible, Puppet, or similar). Strong observability experience using tools like Prometheus/Mimir, Loki, Tempo, Grafana, Alertmanager. Experience deploying and operating large-scale GPU clusters or HPC systems (Ideally). Working knowledge of ML infrastructure ...

Hardware Infrastructure Engineer

Hiring Organisation
Levy Global
Location
City of London, London, United Kingdom
platforms: HPE, Dell, Supermicro, Lenovo Strong ownership mindset and documentation discipline Nice-to-Have Experience in low-latency, HPC, or financial environments Observability tooling (Prometheus, Grafana, VictoriaLogs) Firmware lifecycle programs at scale Python/Bash scripting Capacity planning or refresh program exposure ...

Solace Administrator

Hiring Organisation
BGC Group
Location
City of London, London, United Kingdom
reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging … related incidents, including root cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana; proactively identify and address anomalies. Configure and optimize Solace across WAN environments, ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration ...

Lead Data Engineer (Kafka/OpenShift)

Hiring Organisation
Synechron
Location
City of London, London, United Kingdom
Required Skills: Strong hands-on experience with Kafka (producers/consumers, schema registry, KSQL, Kafka Streams). Deep understanding of OpenShift/Kubernetes telemetry, Prometheus, OpenTelemetry, and CLI tooling. Experience integrating telemetry into Splunk (HEC, UF, sourcetypes, CIM) and creating dashboards/alerts. Proficiency in Python or similar languages ...

Senior Backend Engineer

Hiring Organisation
M-XR
Location
City of London, London, United Kingdom
storage, retrieval, and management systems (AWS S3) Build job queue management for async ML workflows (SNS, SQS) Setup application monitoring and logging (CloudWatch, Grafana, Prometheus) Implement CI/CD for application deployment (Bitbucket Pipelines) Create API documentation and developer tools What we are looking for 5+ years backend development experience ...

Head of Engineering

Hiring Organisation
Lightdash
Location
City of London, London, United Kingdom
polishing what nobody needs. Our tech stack: Primary: TypeScript, React, Node, SQL Frameworks: Express, React-hooks, Redux, RTK, Mantine Infrastructure: Docker, GCP, Kubernetes, Tracing, Prometheus While familiarity with our stack is helpful, we value your ability to learn and adapt over specific technical experience. ⚡ We’re deliberately looking for ambitious ...