20 of 20 Permanent Grafana Jobs in Central London

DevOps Engineering Intern

Hiring Organisation
Hireshire
Location
City of London, London, United Kingdom
Support infrastructure automation using Infrastructure-as-Code (IaC) tools such as Terraform or CloudFormation. Monitor system health, performance, and uptime using tools like Prometheus, Grafana, and ELK Stack. Write automation scripts in Python or Bash to streamline repetitive tasks. Collaborate with developers to help troubleshoot basic deployment and build issues. ...

DevOps Engineer / Linux Systems Administrator

Hiring Organisation
N P Associates
Location
City of London, London, England, United Kingdom
Employment Type
Full-Time
Salary
£85,000 - £110,000 per annum
Extensive proven experience with AWS network and security implementations and management. Experience administering and maintaining systems monitoring/alerting technologies (CloudWatch, Splunk, Nagios, Rapid7, Grafana etc.). Proven experience with containerisation - Docker/Kubernetes/ECS/ECR, etc., Database administration - MySQL, QuestDB, Elasticsearch. Experience with multiple cloud providers ...

DevOps / Platform Engineer

Hiring Organisation
Locai Labs
Location
City of London, London, United Kingdom
relational databases in production environments (e.g., Postgres, MySQL), including basic performance troubleshooting, migrations, backups, and access control. Familiarity with observability tools such as Prometheus, Grafana, ELK stack, or OpenTelemetry Experience with container orchestration platforms, particularly Kubernetes Ability to systematically troubleshoot and debug distributed systems Comfortable reading, modifying, and writing code ...

Principal DevOps Engineer

Hiring Organisation
TEC Partners - Technical Recruitment Specialists
Location
City of London, London, United Kingdom
CodePipeline. Strong scripting skills (e.g., Bash, Python, or PowerShell) for automation and tooling. Familiarity with monitoring and log management tools (e.g., Prometheus, Grafana, ELK stack). Knowledge of networking concepts and security best practices. Excellent problem-solving and troubleshooting skills. Strong communication and collaboration abilities, with a passion for working ...

Site Reliability Engineer

Hiring Organisation
Revybe IT Recruitment Ltd
Location
City of London, London, England, United Kingdom
Employment Type
Full-Time
Salary
£65,000 - £75,000 per annum
platform engineering is done as the team continues to scale. Tech stack AWS (Core services - EC2, RDS, S3, IAM, etc.) Monitoring and Observability Grafana, Prometheus Kubernetes (building and managing production clusters) Terraform (IaC provisioning) Python, Bash or Go (scripting, automation) GitHub Actions (CI/CD pipelines) What They’re Looking ...

Senior DevOps Engineer - ArgoCD/GitOps

Hiring Organisation
Tec Partners
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£75000 - £85000/annum
GitHub Actions, GitLab CI, or similar) Solid Linux and scripting skills Nice to Have EKS at scale, Helm, multi-account AWS Observability tools (Prometheus, Grafana, CloudWatch) AWS or Kubernetes certifications ...

Systems/SRE Engineer

Hiring Organisation
Thurn Partners
Location
City of London, London, United Kingdom
more programming languages such as Python, Go, Ruby, or Perl. Strong experience with Linux system administration. Hands-on experience with observability tools like Prometheus, Grafana, Thanos, and the ELK stack. Familiarity with Kubernetes, Docker, AWS, and GCP. ...

Golang Engineer

Hiring Organisation
Oliver Bernard
Location
City of London, London, United Kingdom
Azure) Comfortable with CI/CD pipelines and infrastructure-as-code A proactive, collaborative mindset Nice to Have Experience with Helm, Terraform, Prometheus, or Grafana Knowledge of service meshes, event-driven systems, or Kafka Previous experience in high-scale or SaaS environments What We Offer £100-£120k base + equity ...

Full Stack Software Engineer

Hiring Organisation
Firenze
Location
City of London, London, United Kingdom
Interest in event-driven or distributed systems (Kafka, RabbitMQ, SQS). Exposure to DDD, CQRS, or hexagonal architectures. Experience with observability tools (OpenTelemetry, Prometheus, Grafana). Familiarity with multi-tenant SaaS, RBAC, or performance tuning (JVM, SQL). Why Join Us? Learn from experienced engineers: Work side-by-side with ...

Machine Learning Engineer

Hiring Organisation
Stott and May
Location
City of London, London, United Kingdom
SageMaker, Vertex AI) Experience with CI/CD pipelines and automation tools such as GitHub Actions Understanding of monitoring and logging tools (e.g., NewRelic, Grafana) Desirable Skills and Experience Prior experience deploying ML models in production environments Knowledge of infrastructure-as-code tools like Terraform or CloudFormation Familiarity with model ...

Site Reliability Engineer - SRE

Hiring Organisation
Sanderson Recruitment
Location
City of London, London, United Kingdom
Employment Type
Permanent
priority and root cause analysis programming experience Kubernetes and Docker Deploy and release services experience Experience with Greenfield projects ideally 6+ years relevant experience Grafana/Prometheus ideal Strong communication skills with the ability to proactively engage with a wide range of stakeholders If this sounds of interest ...

Principal Engineer

Hiring Organisation
Motive Group
Location
City of London, London, United Kingdom
Infrastructure-as-Code (Terraform) and configuration management tools (Ansible, Puppet, or similar). Strong observability experience using tools like Prometheus/Mimir, Loki, Tempo, Grafana, Alertmanager. Experience deploying and operating large-scale GPU clusters or HPC systems (Ideally). Working knowledge of ML infrastructure and familiarity with GPU drivers, CUDA ...

Technical Solutions Engineer - Deep-Tech AI Start-up

Hiring Organisation
Urban Digital Recruitment Ltd
Location
City of London, London, United Kingdom
+ edge devices Troubleshoot end-to-end: AI model behaviour, device integrations, networks, cloud infra, on-device performance Analyse logs, metrics and telemetry (Grafana, Metabase) to pinpoint root cause Work hands-on with Linux, SQL, Docker, AWS/GCP/Azure Lead pilots, rollouts and on-device testing across major ...

Solutions Architect

Hiring Organisation
SEEKR
Location
City of London, London, United Kingdom
JavaScript, C# Frameworks: React, Next.js, Node.js, .NET Testing: Vitest, Playwright, Pact, K6 Datastores: PostgreSQL, CosmosDB, Redis DevOps: GitHub Actions, Azure, Kubernetes, Docker, Terraform Monitoring: Grafana, Azure App Insights What’s On Offer £80,000–£120,000 salary, hybrid working (central London office), benefits, and a mandate to help shape ...

Solace Administrator

Hiring Organisation
BGC Group
Location
City of London, London, United Kingdom
across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related … incidents, including root cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. ...

Site Reliability Engineer

Hiring Organisation
Revybe IT Recruitment Ltd
Location
City of London, London, England, United Kingdom
Employment Type
Full-Time
Salary
£65,000 - £90,000 per annum
Stack Cloud: AWS (EC2, RDS, S3, IAM, Lambda, CloudWatch) Containerisation & Orchestration: Docker, Kubernetes (EKS) Infrastructure as Code: Terraform Configuration Management: Ansible Monitoring & Observability: Prometheus, Grafana, ELK Stack CI/CD: GitHub Actions Scripting & Automation: Python, Bash or Go What You’ll Be Doing Designing and maintaining reliable, scalable, and secure … running cloud infrastructure (AWS preferred) in production. Proven background in Kubernetes operations (EKS, Helm, or similar). Solid knowledge of monitoring, alerting, and logging (Grafana, Prometheus, ELK). Hands-on experience with Terraform and CI/CD tooling. Strong scripting or development background (Python, Go, or similar). Excellent troubleshooting ...

Application Support Engineer – Elite Systematic Trading Firm - Prop Trading - Market Leading Compensation - Hybrid - London

Hiring Organisation
Mondrian Alpha
Location
City of London, London, United Kingdom
tools like Ansible and Geneos. Experience with relational databases (Postgres, Sybase, Oracle) and writing performant SQL queries. Knowledge of production-grade monitoring tools (e.g., Grafana, Splunk), alert tuning, and system health validation. Understanding of low-latency infrastructure, colocation environments, and performance tuning (CPU affinity, NUMA). Strong communicator with ...

Senior Backend Engineer

Hiring Organisation
M-XR
Location
City of London, London, United Kingdom
asset storage, retrieval, and management systems (AWS S3) Build job queue management for async ML workflows (SNS, SQS) Setup application monitoring and logging (CloudWatch, Grafana, Prometheus) Implement CI/CD for application deployment (Bitbucket Pipelines) Create API documentation and developer tools What we are looking for 5+ years backend development ...

Senior C# Engineer - .Net Core, AWS Serverless Services, MySQL, APIs

Hiring Organisation
Smart Sourcer Limited
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£80,000
Senior C# Engineer - .Net Core, AWS Serverless Services, MySQL, APIs, Event Driven Outstanding permanent opportunity to join this global, market leading, B2B SaaS tech business as a Senior C# Backend Engineer with deep expertise C# ...

Site Reliability Engineer

Hiring Organisation
Global Fintech
Location
City of London, London, United Kingdom
maintain advanced reconciliation applications to ensure consistency across digital and traditional finance trade-capture processes. Develop and enhance monitoring dashboards and alerts using DataDog, Grafana, or similar technologies to proactively identify and address production issues, including end-to-end system latency. Build tooling and monitoring solutions to facilitate comprehensive post … familiarity with Kafka, CockroachDB, FastAPI, GraphQL, Snowflake, Redis, and QuestDB or equivalent technologies. Proven experience designing and implementing monitoring and alerting tools (DataDog, Grafana). Solid experience with AWS Cloud Infrastructure and related operational processes. Deep understanding of and experience troubleshooting REST APIs and WebSockets. Exposure to crypto, blockchain ...