51 to 71 of 71 Grafana Jobs in London

Senior Lead Software Engineering - AI/ML Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
managing incident resolution Experienced in observability, including white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, and Splunk Strong understanding of SLI/SLO/SLA and Error Budgets Proficient in Python or PySpark for AI/ML modeling ...

Database Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
enthusiast: enjoy the challenge of multi‐tenant, multi‐region, multi‐cloud scenarios with rigorous data integrity. Security & Observability mindset: build deep observability (Prometheus/Grafana/OpenTelemetry/Humio) and guardrails for secure operation. Engineering via code: deliver backend services in Java with clean relational modeling and performant DDL. Interview ...

Test Automation Engineer - Kingston, Surrey - Hybrid - £65,000

Hiring Organisation
Ashdown Group
Location
Kingston Upon Thames, Surrey, South East, United Kingdom
Employment Type
Permanent
Salary
£65,000
certification). Experience in non-functional testing disciplines including performance, scalability, resilience, failover/DR, and security testing. Familiarity with tools such as: JMeter Grafana Azure Monitor Azure Load Testing Burp Suite Experience leveraging AI-driven testing techniques and intelligent automation tools. Personal Attributes Methodical, analytical, and detail-oriented. Passionate ...

Lead Engineer (Routing Squad)

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Backend languages: Python, Go Tech infrastructure: AWS, CDK TypeScript, Lambda, SQS, EventBridge, RDS, DynamoDB Data tooling: GCP, BigQuery, Looker, Looker Studio Observability: Loki, Tempo, Grafana, Prometheus Event-driven architecture and domain-driven design How we reward our team Dynamic hybrid working environment with a diverse and driven team Huge opportunity ...

Senior Software Engineer / SRE - Electronic Trading

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Reliability or Production Engineering role. Deep knowledge of system health assessment and building effective alerting. Hands‐on experience with monitoring tools (e.g., Grafana, Humio) and chaos engineering. Familiarity with leveraging Generative AI (e.g., GitHub Copilot, Gemini) to accelerate development. Experience with big data technologies like Apache Spark or Amazon S3. ...

Machine Learning Systems & Infrastructure Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
pipelines, including self‐hosted GPU runners. Observability and reliability: Monitoring, logging, and alerting for job performance, data‐pipeline health, and cost (e.g., Prometheus/Grafana, OpenTelemetry); define SLOs and incident response for the systems you own. Security and access: Manage secrets, IAM, and network boundaries (e.g., Tailscale, cloud … caching layers. Familiarity with ML workflow orchestration and experiment tracking (e.g., Kubeflow Pipelines, MLflow). Experience with monitoring and observability tooling (e.g., Prometheus/Grafana, OpenTelemetry) and CI/CD for infra and ML workflows (e.g., GitHub Actions). At SpAItial, we are committed to creating a diverse and inclusive ...

Monitoring & Observability Engineer

Hiring Organisation
COMPUTACENTER (UK) LIMITED
Location
South East London, London, United Kingdom
Employment Type
Permanent
through proactive insight and incident prevention. What you'll do Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues Integrate monitoring platforms with ITSM … designs Proactively identify and highlight risks that could impact solution success What you'll need Strong experience deploying and managing observability platforms including Dynatrace, Grafana, and/or Splunk Deep understanding of telemetry signal analysis and performance monitoring Experience integrating observability tools with ITSM platforms and DevOps toolchains Ability ...

Customer Success Manager

Hiring Organisation
Harrington Starr
Location
City of London, London, United Kingdom
time or high-availability environments Hybrid infrastructure estates Technical customer advisory or post-sales consulting Monitoring tooling such as Geneos, Dynatrace, Datadog, Splunk, AppDynamics, Grafana, Prometheus, Elastic, New Relic, or similar platforms Why This Role Stands Out Opportunity to work with globally recognised Financial Services institutions Complex, technically sophisticated customer ...

Software Engineer (Graduate to Experienced)

Hiring Organisation
ECM Selection (Holdings) Limited
Location
London, United Kingdom
Employment Type
Permanent
Salary
£40000 - £100000/annum DoE
company’s tech stack is Rust, Flutter/Dart, and Postgres – experience with these is highly beneficial. Additionally, any exposure with gRPC, Arrow, Prometheus, Grafana, or Docker would be desirable. As the sector is in aviation, any personal interest in this evidenced through flying lessons, flight simulators etc… would ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
North London, London, United Kingdom
Employment Type
Permanent
Salary
£50,000
both technical and non-technical teams Desirable qualifications Microsoft certifications (AZ-900, AZ-104, AZ-305, AZ-500) or similar Experience with LogicMonitor admin, Grafana or other observability tools Familiarity with SRE concepts (SLIs, SLOs, error budgets) Understanding of ITIL processes Who are Solus? Solus, who are owned by Aviva ...

Integration Developer FTC

Hiring Organisation
itecopeople
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£60,000
data engineers, and stakeholders Technology Stack Kafka/Redpanda Docker & Kubernetes Microsoft Azure REST APIs & webhooks CI/CD & Infrastructure as Code OpenTelemetry, Prometheus & Grafana Required Skills Strong software engineering background Experience building integration or event-driven platforms Kafka, Redpanda, or similar streaming technologies Enterprise system integrations and API design … Agile development experience Strong communication and collaboration skills Desirable Skills Go and/or Python CDC pipeline development Azure cloud experience Observability tooling (Prometheus, Grafana, OpenTelemetry) Experience within regulated environments What's on Offer Hybrid working - 2 days per week in London Salary up to £60,900 Generous pension ...

London-Based Observability TAM - Drive Real-Time Data Value

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
sales journeys, engaging with stakeholders from software engineers to executives, and troubleshooting complex integrations. Candidates should have hands-on experience with observability tools like Grafana, DataDog, or Splunk and a strong communication focus. This position offers an opportunity in a high-growth environment, great benefits, and potential stock options. #J ...

DC Operations Engineer

Hiring Organisation
Hamilton Barnes
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
GBP 400 - 420 Daily
with our partners Completing Level 1 support tickets Completing network audits to ensure compliance to standards What You Will Ideally Bring Previous usage of Grafana dashboards. Ideally Victoria metrics too, or at least Prometheus CCNP Level understanding of Routing and Switching protocols Understanding of BGP troubleshooting Highly organised and motivated ...

Platform Engineer

Hiring Organisation
UA Consulting
Location
City of London, London, United Kingdom
Employment Type
Contract
Contract Rate
From £300 to £400 per day
/Aurora, S3). Develop and maintain Infrastructure as Code using Terraform and configuration management with Ansible. Enhance monitoring, logging, and alerting using the Grafana stack (Prometheus, Loki, Tempo). Participate in incident management, root cause analysis, and post-incident reviews. Implement automation to reduce manual operational tasks and improve … workloads (EKS, EC2, RDS/Aurora, S3, IAM). Infrastructure as Code with Terraform and configuration management with Ansible. Strong experience with observability tools (Grafana, Prometheus, Loki, Tempo). Understanding of SRE concepts (SLIs, SLOs, error budgets, capacity planning). Comfortable working in incident and problem management processes. Strong GitOps ...

Platform Engineer

Hiring Organisation
UA Consulting
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£75,000
/Aurora, S3). Develop and maintain Infrastructure as Code using Terraform and configuration management with Ansible. Enhance monitoring, logging, and alerting using the Grafana stack (Prometheus, Loki, Tempo). Participate in incident management, root cause analysis, and post-incident reviews. Implement automation to reduce manual operational tasks and improve … workloads (EKS, EC2, RDS/Aurora, S3, IAM). Infrastructure as Code with Terraform and configuration management with Ansible. Strong experience with observability tools (Grafana, Prometheus, Loki, Tempo). Understanding of SRE concepts (SLIs, SLOs, error budgets, capacity planning). Comfortable working in incident and problem management processes. Strong GitOps ...

AWS Cloud Engineer / Site Reliability Engineer (SRE)

Hiring Organisation
GTC Recruitment
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£600 - £662/day
e.g., Amazon EC2, Amazon EKS) Managing and improving CI/CD pipelines using GitLab CI Implementing monitoring, alerting, and observability with Amazon CloudWatch and Grafana Automating infrastructure, deployments, and operational processes Managing live environments and ensuring high availability and performance Troubleshooting incidents and resolving complex production issues Collaborating with engineering … Experience building and maintaining CI/CD pipelines (e.g., GitLab CI) Proven ability to monitor and troubleshoot systems using Amazon CloudWatch and/or Grafana Strong understanding of networking fundamentals (TCP/IP, TLS, routing, firewalls) Experience working in live production environments Solid scripting or programming skills (e.g., Python, Bash ...

Solace Messaging Administrator

Hiring Organisation
Searchability
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £130,000 per annum
supporting enterprise production environments Experience with Solace PubSub+ appliances and software brokers Strong understanding of distributed systems and WAN environments Experience with Prometheus and Grafana monitoring tools Linux/Unix administration and scripting experience Strong troubleshooting and analytical problem-solving skills Experience supporting low latency, high throughput messaging systems … submit (subject to required skills) your application to our client in conjunction with this vacancy only. KEY SKILLS Solace, PubSub+, Messaging Administrator, Linux, Prometheus, Grafana, WAN, Low Latency Systems, Distributed Systems, Python, Bash, Infrastructure, Production Support, Kafka, RabbitMQ, Docker, Kubernetes, AWS, Azure, Messaging Systems ...

Solace Messaging Administrator

Hiring Organisation
Searchability (UK) Ltd
Location
City of London, London, United Kingdom
Employment Type
Permanent
supporting enterprise production environments Experience with Solace PubSub+ appliances and software brokers Strong understanding of distributed systems and WAN environments Experience with Prometheus and Grafana monitoring tools Linux/Unix administration and scripting experience Strong troubleshooting and analytical problem-solving skills Experience supporting low latency, high throughput messaging systems … submit (subject to required skills) your application to our client in conjunction with this vacancy only. KEY SKILLS Solace, PubSub+, Messaging Administrator, Linux, Prometheus, Grafana, WAN, Low Latency Systems, Distributed Systems, Python, Bash, Infrastructure, Production Support, Kafka, RabbitMQ, Docker, Kubernetes, AWS, Azure, Messaging Systems ...

Remote Network Monitoring Specialist - Streaming Telemetry

Hiring Organisation
Akkodis
Location
London, United Kingdom
Employment Type
Permanent
Salary
£70000 - £75000/annum
performance baselining and operational handover. The client is open to different monitoring backgrounds, particularly where candidates have worked with tools such as VictoriaMetrics, Prometheus, Grafana, Nagios, Zabbix, InfluxDB, Telegraf, SolarWinds, PRTG, Datadog, Elastic, OpenTelemetry, SNMP, NetFlow/IPFIX or syslog pipelines. You will work closely with network engineering and operational … Build monitoring capability that provides clear visibility of network health, performance and service availability. Work with monitoring and observability platforms such as VictoriaMetrics, Prometheus, Grafana, Nagios, Zabbix, InfluxDB, SolarWinds, PRTG, Datadog, Elastic or similar. Support metrics ingestion, retention, alerting, dashboarding and performance visibility. Build or support streaming telemetry pipelines ...

Senior Software Engineer/SRE - TRAX Observability

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
analysis, and a distributed trace pipeline (Argo, Spark, Solr) that processes large‐scale data for deep investigation. We also leverage tools such as Humio, Grafana, and MetricTank to support observability across the department. What’s in it for you? Learning & Technical Growth Work alongside experienced senior engineers with deep expertise … welcome) Knowledge of Unix/Linux fundamentals (or strong willingness to learn) Familiarity with observability concepts (e.g., distributed tracing, logging, metrics, tools such as Grafana or similar) Understanding of distributed systems concepts (replication, partitioning, scalability, messaging, state management) and eagerness to deepen that knowledge We would love to see: Experience ...

Remote Network Monitoring Engineer - VictoriaMetrics

Hiring Organisation
Akkodis
Location
London, United Kingdom
Employment Type
Permanent
Salary
£70000 - £75000/annum
working with VictoriaMetrics in a production environment, including configuration, optimisation, ingestion, retention and performance tuning. You will also work across streaming telemetry, Nagios, Grafana and wider observability tooling. This would suit someone with strong network monitoring experience who is comfortable taking ownership of a critical technical workstream in a project … telemetry pipelines to provide real-time visibility across the network. Implement and manage Nagios-based monitoring for alerting and service health. Develop dashboards in Grafana, or similar, to support engineering and operational teams. Commission monitoring across network devices, access infrastructure and Layer 1-3 equipment. Define baseline performance metrics, thresholds ...