1 to 25 of 26 Permanent Grafana Jobs in the Thames Valley

Site Reliability Engineer

Hiring Organisation
Response Informatics
Location
Oxford, Oxfordshire, UK
Employment Type
Full-time
public cloud environments (AWS, Azure, GCP). Solid understanding of Java and Spring Boot applications. Experience with monitoring, logging, and observability tools (Prometheus, Grafana, ELK, Splunk). Strong troubleshooting and problem-solving skills. Excellent communication and collaboration skills. Preferred Qualifications Experience in financial services or payments/transaction processing environments. ...

Site Reliability Engineer

Hiring Organisation
Response Informatics
Location
Reading, Berkshire, UK
Employment Type
Full-time
public cloud environments (AWS, Azure, GCP). Solid understanding of Java and Spring Boot applications. Experience with monitoring, logging, and observability tools (Prometheus, Grafana, ELK, Splunk). Strong troubleshooting and problem-solving skills. Excellent communication and collaboration skills. Preferred Qualifications Experience in financial services or payments/transaction processing environments. ...

Site Reliability Engineer

Hiring Organisation
Response Informatics
Location
Milton Keynes, Buckinghamshire, UK
Employment Type
Full-time
public cloud environments (AWS, Azure, GCP). Solid understanding of Java and Spring Boot applications. Experience with monitoring, logging, and observability tools (Prometheus, Grafana, ELK, Splunk). Strong troubleshooting and problem-solving skills. Excellent communication and collaboration skills. Preferred Qualifications Experience in financial services or payments/transaction processing environments. ...

DevOps / Platform Engineer

Hiring Organisation
Locai Labs
Location
Slough, Berkshire, UK
Employment Type
Full-time
relational databases in production environments (e.g., Postgres, MySQL), including basic performance troubleshooting, migrations, backups, and access control. Familiarity with observability tools such as Prometheus, Grafana, ELK stack, or OpenTelemetry Experience with container orchestration platforms, particularly Kubernetes Ability to systematically troubleshoot and debug distributed systems Comfortable reading, modifying, and writing code ...

DevOps Engineer

Hiring Organisation
CT19
Location
Oxford, England, United Kingdom
embedded and hybrid applications Automate infrastructure provisioning and configuration using Terraform, Ansible, or equivalent tools Monitor system performance, reliability, and observability using Prometheus, Grafana, and ELK stack Collaborate closely with embedded software engineers and hardware teams to ensure seamless integration of software updates Strengthen security, resilience, and compliance throughout ...

Principal DevOps Engineer

Hiring Organisation
TEC Partners - Technical Recruitment Specialists
Location
Slough, Berkshire, UK
Employment Type
Full-time
CodePipeline. Strong scripting skills (e.g., Bash, Python, or PowerShell) for automation and tooling. Familiarity with monitoring and log management tools (e.g., Prometheus, Grafana, ELK stack). Knowledge of networking concepts and security best practices. Excellent problem-solving and troubleshooting skills. Strong communication and collaboration abilities, with a passion for working ...

Senior MLOps Engineer

Hiring Organisation
algo1
Location
Slough, Berkshire, UK
Employment Type
Full-time
relational databases and data processing and query engines (Spark, Trino, or similar). Familiarity with monitoring, observability, and alerting systems for production ML (Prometheus, Grafana, Datadog, or equivalent). Understanding of ML concepts. You don't need to train models, but you should speak the language of Research Engineers ...

Senior Software Engineer - Backend

Hiring Organisation
Zettafleet
Location
Slough, Berkshire, UK
Employment Type
Full-time
platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Strong ownership mindset: You naturally take responsibility for outcomes, not just tasks, and care deeply about the quality of what you ship. Product-oriented ...

Senior Software Engineer - Backend

Hiring Organisation
Zettafleet
Location
Oxford, Oxfordshire, UK
Employment Type
Full-time
platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Strong ownership mindset: You naturally take responsibility for outcomes, not just tasks, and care deeply about the quality of what you ship. Product-oriented ...

Senior Software Engineer - Backend

Hiring Organisation
Zettafleet
Location
Milton Keynes, Buckinghamshire, UK
Employment Type
Full-time
platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Strong ownership mindset: You naturally take responsibility for outcomes, not just tasks, and care deeply about the quality of what you ship. Product-oriented ...

Systems/SRE Engineer

Hiring Organisation
Thurn Partners
Location
Slough, Berkshire, UK
Employment Type
Full-time
more programming languages such as Python, Go, Ruby, or Perl. Strong experience with Linux system administration. Hands-on experience with observability tools like Prometheus, Grafana, Thanos, and the ELK stack. Familiarity with Kubernetes, Docker, AWS, and GCP. ...

Kafka Data Architect(Streaming And Payment)

Hiring Organisation
IBU
Location
Slough, Berkshire, UK
Employment Type
Full-time
based encryption Tokenization where required Least-privilege IAM Immutable audit logging Observability, Reliability & FinOps Build observability for streaming and data platforms using: CloudWatch, Prometheus, Grafana Track operational KPIs: Throughput (TPS) Processing lag Success/error rates Cost per million events Define actionable alerts, dashboards, and operational runbooks. Design for high ...

Full Stack Software Engineer

Hiring Organisation
Firenze
Location
Slough, Berkshire, UK
Employment Type
Full-time
Interest in event-driven or distributed systems (Kafka, RabbitMQ, SQS). Exposure to DDD, CQRS, or hexagonal architectures. Experience with observability tools (OpenTelemetry, Prometheus, Grafana). Familiarity with multi-tenant SaaS, RBAC, or performance tuning (JVM, SQL). Why Join Us? Learn from experienced engineers: Work side-by-side with ...

Machine Learning Engineer

Hiring Organisation
Stott and May
Location
Slough, Berkshire, UK
Employment Type
Full-time
SageMaker, Vertex AI) Experience with CI/CD pipelines and automation tools such as GitHub Actions Understanding of monitoring and logging tools (e.g., NewRelic, Grafana) Desirable Skills and Experience Prior experience deploying ML models in production environments Knowledge of infrastructure-as-code tools like Terraform or CloudFormation Familiarity with model ...

Platform Engineer

Hiring Organisation
NJF Global Holdings Ltd
Location
Oxford, Oxfordshire, UK
Employment Type
Full-time
networking, containers, and low-level debugging Proficiency in at least one programming language (Go, Python, or Rust) Experience with modern observability tooling (Prometheus, ClickHouse, Grafana) Proven experience designing and operating complex distributed systems Experience with configuration management (SaltStack, Ansible, Puppet) Strong root-cause analysis skills with a blameless, data-driven ...

Platform Engineer

Hiring Organisation
NJF Global Holdings Ltd
Location
Reading, Berkshire, UK
Employment Type
Full-time
networking, containers, and low-level debugging Proficiency in at least one programming language (Go, Python, or Rust) Experience with modern observability tooling (Prometheus, ClickHouse, Grafana) Proven experience designing and operating complex distributed systems Experience with configuration management (SaltStack, Ansible, Puppet) Strong root-cause analysis skills with a blameless, data-driven ...

Platform Engineer

Hiring Organisation
NJF Global Holdings Ltd
Location
Milton Keynes, Buckinghamshire, UK
Employment Type
Full-time
networking, containers, and low-level debugging Proficiency in at least one programming language (Go, Python, or Rust) Experience with modern observability tooling (Prometheus, ClickHouse, Grafana) Proven experience designing and operating complex distributed systems Experience with configuration management (SaltStack, Ansible, Puppet) Strong root-cause analysis skills with a blameless, data-driven ...

DevOps Engineer

Hiring Organisation
Code Wizards Group
Location
Slough, Berkshire, UK
Employment Type
Full-time
Unity, Unreal etc. An excitement to expand your knowledge on unfamiliar infrastructure and tools Automation knowledge on a wide variety of tools – Ansible, Terraform, Grafana, and Kubernetes, etc. Prior experience migrating a game from one Cloud provider onto AWS Excellent knowledge of containers, virtual machines and Linux server administration Demonstrable ...

DevOps Engineer

Hiring Organisation
Code Wizards Group
Location
High Wycombe, Buckinghamshire, UK
Employment Type
Full-time
Unity, Unreal etc. An excitement to expand your knowledge on unfamiliar infrastructure and tools Automation knowledge on a wide variety of tools – Ansible, Terraform, Grafana, and Kubernetes, etc. Prior experience migrating a game from one Cloud provider onto AWS Excellent knowledge of containers, virtual machines and Linux server administration Demonstrable ...

DevOps Engineer

Hiring Organisation
Code Wizards Group
Location
Oxford, Oxfordshire, UK
Employment Type
Full-time
Unity, Unreal etc. An excitement to expand your knowledge on unfamiliar infrastructure and tools Automation knowledge on a wide variety of tools – Ansible, Terraform, Grafana, and Kubernetes, etc. Prior experience migrating a game from one Cloud provider onto AWS Excellent knowledge of containers, virtual machines and Linux server administration Demonstrable ...

Principal Engineer

Hiring Organisation
Motive Group
Location
Slough, Berkshire, UK
Employment Type
Full-time
Infrastructure-as-Code (Terraform) and configuration management tools (Ansible, Puppet, or similar). Strong observability experience using tools like Prometheus/Mimir, Loki, Tempo, Grafana, Alertmanager. Experience deploying and operating large-scale GPU clusters or HPC systems (Ideally). Working knowledge of ML infrastructure and familiarity with GPU drivers, CUDA ...

Technical Solutions Engineer - Deep-Tech AI Start-up

Hiring Organisation
Urban Digital Recruitment Ltd
Location
Slough, Berkshire, UK
Employment Type
Full-time
+ edge devices Troubleshoot end-to-end: AI model behaviour, device integrations, networks, cloud infra, on-device performance Analyse logs, metrics and telemetry (Grafana, Metabase) to pinpoint root cause Work hands-on with Linux, SQL, Docker, AWS/GCP/Azure Lead pilots, rollouts and on-device testing across major ...

Senior Front End Engineer

Hiring Organisation
NewDay
Location
Slough, Berkshire, UK
Employment Type
Full-time
working with monorepo project structures using tools such as Turborepo or Nx. Experience being on call, with monitoring and observability tools (e.g., Azure AppInsights, Grafana). Experience with web accessibility standards (WCAG 2.1/2.2) Where next? In the NewDay Tech team, you'll join an Expert or Leader career ...

Application Support Engineer - Elite Systematic Trading Firm - Prop Trading - Market Leading Compensation - WFH - London

Hiring Organisation
Mondrian Alpha
Location
Slough, Berkshire, UK
Employment Type
Full-time
tools like Ansible and Geneos. Experience with relational databases (Postgres, Sybase, Oracle) and writing performant SQL queries. Knowledge of production-grade monitoring tools (e.g., Grafana, Splunk), alert tuning, and system health validation. Understanding of low-latency infrastructure, colocation environments, and performance tuning (CPU affinity, NUMA). Strong communicator with ...

Senior Backend Engineer

Hiring Organisation
M-XR
Location
Slough, Berkshire, UK
Employment Type
Full-time
asset storage, retrieval, and management systems (AWS S3) Build job queue management for async ML workflows (SNS, SQS) Setup application monitoring and logging (CloudWatch, Grafana, Prometheus) Implement CI/CD for application deployment (Bitbucket Pipelines) Create API documentation and developer tools What we are looking for 5+ years backend development ...