OpenTelemetry Jobs in East London

4 of 4 OpenTelemetry Jobs in East London

Site Reliability Engineer

South East London, England, United Kingdom
Hybrid / WFH Options
Explore Group
Tech Stack
Cloud: AWS (EKS, ECS, RDS, IAM, Lambda, etc.)
IaC: Terraform, Terragrunt
Containerisation: Docker, Kubernetes (EKS)
CI/CD: GitHub Actions, Argo CD, Helm
Monitoring: Prometheus, Grafana, CloudWatch, OpenTelemetry
Languages: Python, Bash, Go (bonus)
What We're Looking For
Strong experience in SRE, DevOps, or Production Engineering roles
Proven hands-on skills with AWS, Terraform, and Kubernetes
Experience with …

Observability Engineer - Grafana Dashboarding

South East London, England, United Kingdom
Levy Global
… with cross-functional engineering teams. Experience working in Linux-based environments. Bonus/Nice-to-Have Skills: Experience deploying Grafana instances via code (provisioning dashboards, datasources). Familiarity with OpenTelemetry, metric instrumentation, and telemetry pipelines. Background in data center environments, infrastructure monitoring, or SRE practices. Exposure to CI/CD workflows, containers (Podman/Docker), and cloud-native systems.

Senior Software Engineer

South East London, England, United Kingdom
Stealth AI Startup
… and user training. Collaborate directly with executives & operators - run white-boarding sessions, turn ambiguous requirements into concrete specs, demo weekly, and iterate fast. Champion observability & reliability - instrument services with OpenTelemetry, define SLIs/SLOs, and automate incident response. Contribute across the stack - build lightweight front-ends when needed and pair with ML engineers on inference and evaluation pipelines. You Might …

Site Reliability Engineer

South East London, England, United Kingdom
Hybrid / WFH Options
Unitary
… worked with visualisation tools such as Grafana for creating and maintaining dashboards that provide meaningful insights into system performance
Are proficient with metrics platforms such as Prometheus, InfluxDB, or OpenTelemetry for collecting and analysing system data
Have experience with incident management tools such as Incident.io for coordinating response efforts and recording follow-up learnings and actions
Can demonstrate strong problem …