26 to 50 of 98 OpenTelemetry Jobs in the UK

Observability Engineer

Hiring Organisation
Hays Technology
Location
City of London, London, Cheap, United Kingdom
Employment Type
Contract
move to an AIOps environment. What you'll need to succeed: extensive experience in observability/SRE/platform engineering roles; strong experience with OpenTelemetry, Prometheus, Grafana, Splunk, Elastic, etc.; Python, Go or Java programming; experience with Terraform, Helm or other IaC tools. What you'll get in return ...

Senior TechOps Engineer

Hiring Organisation
IntaPeople
Location
Cardiff, Butetown Community, South Glamorgan, United Kingdom
Employment Type
Permanent
Salary
£60000 - £70000/annum
comprehensive monitoring stack whilst configuring Prometheus for metrics, visualise data in Grafana, and standardise distributed tracing across our services using tools such as OpenTelemetry. You will implement and manage traffic management strategies using AWS to ensure performance standards for their global client base. Using Kubernetes (EKS) to manage ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
London, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Nottingham, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Liverpool, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Southampton, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Glasgow, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Leicester, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Leeds, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Birmingham, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Bristol, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Manchester, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Woking, Surrey, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Shrewsbury, Shropshire, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Bedford, Bedfordshire, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Cheltenham, Gloucestershire, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Worcester, Worcestershire, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Stevenage, Hertfordshire, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Plymouth, Devon, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Norwich, Norfolk, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Luton, Bedfordshire, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Gloucester, Gloucestershire, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Wakefield, West Yorkshire, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Newport, Isle of Wight, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...

Site Reliability Engineer

Hiring Organisation
SS&C Technologies
Location
Wolverhampton, West Midlands, UK
Employment Type
Full-time
recurrences through high-quality post-incident actions. Observability as a first‐class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, root cause issues, and create actionable alerts and dashboards. Run Kubernetes at scale: Operate and harden Kubernetes (EKS preferred); manage deployments, autoscaling … Nice‐to‐Have EKS internals, cluster autoscaler, managed node groups/Fargate; service mesh (Istio/Linkerd), ingress controllers (Nginx/ALB). Prometheus, OpenTelemetry, Loki/Tempo, alert tuning and SLO burn‐rate alerts. Argo CD/FluxCD, Helm chart authoring, Kustomize. CD patterns (blue/green, canary, feature flags ...