Isleworth, England, United Kingdom Hybrid / WFH Options
Sky
and best practices Proven experience in logging systems (e.g. ELK stack) Proven experience in monitoring systems (e.g. Prometheus) Proven experience in tracing systems (e.g. OpenTelemetry, Jaeger) Experience in performance optimization and resource management Relevant certifications (AWS, Google) Understanding of Agile methodologies Ability to diagnose and resolve service-affecting issues in More ❯
London, England, United Kingdom Hybrid / WFH Options
Sky Ireland Limited
and best practices Proven experience in logging systems (e.g. ELK stack) Proven experience in monitoring systems (e.g. Prometheus) Proven experience in tracing systems (e.g. OpenTelemetry, Jaeger) Experience in performance optimization and resource management Relevant certifications (AWS, Google) Understanding of Agile methodologies Ability to diagnose and resolve service-affecting issues in More ❯
scaling observability platforms in a cloud-native environment. Observability Expertise: Deep understanding of observability pillars (metrics, logs, traces) and related tools (e.g., Prometheus, Grafana, OpenTelemetry, Jaeger, Kibana, Elastic Stack). AI/ML Proficiency: Hands-on experience integrating ML/AI models into observability systems to drive advanced insights, anomaly More ❯
London, England, United Kingdom Hybrid / WFH Options
XpertDirect
GKE) Automate infrastructure provisioning with Infrastructure as Code tools like Terraform or Pulumi Implement observability tools (monitoring, logging, tracing) such as Prometheus, Grafana, Loki, OpenTelemetry Ensure platform security using best practices in identity, access management, and network policies Collaborate with cross-functional teams to define cloud architecture standards and promote More ❯
Prometheus, Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as Code (Terraform/Ansible) Hands-on experience in technical integrations (OpenTelemetry/fluentd/fluentbit/filebeat/logstash) Hands-on experience with complex troubleshooting of Kubernetes and Docker containers Good knowledge of Regex, Lucene, PromQL More ❯
Prometheus, Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as Code (Terraform/Ansible) Hands-on experience in technical integrations (OpenTelemetry/fluentd/fluentbit/filebeat/logstash) Hands-on experience with complex troubleshooting of Kubernetes and Docker containers Good knowledge of RegEx, Lucene, PromQL More ❯
Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as Code (Terraform/Ansible) Hands-on experience in technical integrations (OpenTelemetry/fluentd/fluentbit/filebeat/logstash) Hands-on experience with complex troubleshooting of Kubernetes and Docker containers Good knowledge of RegEx, Lucene, PromQL More ❯
Code (IaC): Proficiency with Infrastructure as Code (IaC) tools such as Terraform or CloudFormation. Distributed Tracing: Experience with distributed tracing tools like Jaeger or OpenTelemetry for debugging microservices. Security: Strong knowledge of securing microservices, Kubernetes clusters, and cloud-based applications. Additional Information We believe that coming together as a community More ❯
with cloud platforms (AWS, GCP, Azure) and DevOps tooling Familiarity with observability stacks like Grafana, Prometheus, Datadog, Splunk, Kibana, etc. Experience with technical integrations (OpenTelemetry, Fluentd, Fluentbit, Filebeat, etc.) Skilled in troubleshooting Kubernetes and containerised environments Strong communication skills — able to engage with technical teams and senior stakeholders Comfortable working More ❯
London, England, United Kingdom Hybrid / WFH Options
Auros
managing configuration at scale. Experience with CI/CD pipeline, version control best practices. Experience with application and infrastructure instrumentation using tools like Prometheus, OpenTelemetry and eBPF. This is not a network engineer role, however knowledge of networking management and routing in both a cloud and global SD-WAN environment More ❯
London, England, United Kingdom Hybrid / WFH Options
9fin
AWS services including ECS, EC2, Lambda, VPC, IAM, Route53, CloudFront, S3, RDS Good understanding of monitoring and logging solutions, e.g. Prometheus, AWS CloudWatch, Grafana, OpenTelemetry, Honeycomb, ELK etc. Basic SRE knowledge, and experience in alerting and incident management platforms (e.g. Opsgenie, PagerDuty) Proven ability to provide and support strong and More ❯
related field. 5+ years of experience as a Site Reliability Engineer or equivalent in a similar role. Proficient in application and infrastructure observability, Splunk OpenTelemetry preferred Experienced in production environments running in AWS Comfortable with Infrastructure as Code, Terraform is preferred Comfortable with CI/CD pipelines such as GitHub More ❯
London, England, United Kingdom Hybrid / WFH Options
Circadia Health
AI workloads, MLOps, or large‐language‐model serving. Knowledge of edge/IoT deployments and over‐the‐air update strategies. Exposure to observability stacks (OpenTelemetry, Loki) and security tooling (Falco, Aqua, Wiz). What We Offer Base salary £100,000 – £170,000 plus meaningful equity. Gym membership Comprehensive health, dental More ❯
need to check all the boxes, any combination of the below is appreciated) - Monitoring systems knowledge: Prometheus, Logstash, Elasticsearch, Grafana, InfluxDB, Jaeger, OpenTelemetry or similar. Experience with Kubernetes (k3s, k8s), Singularity and/or Docker. Other frontend and/or visualization technologies like HTML/CSS or D3.js. More ❯
London, England, United Kingdom Hybrid / WFH Options
Deutsche Bank
and scripting Strong experience with a programming language such as Python, Java, etc. Strong experience with monitoring and observability tools (Prometheus, Grafana, Splunk, Geneos, OpenTelemetry, Corvil) Familiarity with cloud platforms, containerization (e.g., Kubernetes, Docker), and CI (Continuous Integration)/CD (Continuous Delivery) pipelines Strong understanding of the trade lifecycle and More ❯
/coding skills in one or more languages (Python/Golang etc.). Expert knowledge of observability systems (Prometheus/ELK/Jaeger/OpenTelemetry/Service Meshes etc.). Experience with configuration management tools (Ansible/Puppet/Kapitan/Terraform). Experience with distributed data platforms (Kafka/ More ❯
with IAM, secrets management, regulatory requirements (GDPR), and DevSecOps best practices. Observability: Demonstrable experience with logging, metrics, and tracing frameworks (ELK/EFK, Prometheus, Grafana, OpenTelemetry). Preferred Qualifications (Nice-to-Have) AWS Solutions Architect - Professional certification Automotive Industry Experience: Familiarity with vehicle telematics, connected car platforms, or automotive retail solutions More ❯
and DevOps Pipelines and AWS including EKS, Lambda and CloudFormation Infrastructure as Code and GitOps: Terraform, Bicep, Pulumi, ArgoCD and FluxCD Observability: Prometheus, Grafana, OpenTelemetry and Datadog Security and Compliance: HashiCorp Vault, Azure Key Vault, AWS KMS, OPA Gatekeeper and Drata or similar. Interested in exploring this further? This is More ❯