Isleworth, England, United Kingdom Hybrid / WFH Options
Sky
networking and security standards, protocols and best practices Proven experience in logging systems (e.g. ELK stack ) Proven experience in monitoring systems (e.g. Prometheus ) Proven experience in tracing systems (e.g. OpenTelemetry , Jaeger) Experience in performance optimization and resource management Relevant certifications (AWS, Google) Understanding of Agile methodologies Ability to diagnose and resolve service- affecting issues in a Broadcast/Livestream environment More ❯
London, England, United Kingdom Hybrid / WFH Options
Auros
required. Knowledge and experience in managing configuration at scale. Experience with CI/CD pipeline, version control best practices. Experience with application and infrastructure instrumentation using tools like Prometheus, OpenTelemetry and eBPF. This is not a network engineer role, however knowledge of networking management and routing in both a cloud and global SD-WAN environment is a plus. Understanding of More ❯
London, England, United Kingdom Hybrid / WFH Options
9fin
you: Good working knowledge of AWS services including ECS, EC2, Lambda, VPC, IAM, Route53, CloudFront, S3, RDS Good understanding of monitoring and logging solutions, e.g. Prometheus, AWS Cloudwatch, Grafana, OpenTelemetry, Honeycomb, ELK etc. Basic SRE knowledge, and experience in alerting and incident management platforms (eg. Opsgenie, Pagerduty) Proven ability to provide and support strong and scalable CI/CD pipelines More ❯
London, England, United Kingdom Hybrid / WFH Options
Circadia Health
. Experience orchestrating GPU/AI workloads , MLops, or large‐language‐model serving. Knowledge of edge/IoT deployments and over‐the‐air update strategies. Exposure to observability stacks (OpenTelemetry, Loki) and security tooling (Falco, Aqua, Wiz). What We Offer Base salary £100,000 – £170,000 plus meaningful equity. Gym membership Comprehensive health, dental & vision coverage (UK & global travel More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Stealth AI Startup
Who we are We are a seed-stage AI start-up backed by leading European and US funds. Our founders previously built and deployed cutting-edge AI systems at world-class research labs and high-growth technology companies. We apply More ❯
Who we are We are a seed-stage AI start-up backed by leading European and US funds. Our founders previously built and deployed cutting-edge AI systems at world-class research labs and high-growth technology companies. We apply More ❯
London, England, United Kingdom Hybrid / WFH Options
Stealth AI Startup
This range is provided by Stealth AI Startup. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range Direct message the job poster from Stealth AI Startup Fractional Talent More ❯
London, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Azure including AKS API Management and DevOps Pipelines and AWS including EKS Lambda and CloudFormation Infrastructure as Code and GitOps : Terraform Bicep Pulumi ArgoCD and FluxCD Observability : Prometheus Grafana OpenTelemetry and Datadog Security and Compliance : HashiCorp Vault Azure Key Vault AWS KMS OPA Gatekeeper and Drata or similar ? Interested in exploring this further This is a high impact role in More ❯
London, England, United Kingdom Hybrid / WFH Options
Durlston Partners
on with Docker (Kubernetes is a plus), infrastructure-as-code, and CI/CD tooling Strong scripting and automation experience in Python and Bash Familiarity with observability stacks (Prometheus, OpenTelemetry, eBPF) Cloud infrastructure experience (AWS/GCP/Azure), with attention to IAM and software supply chain security Curious, persistent, and comfortable experimenting at the lowest levels of the stack More ❯
London, England, United Kingdom Hybrid / WFH Options
Xtremepush
the API to Application to Database layer of the platform. Strong communication skills and ability to explain complex technical solutions simply to others Strong understanding of PHP, GoLang, MySQL, Opentelemetry, Prometheus Experience with Cloud and DevOps technologies (AWS, Terraform, CI/CD etc.) Experience with specific technologies in our stack: Clickhouse, Kafka, Pulsar, Python Experience with networking and security concepts More ❯
worked with visualisation tools such as Grafana for creating and maintaining dashboards that provide meaningful insights into system performance Are proficient with metrics platforms such as Prometheus, InfluxDB, or OpenTelemetry for collecting and analysing system data Have experience with incident management tools such as Incident.io for coordinating response efforts and recording follow-up learnings and actions Can demonstrate strong problem More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Unitary
worked with visualisation tools such as Grafana for creating and maintaining dashboards that provide meaningful insights into system performance Are proficient with metrics platforms such as Prometheus, InfluxDB, or OpenTelemetry for collecting and analysing system data Have experience with incident management tools such as Incident.io for coordinating response efforts and recording follow-up learnings and actions Can demonstrate strong problem More ❯
London, England, United Kingdom Hybrid / WFH Options
uSwitch
Control Tower Familiarity with Argo Workflows or similar data pipeline as a service tools Familiarity working with a variety of Cloud Native projects Familiarity with Github Action Familiarity with OpenTelemetry Out team has been featured in a few conferences: CNCF: PlatformCon: and We have also been featured in the London AWS Summit 2023 for contribution to the EKS tooling community More ❯
London, England, United Kingdom Hybrid / WFH Options
Birdie
About Birdie Birdie is the leading home healthcare technology platform that aims to radically transform the lives of older adults. Its all-in-one solution supports around 4.8 million (and growing) care visits every month, equipping care providers with the More ❯
London, England, United Kingdom Hybrid / WFH Options
Bjak
services. Conduct performance and load testing for distributed systems. Work with DevOps Engineers to integrate tests into CI/CD pipelines. Ensure observability and logging for test executions, e.g. OpenTelemetry, ELK. Collaborate with Software Engineers to enforce quality in system refactoring efforts. Bachelor's Degree in Computer Science, Software Engineering, or related fields. 3+ years of experience in QA Automation More ❯
London, England, United Kingdom Hybrid / WFH Options
Smart Communications group
Job Details: Observability Performance Engineer Full details of the job. Vacancy Name: Observability Performance Engineer Employment Type: Permanent Location: UK - Remote Summary We are looking for an Observability Performance Engineer to help us improve the visibility into the performance and More ❯
London, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
Job Description Site Reliability Engineer (SRE) - Kubernetes, Observability, Prometheus, Dynatrace, OpenTelemetry Role Overview This is a fantastic opportunity with a consulting company seeking to fill multiple SRE roles. You will play a key role in managing client platforms with a strong emphasis on observability and Kubernetes expertise. Joining their growing practice, you'll have access to extensive online and classroom More ❯
London, England, United Kingdom Hybrid / WFH Options
Abound
balancing multiple priorities—diagnosing complex system issues, onboarding clients to our platform, writing monitoring queries, and coordinating incident responses across teams. Our technology stack: AWS (including ECS and RDS), OpenTelemetry, NewRelic, Python, Postgres, Liquibase, Angular, Docker Who You Are Four or more years of professional experience in customer-facing technical support or engineering roles Excellent communication skills, with the ability More ❯
London, England, United Kingdom Hybrid / WFH Options
ProtonMail
demonstrating ability to diagnose and troubleshoot complex problems for critical systems Experience with Docker or similar containerization technologies Experience with metrics graphing/analysis toolkits such as Grafana and OpenTelemetry Experience with Python and/or Rust Experience with OpenAPI specs or other API documentation frameworks Why should you join Proton? Be part of a movement - Proton is not just More ❯