Experience working in Agile environments Strong understanding of Site Reliability Engineering (SRE) principles Familiarity with Azure DevOps for CI/CD and pipeline management Knowledge of observability tools: Prometheus, Grafana, Loki, Tempo Experience with Infrastructure as Code: Helm, Kustomize Hands-on experience with Tekton and ArgoCD Ability to support and troubleshoot OpenShift Operators (ServiceMesh, ODF, ACS, ACM, AMQ) Understanding of More ❯
distributed systems and event-driven architectures (Kafka, RabbitMQ, WebSockets) Deep understanding of PostgreSQL , Redis , and high-performance data systems Strong DevOps mindset — CI/CD, infrastructure as code, observability (Grafana, Prometheus, OpenTelemetry) Exceptional communicator, able to influence architecture and direction across teams Nice to Have Experience with AWS , Kubernetes , or other cloud-native environments Exposure to financial data systems or More ❯
distributed systems and event-driven architectures (Kafka, RabbitMQ, WebSockets) Deep understanding of PostgreSQL , Redis , and high-performance data systems Strong DevOps mindset — CI/CD, infrastructure as code, observability (Grafana, Prometheus, OpenTelemetry) Exceptional communicator, able to influence architecture and direction across teams Nice to Have Experience with AWS , Kubernetes , or other cloud-native environments Exposure to financial data systems or More ❯
london (city of london), south east england, united kingdom
Orbis Group
distributed systems and event-driven architectures (Kafka, RabbitMQ, WebSockets) Deep understanding of PostgreSQL , Redis , and high-performance data systems Strong DevOps mindset — CI/CD, infrastructure as code, observability (Grafana, Prometheus, OpenTelemetry) Exceptional communicator, able to influence architecture and direction across teams Nice to Have Experience with AWS , Kubernetes , or other cloud-native environments Exposure to financial data systems or More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Huxley
or ARM templates Hands-on experience with CI/CD pipelines (e.g., Bitbucket, Azure DevOps) API Gateway, Azure API Management (APIM), Azure Application Gateway Monitoring tools such as Prometheus, Grafana, and Azure Monitor Understanding of secure multi-region deployments and network segmentation Remote Working Expected to be in the office 1 to 2 days a week. With additional days depending More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Morela
embed observability into the full delivery lifecycle Skills & Experience: Strong background in observability, monitoring, and event management Hands-on experience with platforms such as Dynatrace, Datadog, AppDynamics, Splunk, Prometheus, Grafana, New Relic, or Elastic Experience building integrations and automation using APIs, Python, Node.js, Go, or scripting Familiarity with AIOps platforms (BigPanda, Moogsoft, etc.) Knowledge of ITSM/incident management processes More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Betsson Group
Kafka). Strong grasp of telemetry, observability, and performance monitoring in distributed systems. Track record of technical leadership and setting engineering standards. Nice to Have Experience with OpenTelemetry , Prometheus, Grafana, or similar observability tooling. Exposure to hybrid-cloud or cloud migration strategies. Familiarity with performance optimisation in low-latency data pipelines. Contributions to DevOps-related communities, blogs, open source, or More ❯
london, south east england, united kingdom Hybrid / WFH Options
Betsson Group
Kafka). Strong grasp of telemetry, observability, and performance monitoring in distributed systems. Track record of technical leadership and setting engineering standards. Nice to Have Experience with OpenTelemetry , Prometheus, Grafana, or similar observability tooling. Exposure to hybrid-cloud or cloud migration strategies. Familiarity with performance optimisation in low-latency data pipelines. Contributions to DevOps-related communities, blogs, open source, or More ❯
to managing our infrastructure, using Terraform. We follow a GitOps approach to managing our Kubernetes configuration, using ArgoCD and Helm. We manage a high-availability metrics collection system using Grafana, Thanos & Prometheus. We’re in the process of transitioning to OpenTelemetry and Honeycomb for our application telemetry (traces and metrics). We manage a data pipeline using Pub/Sub More ❯
to managing our infrastructure, using Terraform. We follow a GitOps approach to managing our Kubernetes configuration, using ArgoCD and Helm. We manage a high-availability metrics collection system using Grafana, Thanos & Prometheus. We’re in the process of transitioning to OpenTelemetry and Honeycomb for our application telemetry (traces and metrics). We manage a data pipeline using Pub/Sub More ❯
london (city of london), south east england, united kingdom
Duffel
to managing our infrastructure, using Terraform. We follow a GitOps approach to managing our Kubernetes configuration, using ArgoCD and Helm. We manage a high-availability metrics collection system using Grafana, Thanos & Prometheus. We’re in the process of transitioning to OpenTelemetry and Honeycomb for our application telemetry (traces and metrics). We manage a data pipeline using Pub/Sub More ❯
london, south east england, united kingdom Hybrid / WFH Options
Understanding Recruitment
engineering role Strong scripting skills in Python , Bash , or Ruby Familiarity with configuration management tools (Ansible, Puppet, or Chef) Interest or exposure to observability tools like Datadog , Prometheus , or Grafana A passion for learning and improving in high-performance environments This is a rare chance to learn from elite engineers and contribute directly to a platform supporting global trading systems More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Understanding Recruitment
engineering role Strong scripting skills in Python , Bash , or Ruby Familiarity with configuration management tools (Ansible, Puppet, or Chef) Interest or exposure to observability tools like Datadog , Prometheus , or Grafana A passion for learning and improving in high-performance environments This is a rare chance to learn from elite engineers and contribute directly to a platform supporting global trading systems More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Understanding Recruitment
engineering role Strong scripting skills in Python , Bash , or Ruby Familiarity with configuration management tools (Ansible, Puppet, or Chef) Interest or exposure to observability tools like Datadog , Prometheus , or Grafana A passion for learning and improving in high-performance environments This is a rare chance to learn from elite engineers and contribute directly to a platform supporting global trading systems More ❯
writing intermediate to advanced SQL queries for data extraction and troubleshooting purposes. Experience with using and troubleshooting programming interfaces especially REST APIs and Web Sockets. Experience with monitoring tools (Grafana, DataDog) Experience working with Crypto and blockchain (DLT) Familiarity with common engineering development workflows and tools (e.g. JIRA, Confluences, github, scrum, etc...) Familiarly with scaling, monitoring, and general production challenges More ❯
writing intermediate to advanced SQL queries for data extraction and troubleshooting purposes. Experience with using and troubleshooting programming interfaces especially REST APIs and Web Sockets. Experience with monitoring tools (Grafana, DataDog) Experience working with Crypto and blockchain (DLT) Familiarity with common engineering development workflows and tools (e.g. JIRA, Confluences, github, scrum, etc...) Familiarly with scaling, monitoring, and general production challenges More ❯
london (city of london), south east england, united kingdom
Global Fintech
writing intermediate to advanced SQL queries for data extraction and troubleshooting purposes. Experience with using and troubleshooting programming interfaces especially REST APIs and Web Sockets. Experience with monitoring tools (Grafana, DataDog) Experience working with Crypto and blockchain (DLT) Familiarity with common engineering development workflows and tools (e.g. JIRA, Confluences, github, scrum, etc...) Familiarly with scaling, monitoring, and general production challenges More ❯
Platform (GCP) services. Familiarity with incident.io for incident tracking and management (of equivalent) Proficiency in using JIRA for task management and support workflows. Strong experience working with observability tools (Grafana) Strong troubleshooting and problem-solving skills in cloud environments. Understanding of cloud security and performance optimisation best practices. Knowledge of scripting or automation tools (e.g., Python, Terraform) is a plus. More ❯
Platform (GCP) services. Familiarity with incident.io for incident tracking and management (of equivalent) Proficiency in using JIRA for task management and support workflows. Strong experience working with observability tools (Grafana) Strong troubleshooting and problem-solving skills in cloud environments. Understanding of cloud security and performance optimisation best practices. Knowledge of scripting or automation tools (e.g., Python, Terraform) is a plus. More ❯
london (city of london), south east england, united kingdom
WALT Labs
Platform (GCP) services. Familiarity with incident.io for incident tracking and management (of equivalent) Proficiency in using JIRA for task management and support workflows. Strong experience working with observability tools (Grafana) Strong troubleshooting and problem-solving skills in cloud environments. Understanding of cloud security and performance optimisation best practices. Knowledge of scripting or automation tools (e.g., Python, Terraform) is a plus. More ❯
Burgess Hill, West Sussex, South East, United Kingdom Hybrid / WFH Options
Randstad Digital
driven architecture . Databases & Messaging: Strong knowledge of both SQL and NoSQL databases, as well as Kafka . Tools: Familiarity with Jenkins , GitHub , and monitoring tools like Splunk or Grafana . Good to Have: Experience with reactive programming , caching mechanisms , and Agile projects. If you are a passionate and skilled developer, we encourage you to apply and join our team. More ❯
to ensure applications meet performance and reliability standards. Automate operational tasks using tools such as Ansible, Terraform, or Python scripts. Build and maintain monitoring and alerting systems (eg, Prometheus, Grafana). Participate in incident response and conduct root cause analysis for performance-related issues. Document performance benchmarks, testing procedures, and system configurations. If you are interested in this position and More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Ncounter
VLAN/VxLAN, MLAG, STP. Hands-on with Arista/Cisco; strong troubleshooting tools (Wireshark, netcat, etc.). Familiar with network security, automation (Python, Ansible), and observability stacks (Prometheus, Grafana). Excellent communicator with experience delivering in high-stakes, collaborative settings. STEM degree and CCNP/CCIE preferred. Why Join? Join a trusted global institution where networking is core to More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Venn Group
including RHEL, CentOS, Ubuntu, VMware, and F5 load balancers Manage web services, LAMP stack applications, Samba servers, and authentication proxies Utilise tools such as Ansible, Katello, Nagios, Prometheus, and Grafana for configuration and monitoring Automate routine tasks using scripts and infrastructure-as-code practices Maintain clear and up-to-date technical documentation Support knowledge sharing and training for first- and More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Eligo Recruitment
ll Bring Strong experience with GCP , Terraform , and Infrastructure-as-Code Deep knowledge of cloud networking, security automation, and compliance standards Proficiency in CI/CD pipelines , monitoring tools (Grafana, Datadog), and scripting A collaborative mindset with excellent communication and mentoring skills Why Join? Shape a next-gen AI infrastructure with autonomy and purpose Hybrid working with regular meetups in More ❯