Trivy, Checkov, SonarQube) into automated workflows Manage authentication, access control, and secrets using Vault, AWS Secrets Manager, OAuth2.0, and Zero Trust principles Monitor environments with ELK Stack, Splunk, and Prometheus to ensure visibility, auditing, and compliance Collaborate with engineering, operations, and security teams to promote DevSecOps best practices Key Skills & Experience Strong background in cloud platforms, particularly AWS and Kubernetes More ❯
Trivy, Checkov, SonarQube) into automated workflows Manage authentication, access control, and secrets using Vault, AWS Secrets Manager, OAuth2.0, and Zero Trust principles Monitor environments with ELK Stack, Splunk, and Prometheus to ensure visibility, auditing, and compliance Collaborate with engineering, operations, and security teams to promote DevSecOps best practices Key Skills & Experience Strong background in cloud platforms, particularly AWS and Kubernetes More ❯
london (city of london), south east england, united kingdom
Damia Group
Trivy, Checkov, SonarQube) into automated workflows Manage authentication, access control, and secrets using Vault, AWS Secrets Manager, OAuth2.0, and Zero Trust principles Monitor environments with ELK Stack, Splunk, and Prometheus to ensure visibility, auditing, and compliance Collaborate with engineering, operations, and security teams to promote DevSecOps best practices Key Skills & Experience Strong background in cloud platforms, particularly AWS and Kubernetes More ❯
Key Details: Salary: £100k–£180k (flexible for strong profiles) + equity Working Model: On-site, London Tech Stack: AWS/GCP/Azure, Kubernetes, Docker, Terraform, Python, MLflow/Prometheus/Grafana If you want to shape the backbone of one of Europe’s most ambitious AI startups, we’d love to hear from you. More ❯
Key Details: Salary: £100k–£180k (flexible for strong profiles) + equity Working Model: On-site, London Tech Stack: AWS/GCP/Azure, Kubernetes, Docker, Terraform, Python, MLflow/Prometheus/Grafana If you want to shape the backbone of one of Europe’s most ambitious AI startups, we’d love to hear from you. More ❯
london (city of london), south east england, united kingdom
Harnham
Key Details: Salary: £100k–£180k (flexible for strong profiles) + equity Working Model: On-site, London Tech Stack: AWS/GCP/Azure, Kubernetes, Docker, Terraform, Python, MLflow/Prometheus/Grafana If you want to shape the backbone of one of Europe’s most ambitious AI startups, we’d love to hear from you. More ❯
Skills: Experience working in Agile environments Strong understanding of Site Reliability Engineering (SRE) principles Familiarity with Azure DevOps for CI/CD and pipeline management Knowledge of observability tools: Prometheus, Grafana, Loki, Tempo Experience with Infrastructure as Code: Helm, Kustomize Hands-on experience with Tekton and ArgoCD Ability to support and troubleshoot OpenShift Operators (ServiceMesh, ODF, ACS, ACM, AMQ) Understanding More ❯
management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site Reliability Engineering or DevOps transformation More ❯
london (city of london), south east england, united kingdom
Prism Digital
management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site Reliability Engineering or DevOps transformation More ❯
management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site Reliability Engineering or DevOps transformation More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Huxley
Bicep or ARM templates Hands-on experience with CI/CD pipelines (e.g., Bitbucket, Azure DevOps) API Gateway, Azure API Management (APIM), Azure Application Gateway Monitoring tools such as Prometheus, Grafana, and Azure Monitor Understanding of secure multi-region deployments and network segmentation Remote Working Expected to be in the office 1 to 2 days a week. With additional days More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Morela
to embed observability into the full delivery lifecycle Skills & Experience: Strong background in observability, monitoring, and event management Hands-on experience with platforms such as Dynatrace, Datadog, AppDynamics, Splunk, Prometheus, Grafana, New Relic, or Elastic Experience building integrations and automation using APIs, Python, Node.js, Go, or scripting Familiarity with AIOps platforms (BigPanda, Moogsoft, etc.) Knowledge of ITSM/incident management More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Betsson Group
RabbitMQ, Kafka). Strong grasp of telemetry, observability, and performance monitoring in distributed systems. Track record of technical leadership and setting engineering standards. Nice to Have Experience with OpenTelemetry , Prometheus, Grafana, or similar observability tooling. Exposure to hybrid-cloud or cloud migration strategies. Familiarity with performance optimisation in low-latency data pipelines. Contributions to DevOps-related communities, blogs, open source More ❯
london, south east england, united kingdom Hybrid / WFH Options
Betsson Group
RabbitMQ, Kafka). Strong grasp of telemetry, observability, and performance monitoring in distributed systems. Track record of technical leadership and setting engineering standards. Nice to Have Experience with OpenTelemetry , Prometheus, Grafana, or similar observability tooling. Exposure to hybrid-cloud or cloud migration strategies. Familiarity with performance optimisation in low-latency data pipelines. Contributions to DevOps-related communities, blogs, open source More ❯
on experience with Gatling and open-source performance tools. Strong knowledge of CI/CD tools (Jenkins, GitHub Actions, Gradle/Maven). Skilled in monitoring/logging with Prometheus and Grafana. Proficiency in scripting languages (Scala, Python, Shell). This role offers the chance to make an impact on a global platform, work with cutting-edge tech, and collaborate More ❯
on experience with Gatling and open-source performance tools. Strong knowledge of CI/CD tools (Jenkins, GitHub Actions, Gradle/Maven). Skilled in monitoring/logging with Prometheus and Grafana. Proficiency in scripting languages (Scala, Python, Shell). This role offers the chance to make an impact on a global platform, work with cutting-edge tech, and collaborate More ❯
on experience with Gatling and open-source performance tools. Strong knowledge of CI/CD tools (Jenkins, GitHub Actions, Gradle/Maven). Skilled in monitoring/logging with Prometheus and Grafana. Proficiency in scripting languages (Scala, Python, Shell). This role offers the chance to make an impact on a global platform, work with cutting-edge tech, and collaborate More ❯
on experience with Gatling and open-source performance tools. Strong knowledge of CI/CD tools (Jenkins, GitHub Actions, Gradle/Maven). Skilled in monitoring/logging with Prometheus and Grafana. Proficiency in scripting languages (Scala, Python, Shell). This role offers the chance to make an impact on a global platform, work with cutting-edge tech, and collaborate More ❯
london (city of london), south east england, united kingdom
Arrows
on experience with Gatling and open-source performance tools. Strong knowledge of CI/CD tools (Jenkins, GitHub Actions, Gradle/Maven). Skilled in monitoring/logging with Prometheus and Grafana. Proficiency in scripting languages (Scala, Python, Shell). This role offers the chance to make an impact on a global platform, work with cutting-edge tech, and collaborate More ❯
systems and event-driven architectures (Kafka, RabbitMQ, WebSockets) Deep understanding of PostgreSQL , Redis , and high-performance data systems Strong DevOps mindset — CI/CD, infrastructure as code, observability (Grafana, Prometheus, OpenTelemetry) Exceptional communicator, able to influence architecture and direction across teams Nice to Have Experience with AWS , Kubernetes , or other cloud-native environments Exposure to financial data systems or blockchain More ❯
systems and event-driven architectures (Kafka, RabbitMQ, WebSockets) Deep understanding of PostgreSQL , Redis , and high-performance data systems Strong DevOps mindset — CI/CD, infrastructure as code, observability (Grafana, Prometheus, OpenTelemetry) Exceptional communicator, able to influence architecture and direction across teams Nice to Have Experience with AWS , Kubernetes , or other cloud-native environments Exposure to financial data systems or blockchain More ❯
london (city of london), south east england, united kingdom
Orbis Group
systems and event-driven architectures (Kafka, RabbitMQ, WebSockets) Deep understanding of PostgreSQL , Redis , and high-performance data systems Strong DevOps mindset — CI/CD, infrastructure as code, observability (Grafana, Prometheus, OpenTelemetry) Exceptional communicator, able to influence architecture and direction across teams Nice to Have Experience with AWS , Kubernetes , or other cloud-native environments Exposure to financial data systems or blockchain More ❯
teams to ensure applications meet performance and reliability standards. Automate operational tasks using tools such as Ansible, Terraform, or Python scripts. Build and maintain monitoring and alerting systems (eg, Prometheus, Grafana). Participate in incident response and conduct root cause analysis for performance-related issues. Document performance benchmarks, testing procedures, and system configurations. If you are interested in this position More ❯
london, south east england, united kingdom Hybrid / WFH Options
Understanding Recruitment
/infrastructure engineering role Strong scripting skills in Python , Bash , or Ruby Familiarity with configuration management tools (Ansible, Puppet, or Chef) Interest or exposure to observability tools like Datadog , Prometheus , or Grafana A passion for learning and improving in high-performance environments This is a rare chance to learn from elite engineers and contribute directly to a platform supporting global More ❯