with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
East London, London, United Kingdom Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
Central London / West End, London, United Kingdom Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
in production (AKS): cluster operations, node pools, networking (CNI), RBAC and workload identity. Experience with GitOps, and container build pipelines (ACR, OPA policies, image scanning). Working knowledge of observability tooling (Azure Monitor, Log Analytics, Application Insights, Datadog/Grafana) and alerting/response workflows. Good understanding of the Microsoft Cloud Adoption Framework, Azure Landing Zones and the Well-Architected More ❯
in production (AKS): cluster operations, node pools, networking (CNI), RBAC and workload identity. Experience with GitOps, and container build pipelines (ACR, OPA policies, image scanning). Working knowledge of observability tooling (Azure Monitor, Log Analytics, Application Insights, Datadog/Grafana) and alerting/response workflows. Good understanding of the Microsoft Cloud Adoption Framework, Azure Landing Zones and the Well-Architected More ❯
East London, London, United Kingdom Hybrid/Remote Options
RedRock Resourcing
in production (AKS): cluster operations, node pools, networking (CNI), RBAC and workload identity. Experience with GitOps, and container build pipelines (ACR, OPA policies, image scanning). Working knowledge of observability tooling (Azure Monitor, Log Analytics, Application Insights, Datadog/Grafana) and alerting/response workflows. Good understanding of the Microsoft Cloud Adoption Framework, Azure Landing Zones and the Well-Architected More ❯
City of London, London, United Kingdom Hybrid/Remote Options
RedRock Resourcing
in production (AKS): cluster operations, node pools, networking (CNI), RBAC and workload identity. Experience with GitOps, and container build pipelines (ACR, OPA policies, image scanning). Working knowledge of observability tooling (Azure Monitor, Log Analytics, Application Insights, Datadog/Grafana) and alerting/response workflows. Good understanding of the Microsoft Cloud Adoption Framework, Azure Landing Zones and the Well-Architected More ❯
Central London / West End, London, United Kingdom Hybrid/Remote Options
RedRock Resourcing
in production (AKS): cluster operations, node pools, networking (CNI), RBAC and workload identity. Experience with GitOps, and container build pipelines (ACR, OPA policies, image scanning). Working knowledge of observability tooling (Azure Monitor, Log Analytics, Application Insights, Datadog/Grafana) and alerting/response workflows. Good understanding of the Microsoft Cloud Adoption Framework, Azure Landing Zones and the Well-Architected More ❯
pipelines and automation tools. Ensure system reliability, availability, and performance through proactive monitoring. Collaborate with developers, platform engineers, and security teams to optimise service delivery. Drive operational excellence through observability and robust incident management practices. Required Skills & Experience: Active eDV Clearance (essential). Strong background in Linux or Cloud-native environments. Hands-on experience with tools such as Terraform , Ansible More ❯
pipelines and automation tools. Ensure system reliability, availability, and performance through proactive monitoring. Collaborate with developers, platform engineers, and security teams to optimise service delivery. Drive operational excellence through observability and robust incident management practices. Required Skills & Experience: Active eDV Clearance (essential). Strong background in Linux or Cloud-native environments. Hands-on experience with tools such as Terraform , Ansible More ❯
pipelines and automation tools. Ensure system reliability, availability, and performance through proactive monitoring. Collaborate with developers, platform engineers, and security teams to optimise service delivery. Drive operational excellence through observability and robust incident management practices. Required Skills & Experience: Active eDV Clearance (essential). Strong background in Linux or Cloud-native environments. Hands-on experience with tools such as Terraform , Ansible More ❯
Salary : Up to £100,000 (DOE) + Equity Location : Fully remote (UK only) Stack : Python , TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling You’ll join a remote first, high-trust engineering team working with a modern, cloud-native stack - with real influence over technical decisions from day one. You More ❯
Salary : Up to £100,000 (DOE) + Equity Location : Fully remote (UK only) Stack : Python , TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling You’ll join a remote first, high-trust engineering team working with a modern, cloud-native stack - with real influence over technical decisions from day one. You More ❯
East London, London, United Kingdom Hybrid/Remote Options
Wilson Brown
Salary : Up to £100,000 (DOE) + Equity Location : Fully remote (UK only) Stack : Python , TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling You’ll join a remote first, high-trust engineering team working with a modern, cloud-native stack - with real influence over technical decisions from day one. You More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Wilson Brown
Salary : Up to £100,000 (DOE) + Equity Location : Fully remote (UK only) Stack : Python , TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling You’ll join a remote first, high-trust engineering team working with a modern, cloud-native stack - with real influence over technical decisions from day one. You More ❯
Central London / West End, London, United Kingdom Hybrid/Remote Options
Wilson Brown
Salary : Up to £100,000 (DOE) + Equity Location : Fully remote (UK only) Stack : Python , TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling You’ll join a remote first, high-trust engineering team working with a modern, cloud-native stack - with real influence over technical decisions from day one. You More ❯
security best practices Building and maintaining CI/CD templates to enable rapid, reliable deployments Developing a developer portal (e.g. Backstage) and internal tooling to support engineering productivity Supporting observability initiatives and contributing to reliability metrics Working on internal open-source projects that enhance the developer experience What They’re Looking For: Strong experience with AWS (ECS, EC2, Lambda, VPC More ❯
security best practices Building and maintaining CI/CD templates to enable rapid, reliable deployments Developing a developer portal (e.g. Backstage) and internal tooling to support engineering productivity Supporting observability initiatives and contributing to reliability metrics Working on internal open-source projects that enhance the developer experience What They’re Looking For: Strong experience with AWS (ECS, EC2, Lambda, VPC More ❯
Kubernetes: Workload orchestration and container management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site More ❯
Kubernetes: Workload orchestration and container management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site More ❯
Kubernetes: Workload orchestration and container management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site More ❯