London, England, United Kingdom Hybrid/Remote Options
Staffworx
cloud environments. Strong communication skills, creativity and a systems-thinking mindset. Curiosity, adaptability and a drive to stay ahead of rapid advancements in GenAI. BENEFICIAL Experience with PromptOps & LLM Observability tools (PromptLayer, LangFuse, Humanloop, Helicone, LangSmith). Understanding of Responsible AI, model safety, bias mitigation, evaluation frameworks and governance. Background in Computer Science, AI/ML, Engineering, or related fields. More ❯
frontend technologies such as HTML, CSS, HTMX, React Experience of C# with .NET Web and API development Understanding of git, CI/CD and DevOps practices and experience in Observability/monitoring Any knowledge of Actuarial/Commercial Insurance Some exposure to or interest in Terraform and deployment pipelines in AWS or Azure Awareness of modern software techniques such as More ❯
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
East London, London, United Kingdom Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
Central London / West End, London, United Kingdom Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
and orchestration using Docker, Kubernetes, Helm, Service Mesh (ie. Istio) and GitOps (ie. ArgoCD), with a focus on streamlined deployments and managing complex service-oriented architectures. Experienced in leveraging observability tools, such as Honeycomb (OpenTelemetry) and DataDog, to support data-driven decisions across the wider engineering team. Comprehensive understanding of networking in cloud environments, including VPN solutions, efficient network configuration More ❯
to join their Cloud Engineering team. You will assist in building scalable, cutting edge, automated GCP Platform. Core skills: GCP Terraform/Terramate GKE/Kubernetes CI/CD Observability – Mimir, Grafana, Prometheus Python Desirable: Airflow PostgreSQL Helm Please apply ASAP for more information. More ❯
to join their Cloud Engineering team. You will assist in building scalable, cutting edge, automated GCP Platform. Core skills: GCP Terraform/Terramate GKE/Kubernetes CI/CD Observability – Mimir, Grafana, Prometheus Python Desirable: Airflow PostgreSQL Helm Please apply ASAP for more information. More ❯
AWS and AWS Services Expert knowledge of Containerisation with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform History of working across CI/CD pipelines Monitoring and Observability experience with Prometheus, Grafana, and/or DataDog Prior experience overseeing Change and Incident Management processes This position is open to Lead level Engineers, able to offer £80-90K More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Oliver Bernard
AWS and AWS Services Expert knowledge of Containerisation with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform History of working across CI/CD pipelines Monitoring and Observability experience with Prometheus, Grafana, and/or DataDog Prior experience overseeing Change and Incident Management processes This position is open to Lead level Engineers, able to offer £80-90K More ❯
Salary : Up to £65,000 (DOE) + Equity Location : Fully remote (UK only) Stack : Python, TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling You’ll join a remote first, high-trust engineering team working with a modern, cloud-native stack - with real influence over technical decisions from day one. While More ❯
Salary : Up to £65,000 (DOE) + Equity Location : Fully remote (UK only) Stack : Python, TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling You’ll join a remote first, high-trust engineering team working with a modern, cloud-native stack - with real influence over technical decisions from day one. While More ❯
East London, London, United Kingdom Hybrid/Remote Options
Wilson Brown
Salary : Up to £65,000 (DOE) + Equity Location : Fully remote (UK only) Stack : Python, TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling You’ll join a remote first, high-trust engineering team working with a modern, cloud-native stack - with real influence over technical decisions from day one. While More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Wilson Brown
Salary : Up to £65,000 (DOE) + Equity Location : Fully remote (UK only) Stack : Python, TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling You’ll join a remote first, high-trust engineering team working with a modern, cloud-native stack - with real influence over technical decisions from day one. While More ❯
Central London / West End, London, United Kingdom Hybrid/Remote Options
Wilson Brown
Salary : Up to £65,000 (DOE) + Equity Location : Fully remote (UK only) Stack : Python, TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling You’ll join a remote first, high-trust engineering team working with a modern, cloud-native stack - with real influence over technical decisions from day one. While More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Blockchain 121
performance, reliability, and cost efficiency Requirements 5+ years of experience managing production backend systems at scale Cloud and on-premise infrastructure (AWS, GCP, etc.) Kubernetes and container orchestration Networking, observability, and data systems (relational, in-memory, time-series) Infrastructure-as-code (Terraform, Ansible) Proven experience building fault-tolerant, secure, and automated systems Familiarity with performance testing, chaos engineering, and security More ❯
performance, reliability, and cost efficiency Requirements 5+ years of experience managing production backend systems at scale Cloud and on-premise infrastructure (AWS, GCP, etc.) Kubernetes and container orchestration Networking, observability, and data systems (relational, in-memory, time-series) Infrastructure-as-code (Terraform, Ansible) Proven experience building fault-tolerant, secure, and automated systems Familiarity with performance testing, chaos engineering, and security More ❯
pipelines and automation tools. Ensure system reliability, availability, and performance through proactive monitoring. Collaborate with developers, platform engineers, and security teams to optimise service delivery. Drive operational excellence through observability and robust incident management practices. Required Skills & Experience: Active eDV Clearance (essential). Strong background in Linux or Cloud-native environments. Hands-on experience with tools such as Terraform , Ansible More ❯
pipelines and automation tools. Ensure system reliability, availability, and performance through proactive monitoring. Collaborate with developers, platform engineers, and security teams to optimise service delivery. Drive operational excellence through observability and robust incident management practices. Required Skills & Experience: Active eDV Clearance (essential). Strong background in Linux or Cloud-native environments. Hands-on experience with tools such as Terraform , Ansible More ❯
pipelines and automation tools. Ensure system reliability, availability, and performance through proactive monitoring. Collaborate with developers, platform engineers, and security teams to optimise service delivery. Drive operational excellence through observability and robust incident management practices. Required Skills & Experience: Active eDV Clearance (essential). Strong background in Linux or Cloud-native environments. Hands-on experience with tools such as Terraform , Ansible More ❯
Salary : Up to £100,000 (DOE) + Equity Location : Fully remote (UK only) Stack : Python , TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling You’ll join a remote first, high-trust engineering team working with a modern, cloud-native stack - with real influence over technical decisions from day one. You More ❯