decisions and help shape how platform engineering is done as the team continues to scale. Tech stack: AWS (core services: EC2, RDS, S3, IAM, etc.); Monitoring and Observability (Grafana, Prometheus); Kubernetes (building and managing production clusters); Terraform (IaC provisioning); Python, Bash or Go (scripting, automation); GitHub Actions (CI/CD pipelines). What They’re Looking For: Experience in AWS cloud
in Terraform, Ansible, Jenkins, or GitLab CI. Knowledge of Kafka, Cassandra, and relational or NoSQL databases. Scripting skills in Python, Bash, Go, or Java. Familiarity with monitoring tools like Prometheus, Nagios, or Icinga. Understanding of networking fundamentals and virtualisation (e.g. VMware). Comfortable with on-call rotations and troubleshooting in live environments. 💰 Up to £600 per day (Inside IR35) 📍 London | Hybrid
shape how platform engineering is done as the team continues to scale. Tech stack: AWS (core services: EC2, RDS, S3, IAM, etc.); Configuration Management (Ansible); Monitoring and Observability (Grafana, Prometheus); Kubernetes (building and managing production clusters); Terraform (IaC provisioning); GitHub Actions (CI/CD pipelines). What They’re Looking For: Experience in AWS cloud infrastructure (ideally in a regulated or
and orchestration (Docker, Kubernetes). Ability to manage and optimize large-scale cloud infrastructure. Familiarity with Python (Jupyter) and ML frameworks (e.g., PyTorch). Experience with cloud monitoring tools (Prometheus, Grafana). Exposure to cloud-based databases (RDS, Aurora, Spanner, etc.) and data-visualisation tools. Knowledge of CI/CD tools (e.g., CircleCI).
GCP, or Azure). Experience with relational databases and data processing and query engines (Spark, Trino, or similar). Familiarity with monitoring, observability, and alerting systems for production ML (Prometheus, Grafana, Datadog, or equivalent). Understanding of ML concepts. You don't need to train models, but you should speak the language of Research Engineers and understand their constraints. A
City of London, London, United Kingdom Hybrid / WFH Options
Oliver Bernard
Services. Expert knowledge of Containerisation with Docker and Kubernetes. Strong Infrastructure as Code experience with Terraform. History of working across CI/CD pipelines. Monitoring and Observability experience with Prometheus, Grafana, and/or Datadog. Prior experience overseeing Change and Incident Management processes. Previous work in an Architectural capacity is also a massive bonus. This position is open to Lead
London (City of London), South East England, United Kingdom
Damia Group
Trivy, Checkov, SonarQube) into automated workflows. Manage authentication, access control, and secrets using Vault, AWS Secrets Manager, OAuth 2.0, and Zero Trust principles. Monitor environments with ELK Stack, Splunk, and Prometheus to ensure visibility, auditing, and compliance. Collaborate with engineering, operations, and security teams to promote DevSecOps best practices. Key Skills & Experience: Strong background in cloud platforms, particularly AWS and Kubernetes
London (City of London), South East England, United Kingdom
Harnham
Key Details: Salary: £100k–£180k (flexible for strong profiles) + equity. Working Model: On-site, London. Tech Stack: AWS/GCP/Azure, Kubernetes, Docker, Terraform, Python, MLflow/Prometheus/Grafana. If you want to shape the backbone of one of Europe’s most ambitious AI startups, we’d love to hear from you.
control (802.1X, RADIUS), or zero-trust security concepts. Exposure to infrastructure-as-code (Terraform, Ansible) and version control systems (Git). Experience with monitoring and observability tools (LogicMonitor, Grafana, Prometheus). Knowledge of hybrid cloud networking, including AWS Direct Connect or GCP Interconnect. Relevant certifications such as CCNP, AWS Advanced Networking Specialty, or Google Cloud Network Engineer.
City of London, London, United Kingdom Hybrid / WFH Options
Hargreaves Lansdown
a big plus. Capable of writing clean, maintainable and well-tested code. Comfortable working in on-prem and cloud-native environments with an interest in observability, using tools like Prometheus and Grafana to keep services healthy and maintainable. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, combining testing and
London (City of London), South East England, United Kingdom
Hunter Bond
with infrastructure automation and configuration management tools (Chef, Puppet, or Ansible). Exposure to distributed storage systems and related protocols. Experience with observability and monitoring tools (Elasticsearch, Logstash, Kibana, Datadog, Prometheus, Grafana). Strong written and verbal communication skills. Demonstrated ability to learn quickly and adapt to evolving technologies. Ability to work effectively in a fast-paced, collaborative environment. jhayne@hunterbond.com
London (City of London), South East England, United Kingdom
Prism Digital
management. CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation. Event-Driven Architecture: Kafka or similar messaging systems. Monitoring & Observability: Azure Monitor, OpenTelemetry, Prometheus, etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls. Nice to Haves: Experience in regulated environments (banking, fintech, healthcare). Background in Site Reliability Engineering or DevOps transformation
Back End development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools: Prometheus, Grafana, OpenTelemetry, Datadog, APM tools. Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Please send updated CV quoting availability and
the perfect environment for you. Tech Stack. Cloud: AWS (EC2, RDS, S3, IAM, Lambda, CloudWatch). Containerisation & Orchestration: Docker, Kubernetes (EKS). Infrastructure as Code: Terraform. Configuration Management: Ansible. Monitoring & Observability: Prometheus, Grafana, ELK Stack. CI/CD: GitHub Actions. Scripting & Automation: Python, Bash, or Go. What You’ll Be Doing: Designing and maintaining reliable, scalable, and secure infrastructure for production systems. Automating … Looking For: Strong experience running cloud infrastructure (AWS preferred) in production. Proven background in Kubernetes operations (EKS, Helm, or similar). Solid knowledge of monitoring, alerting, and logging (Grafana, Prometheus, ELK). Hands-on experience with Terraform and CI/CD tooling. Strong scripting or development background (Python, Go, or similar). Excellent troubleshooting skills and a proactive, problem-solving
on experience with Gatling and open-source performance tools. Strong knowledge of CI/CD tools (Jenkins, GitHub Actions, Gradle/Maven). Skilled in monitoring/logging with Prometheus and Grafana. Proficiency in scripting languages (Scala, Python, Shell). This role offers the chance to make an impact on a global platform, work with cutting-edge tech, and collaborate
City of London, London, United Kingdom Hybrid / WFH Options
Understanding Recruitment
/infrastructure engineering role. Strong scripting skills in Python, Bash, or Ruby. Familiarity with configuration management tools (Ansible, Puppet, or Chef). Interest or exposure to observability tools like Datadog, Prometheus, or Grafana. A passion for learning and improving in high-performance environments. This is a rare chance to learn from elite engineers and contribute directly to a platform supporting global
City of London, London, United Kingdom Hybrid / WFH Options
Paradigm Talent
AWS, GCP, or Azure). Familiarity with auth, billing, or subscription systems. Background in 3D graphics, creative tooling, or ML pipelines. Knowledge of observability tools like Grafana, Prometheus, or OpenTelemetry. This is a rare opportunity to join an early-stage team backed by leading deep-tech investors, building the foundation of a platform that fuses AI, creativity, and