GCP, or Azure). Experience with relational databases and data processing and query engines (Spark, Trino, or similar). Familiarity with monitoring, observability, and alerting systems for production ML (Prometheus, Grafana, Datadog, or equivalent). Understanding of ML concepts. You don't need to train models, but you should speak the language of Research Engineers and understand their constraints. A More ❯
GCP, or Azure). Experience with relational databases and data processing and query engines (Spark, Trino, or similar). Familiarity with monitoring, observability, and alerting systems for production ML (Prometheus, Grafana, Datadog, or equivalent). Understanding of ML concepts. You don't need to train models, but you should speak the language of Research Engineers and understand their constraints. A More ❯
london (city of london), south east england, united kingdom
algo1
GCP, or Azure). Experience with relational databases and data processing and query engines (Spark, Trino, or similar). Familiarity with monitoring, observability, and alerting systems for production ML (Prometheus, Grafana, Datadog, or equivalent). Understanding of ML concepts. You don't need to train models, but you should speak the language of Research Engineers and understand their constraints. A More ❯
Services Expert knowledge of Containerisation with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform History of working across CI/CD pipelines Monitoring and Observability experience with Prometheus, Grafana, and/or DataDog Prior experience overseeing Change and Incident Management processes Previous work in an Architectural capacity is also a massive bonus This position is open to Lead More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Oliver Bernard
Services Expert knowledge of Containerisation with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform History of working across CI/CD pipelines Monitoring and Observability experience with Prometheus, Grafana, and/or DataDog Prior experience overseeing Change and Incident Management processes Previous work in an Architectural capacity is also a massive bonus This position is open to Lead More ❯
Trivy, Checkov, SonarQube) into automated workflows Manage authentication, access control, and secrets using Vault, AWS Secrets Manager, OAuth2.0, and Zero Trust principles Monitor environments with ELK Stack, Splunk, and Prometheus to ensure visibility, auditing, and compliance Collaborate with engineering, operations, and security teams to promote DevSecOps best practices Key Skills & Experience Strong background in cloud platforms, particularly AWS and Kubernetes More ❯
Trivy, Checkov, SonarQube) into automated workflows Manage authentication, access control, and secrets using Vault, AWS Secrets Manager, OAuth2.0, and Zero Trust principles Monitor environments with ELK Stack, Splunk, and Prometheus to ensure visibility, auditing, and compliance Collaborate with engineering, operations, and security teams to promote DevSecOps best practices Key Skills & Experience Strong background in cloud platforms, particularly AWS and Kubernetes More ❯
Trivy, Checkov, SonarQube) into automated workflows Manage authentication, access control, and secrets using Vault, AWS Secrets Manager, OAuth2.0, and Zero Trust principles Monitor environments with ELK Stack, Splunk, and Prometheus to ensure visibility, auditing, and compliance Collaborate with engineering, operations, and security teams to promote DevSecOps best practices Key Skills & Experience Strong background in cloud platforms, particularly AWS and Kubernetes More ❯
london (city of london), south east england, united kingdom
Damia Group
Trivy, Checkov, SonarQube) into automated workflows Manage authentication, access control, and secrets using Vault, AWS Secrets Manager, OAuth2.0, and Zero Trust principles Monitor environments with ELK Stack, Splunk, and Prometheus to ensure visibility, auditing, and compliance Collaborate with engineering, operations, and security teams to promote DevSecOps best practices Key Skills & Experience Strong background in cloud platforms, particularly AWS and Kubernetes More ❯
Key Details: Salary: £100k–£180k (flexible for strong profiles) + equity Working Model: On-site, London Tech Stack: AWS/GCP/Azure, Kubernetes, Docker, Terraform, Python, MLflow/Prometheus/Grafana If you want to shape the backbone of one of Europe’s most ambitious AI startups, we’d love to hear from you. More ❯
Key Details: Salary: £100k–£180k (flexible for strong profiles) + equity Working Model: On-site, London Tech Stack: AWS/GCP/Azure, Kubernetes, Docker, Terraform, Python, MLflow/Prometheus/Grafana If you want to shape the backbone of one of Europe’s most ambitious AI startups, we’d love to hear from you. More ❯
Key Details: Salary: £100k–£180k (flexible for strong profiles) + equity Working Model: On-site, London Tech Stack: AWS/GCP/Azure, Kubernetes, Docker, Terraform, Python, MLflow/Prometheus/Grafana If you want to shape the backbone of one of Europe’s most ambitious AI startups, we’d love to hear from you. More ❯
london (city of london), south east england, united kingdom
Harnham
Key Details: Salary: £100k–£180k (flexible for strong profiles) + equity Working Model: On-site, London Tech Stack: AWS/GCP/Azure, Kubernetes, Docker, Terraform, Python, MLflow/Prometheus/Grafana If you want to shape the backbone of one of Europe’s most ambitious AI startups, we’d love to hear from you. More ❯
control (802.1x, RADIUS), or zero-trust security concepts. Exposure to infrastructure-as-code (Terraform, Ansible) and version control systems (Git). Experience with monitoring and observability tools (LogicMonitor, Grafana, Prometheus). Knowledge of hybrid cloud networking, including AWS Direct Connect or GCP Interconnect. Relevant certifications such as CCNP, AWS Advanced Networking Specialty, or Google Cloud Network Engineer. More ❯
Hands-on experience in technical integrations and POCs Comfortable coding in any high-level programming language (Java, Go, Python) Strong hands-on knowledge of Kubernetes, AWS, Azure, GCP, Docker, Prometheus, and OpenTelemetry Industry knowledge and opinions on Monitoring, Observability, Log Management, SIEM Engineering/DevOps Background - advantage Experience in Technical Sales of Log Analytics/Monitoring/APM/SIEM More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Hargreaves Lansdown
a big plus. Capable of writing clean, maintainable and well-tested code. Comfortable working in on-prem and cloud-native environments with an interest in observability, using tools like Prometheus and Grafana to keep services healthy and maintainable. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, combining testing and More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Hargreaves Lansdown
a big plus. Capable of writing clean, maintainable and well-tested code. Comfortable working in on-prem and cloud-native environments with an interest in observability, using tools like Prometheus and Grafana to keep services healthy and maintainable. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, combining testing and More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Hargreaves Lansdown
a big plus. Capable of writing clean, maintainable and well-tested code. Comfortable working in on-prem and cloud-native environments with an interest in observability, using tools like Prometheus and Grafana to keep services healthy and maintainable. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, combining testing and More ❯
London, England, United Kingdom Hybrid / WFH Options
Cint
Kubernetes, Docker, Packer, Ansible and Jenkins. We support applications and services written in Golang, Python, Java, Scala and .Net. We monitor and alert on everything we deploy via Grafana, Prometheus, Graphite and ELK stacks. The team holds itself accountable to a high standard of build quality. We have recently completed the first major phase of a completely green-field infrastructure … GitHub Actions etc.) You have a grasp of “cloud native” and 12-Factor applications You have good knowledge of monitoring and alerting using one or more of: Graphite, Statsd, Prometheus, Grafana, PagerDuty You have expertise in at least one scripting or programming language (Python, Bash, Ruby, Node, Golang, Java) Bonus Points If You Have You have good knowledge of the More ❯
with infrastructure automation and configuration management tools (Chef, Puppet, or Ansible) Exposure to distributed storage systems and related protocols Experience with observability and monitoring tools (Elasticsearch, Logstash, Kibana, Datadog, Prometheus, Grafana) Strong written and verbal communication skills Demonstrated ability to learn quickly and adapt to evolving technologies Ability to work effectively in a fast-paced, collaborative environment jhayne@hunterbond.com More ❯
with infrastructure automation and configuration management tools (Chef, Puppet, or Ansible) Exposure to distributed storage systems and related protocols Experience with observability and monitoring tools (Elasticsearch, Logstash, Kibana, Datadog, Prometheus, Grafana) Strong written and verbal communication skills Demonstrated ability to learn quickly and adapt to evolving technologies Ability to work effectively in a fast-paced, collaborative environment jhayne@hunterbond.com More ❯
with infrastructure automation and configuration management tools (Chef, Puppet, or Ansible) Exposure to distributed storage systems and related protocols Experience with observability and monitoring tools (Elasticsearch, Logstash, Kibana, Datadog, Prometheus, Grafana) Strong written and verbal communication skills Demonstrated ability to learn quickly and adapt to evolving technologies Ability to work effectively in a fast-paced, collaborative environment jhayne@hunterbond.com More ❯
london (city of london), south east england, united kingdom
Hunter Bond
with infrastructure automation and configuration management tools (Chef, Puppet, or Ansible) Exposure to distributed storage systems and related protocols Experience with observability and monitoring tools (Elasticsearch, Logstash, Kibana, Datadog, Prometheus, Grafana) Strong written and verbal communication skills Demonstrated ability to learn quickly and adapt to evolving technologies Ability to work effectively in a fast-paced, collaborative environment jhayne@hunterbond.com More ❯
management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site Reliability Engineering or DevOps transformation More ❯
management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site Reliability Engineering or DevOps transformation More ❯