City of London, London, United Kingdom Hybrid/Remote Options
Advanced Resource Managers
Linux or Windows administration, with the ability to architect secure, performant, and highly available cloud solutions. Proficiency with monitoring and log analytics tools such as AWS CloudWatch, ELK Stack, Prometheus, Datadog, or New Relic, to maintain observability and ensure operational excellence. Demonstrated leadership skills in managing complex, high-pressure situations and guiding teams through incident resolution. Exceptional communication and presentation More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Signify Technology
Docker, ECS, or Kubernetes). Strong scripting skills in Python , Bash , or similar. Familiarity with Linux administration , networking, and system security. Experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK stack, Datadog). Desirable Skills Exposure to infrastructure security best practices (e.g., CIS Benchmarks, AWS Well-Architected Framework). Knowledge of configuration management (Ansible, Chef, or Puppet). More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Sanderson
SNS, SQS, Lambda, DynamoDB). Drive automation across CI/CD pipelines using tools like GitHub Actions , Terraform , and Argo CD for seamless and secure deployments. Enhance observability using Prometheus , Grafana , Datadog , and CloudWatch , enabling proactive incident prevention. Own incident management and post-mortem practices — guiding the team through challenges calmly and driving meaningful improvement. Collaborate with global engineering and More ❯
or Azure acceptable) Experience working with relational databases in production environments (e.g., Postgres, MySQL), including basic performance troubleshooting, migrations, backups, and access control. Familiarity with observability tools such as Prometheus, Grafana, ELK stack, or OpenTelemetry Experience with container orchestration platforms, particularly Kubernetes Ability to systematically troubleshoot and debug distributed systems Comfortable reading, modifying, and writing code in Python and/ More ❯
pipelines using tools like Azure DevOps, Jenkins, GitHub Actions, or similar. • Deep understanding of cloud infrastructure automation and orchestration. • Experience in monitoring and alerting solutions (e.g., Azure Monitor, Grafana, Prometheus). • Strong problem-solving skills with the ability to take initiative and work independently without micromanagement. • Excellent communication and collaboration skills, capable of working with cross-functional teams. Desirable skills More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Experis UK
Experience with AKS , Docker , and container orchestration. Understanding of networking , security , and governance in Azure. CI/CD best practices and Git branching strategies. Monitoring and observability experience using Prometheus , Grafana , or Azure-native tools . Excellent communication and collaboration skills in a cross-functional Agile environment. Nice to Have Experience with Azure DevSecOps and security scanning tools (e.g., SonarQube More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Harnham
in cloud platforms (GCP, AWS, or Azure), containerization, CI/CD, and infrastructure-as-code Docker; Kubernetes (EKS, GKE, AKS); Jenkins, GitLab CI, or GitHub Actions; Terraform or CloudFormation; Prometheus, Grafana, Datadog, or New Relic; Slurm, Torque, LSF; MPI; Hadoop or Spark;Director of In Experience with high-performance computing, distributed systems, and observability tools Strong communication and executive presence More ❯
City Of Westminster, London, United Kingdom Hybrid/Remote Options
Additional Resources
and data lake/lakehouse environments. Understanding of Agile development methods, CI/CD pipelines, GitHub, and automated testing. Practical experience monitoring live services using tools such as Grafana, Prometheus, or New Relic. This is an excellent opportunity to play a key role in shaping innovative data solutions within a forward-thinking organisation. Important Information: We endeavour to process your More ❯
Westminster, City of Westminster, Greater London, United Kingdom Hybrid/Remote Options
Additional Resources
and data lake/lakehouse environments. Understanding of Agile development methods, CI/CD pipelines, GitHub, and automated testing. Practical experience monitoring live services using tools such as Grafana, Prometheus, or New Relic. This is an excellent opportunity to play a key role in shaping innovative data solutions within a forward-thinking organisation. Important Information: We endeavour to process your More ❯
in Terraform, Ansible, Jenkins, or GitLab CI Knowledge of Kafka, Cassandra, and relational or NoSQL databases Scripting skills in Python, Bash, Go, or Java Familiarity with monitoring tools like Prometheus, Nagios, or Icinga Understanding of networking fundamentals and virtualisation (e.g. VMware) Comfortable with on-call rotations and troubleshooting in live environments 💰 Up to £600 per day (Inside IR35) 📍 London | Hybrid More ❯
with AI/LLM solutions Familiarity with Terraform, Ansible, GitLab CI/CD, or similar tools Exposure to financial services or other highly regulated industries Experience with observability stacks (Prometheus, Grafana, ELK, etc. More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Lorien
cloud-native storage: GCS, S3 and filer solutions. Exposure to data services: Cloud SQL, managed databases, MongoDB. Messaging systems: Kafka, RabbitMQ and EMS; API gateways like Apigee. Observability tools: Prometheus, Grafana, Cloud Monitoring. IAM and secrets management: dynamic secrets, Vault. Test-driven development and automated testing frameworks. Several years of real-world cloud engineering experience. Personal Attributes Curious and growth More ❯
GitLab CI Advanced networking: BGP, OSPF, VLANs, VPNs, MPLS, SD-WAN, and cross-region connectivity Firewall administration (Cisco ASA or similar) and network access control Monitoring, logging, and observability (Prometheus, Grafana, CloudWatch, ECS, etc.) Linux and Windows systems administration IT infrastructure management: Active Directory, Microsoft services, networking, and on-prem systems Security, compliance, and disaster recovery in cloud and hybrid More ❯
City Of London, England, United Kingdom Hybrid/Remote Options
Harrington Starr
in AWS. Strong scripting skills with Python. Solid understanding of containerisation concepts and how they support scalability, isolation, and portability in modern application deployment. Familiarity with monitoring stacks (Grafana, Prometheus, etc.) Working knowledge of CI/CD pipelines and Git-based workflows. Exposure to Terraform and Infrastructure as Code principles. Ready to Take the Next Step? For a confidential conversation More ❯
Confluent Platform, Confluent Cloud managed services, multi-cloud deployments, and Confluent for Kubernetes • Knowledge of data mesh architectures, KRaft migration, and modern event streaming patterns • Exposure to monitoring tools (Prometheus, Grafana, Splunk) • Experience with data lakes, data warehouses, or big data ecosystems Personal Besides the professional qualifications of the candidates we place great importance in addition to various forms personality More ❯
/Lambda or Cloud Run/GKE), containerized with Docker. Own CI/CD (GitHub Actions), IaC (Terraform), logging/metrics/tracing ( OpenTelemetry , CloudWatch/Stackdriver, Grafana/Prometheus), and SLOs . Optimize p95 latency, throughput, and cost ; manage secrets, networking, VPCs, and build resilient retries/backoffs. 15% Collaborate Work closely with design/PM on specs and More ❯
City of London, London, United Kingdom Hybrid/Remote Options
LHH
of cloud platforms including AWS, Azure, and GCP, as well as hybrid/private data centres. Experience with CI/CD tools and technologies such as Kubernetes, Docker, GitOps, Prometheus, Grafana, ELK Stack, and Datadog. Experience leading multi-supplier teams, collaborating in a ‘one-team’ approach, and working within highly regulated environments such as telecoms or government. Excellent analytical, verbal More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Harrington Starr
concepts. Mindset: Pragmatic, customer-focused, and driven by efficiency and automation. Education: Minimum 2:1 degree in a STEM subject or equivalent experience. Desirable: Exposure to observability tooling (Grafana, Prometheus, Mimir). Interest in data platforms or AI-enabled development workflows. Learn More For more information, contact George Harris at Harrington Starr for a confidential conversation, or click “Apply” to More ❯
GCP, or Azure). Experience with relational databases and data processing and query engines (Spark, Trino, or similar). Familiarity with monitoring, observability, and alerting systems for production ML (Prometheus, Grafana, Datadog, or equivalent). Understanding of ML concepts. You don't need to train models, but you should speak the language of Research Engineers and understand their constraints. A More ❯