City of London, London, United Kingdom Hybrid/Remote Options
Advanced Resource Managers
Linux or Windows administration, with the ability to architect secure, performant, and highly available cloud solutions. Proficiency with monitoring and log analytics tools such as AWS CloudWatch, ELK Stack, Prometheus, Datadog, or New Relic, to maintain observability and ensure operational excellence. Demonstrated leadership skills in managing complex, high-pressure situations and guiding teams through incident resolution. Exceptional communication and presentation More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Signify Technology
Docker, ECS, or Kubernetes). Strong scripting skills in Python , Bash , or similar. Familiarity with Linux administration , networking, and system security. Experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK stack, Datadog). Desirable Skills Exposure to infrastructure security best practices (e.g., CIS Benchmarks, AWS Well-Architected Framework). Knowledge of configuration management (Ansible, Chef, or Puppet). More ❯
packaging and deploying AI/ML workloads, particularly LLMs and vector-based apps. Comfortable with containerisation, Git workflows, and scripting (Bash, Python, etc.). Exposure to observability tools like Prometheus, Grafana, Datadog, or ELK. A proactive, detail-oriented approach with strong documentation and communication skills. Comfortable working with JIRA for tickets and sprint rituals. 🌟 Why Join Us? Join a team More ❯
or Azure acceptable) Experience working with relational databases in production environments (e.g., Postgres, MySQL), including basic performance troubleshooting, migrations, backups, and access control. Familiarity with observability tools such as Prometheus, Grafana, ELK stack, or OpenTelemetry Experience with container orchestration platforms, particularly Kubernetes Ability to systematically troubleshoot and debug distributed systems Comfortable reading, modifying, and writing code in Python and/ More ❯
Actions, Jenkins, AWS CodePipeline). Experience with Linux administration , networking concepts, and IAM-based access control. Strong scripting ability (Python, Bash, or PowerShell). Knowledge of monitoring tools (CloudWatch, Prometheus, Grafana, etc.). Desirable Skills: Experience working in public sector, defence, or other secure environments. Exposure to containerisation technologies (Docker, ECS, EKS, Kubernetes). Understanding of Zero Trust, NCSC cloud More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Experis UK
Experience with AKS , Docker , and container orchestration. Understanding of networking , security , and governance in Azure. CI/CD best practices and Git branching strategies. Monitoring and observability experience using Prometheus , Grafana , or Azure-native tools . Excellent communication and collaboration skills in a cross-functional Agile environment. Nice to Have Experience with Azure DevSecOps and security scanning tools (e.g., SonarQube More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Harnham
in cloud platforms (GCP, AWS, or Azure), containerization, CI/CD, and infrastructure-as-code Docker; Kubernetes (EKS, GKE, AKS); Jenkins, GitLab CI, or GitHub Actions; Terraform or CloudFormation; Prometheus, Grafana, Datadog, or New Relic; Slurm, Torque, LSF; MPI; Hadoop or Spark;Director of In Experience with high-performance computing, distributed systems, and observability tools Strong communication and executive presence More ❯
City of London, London, United Kingdom Hybrid/Remote Options
La Fosse
or Azure) using IaC such as Terraform/Bicep Enhancing CI/CD pipelines and release automation (GitHub Actions, Jenkins, GitLab etc.) Designing and implementing monitoring, logging and alerting (Prometheus, Grafana, cloud-native tools) Supporting ML-driven services, serverless functions and data pipelines Improving automation and DevOps best practices across the engineering team Helping define platform architecture as the company More ❯
City Of Westminster, London, United Kingdom Hybrid/Remote Options
Additional Resources
and data lake/lakehouse environments. Understanding of Agile development methods, CI/CD pipelines, GitHub, and automated testing. Practical experience monitoring live services using tools such as Grafana, Prometheus, or New Relic. This is an excellent opportunity to play a key role in shaping innovative data solutions within a forward-thinking organisation. Important Information: We endeavour to process your More ❯
Westminster, City of Westminster, Greater London, United Kingdom Hybrid/Remote Options
Additional Resources
and data lake/lakehouse environments. Understanding of Agile development methods, CI/CD pipelines, GitHub, and automated testing. Practical experience monitoring live services using tools such as Grafana, Prometheus, or New Relic. This is an excellent opportunity to play a key role in shaping innovative data solutions within a forward-thinking organisation. Important Information: We endeavour to process your More ❯
with AI/LLM solutions Familiarity with Terraform, Ansible, GitLab CI/CD, or similar tools Exposure to financial services or other highly regulated industries Experience with observability stacks (Prometheus, Grafana, ELK, etc. More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Digital Saints | Start-up Talent Partner
including cost, resilience and data residency Confidence working with senior stakeholders to make pragmatic, commercial platform decisions Tech environment includes: AWS, Azure, Kubernetes (EKS), ECS, Terraform, Airflow, EFK/Prometheus/Grafana, MySQL/PostgreSQL/Redis/Elasticsearch/Kafka, GitHub Actions, plus services written in Python, Node.js and Go. More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Lorien
cloud-native storage: GCS, S3 and filer solutions. Exposure to data services: Cloud SQL, managed databases, MongoDB. Messaging systems: Kafka, RabbitMQ and EMS; API gateways like Apigee. Observability tools: Prometheus, Grafana, Cloud Monitoring. IAM and secrets management: dynamic secrets, Vault. Test-driven development and automated testing frameworks. Several years of real-world cloud engineering experience. Personal Attributes Curious and growth More ❯
GitLab CI Advanced networking: BGP, OSPF, VLANs, VPNs, MPLS, SD-WAN, and cross-region connectivity Firewall administration (Cisco ASA or similar) and network access control Monitoring, logging, and observability (Prometheus, Grafana, CloudWatch, ECS, etc.) Linux and Windows systems administration IT infrastructure management: Active Directory, Microsoft services, networking, and on-prem systems Security, compliance, and disaster recovery in cloud and hybrid More ❯
City Of London, England, United Kingdom Hybrid/Remote Options
Harrington Starr
in AWS. Strong scripting skills with Python. Solid understanding of containerisation concepts and how they support scalability, isolation, and portability in modern application deployment. Familiarity with monitoring stacks (Grafana, Prometheus, etc.) Working knowledge of CI/CD pipelines and Git-based workflows. Exposure to Terraform and Infrastructure as Code principles. Ready to Take the Next Step? For a confidential conversation More ❯
/Lambda or Cloud Run/GKE), containerized with Docker. Own CI/CD (GitHub Actions), IaC (Terraform), logging/metrics/tracing ( OpenTelemetry , CloudWatch/Stackdriver, Grafana/Prometheus), and SLOs . Optimize p95 latency, throughput, and cost ; manage secrets, networking, VPCs, and build resilient retries/backoffs. 15% Collaborate Work closely with design/PM on specs and More ❯
City of London, London, United Kingdom Hybrid/Remote Options
LHH
of cloud platforms including AWS, Azure, and GCP, as well as hybrid/private data centres. Experience with CI/CD tools and technologies such as Kubernetes, Docker, GitOps, Prometheus, Grafana, ELK Stack, and Datadog. Experience leading multi-supplier teams, collaborating in a ‘one-team’ approach, and working within highly regulated environments such as telecoms or government. Excellent analytical, verbal More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Harrington Starr
concepts. Mindset: Pragmatic, customer-focused, and driven by efficiency and automation. Education: Minimum 2:1 degree in a STEM subject or equivalent experience. Desirable: Exposure to observability tooling (Grafana, Prometheus, Mimir). Interest in data platforms or AI-enabled development workflows. Learn More For more information, contact George Harris at Harrington Starr for a confidential conversation, or click “Apply” to More ❯
GCP, or Azure). Experience with relational databases and data processing and query engines (Spark, Trino, or similar). Familiarity with monitoring, observability, and alerting systems for production ML (Prometheus, Grafana, Datadog, or equivalent). Understanding of ML concepts. You don't need to train models, but you should speak the language of Research Engineers and understand their constraints. A More ❯