infrastructure, and automation. Strong knowledge of CI/CD tooling, IaC, and cloud-native technologies. Advanced scripting (Bash, Python) and automation experience. Skilled in monitoring and observability tools (e.g., Prometheus, Grafana, ELK). Strong problem-solving, communication, and leadership skills. Familiarity and Experience of CI/CD Tools: Jenkins, GitLab CI Infrastructure as Code: Terraform, Ansible, Helm Cloud Platforms: AWS More ❯
Sunbury-On-Thames, London, United Kingdom Hybrid / WFH Options
BP Energy
using Amazon Web Services or Microsoft Azure. Skilled with infrastructure tools like Kubernetes, Istio, EKS, Kafka. Experience in Terraform, Ansible, Puppet, Chef, for infrastructure as code, monitoring tools (e.g., Prometheus, Grafana) and logging systems (e.g., ELK stack). Skilled in the understanding of using core cloud application infrastructure services including identity platforms, networking, storage, databases, containers, and serverless. Skilful knowledge More ❯
Reading, Berkshire, South East, United Kingdom Hybrid / WFH Options
Ignite Digital Search Ltd
Scripting expertise (Python, Bash, PowerShell) Highly Valued: Experience in regulated industries (healthcare, financial services, life sciences) AWS cost management and FinOps experience Monitoring tools expertise (CloudWatch, Datadog, New Relic, Prometheus) Security and compliance framework knowledge Experience with observability and APM solutions Why This Opportunity Stands Out: Real Impact - Your work directly improves healthcare outcomes Growth Trajectory - Join a scaling company More ❯
MySQL, MongoDB, PostgreSQL Experience with containers and orchestration: Docker; Kubernetes networking and service mesh; RKE2 Experience with streaming protocols: RTMP, WebRTC, HLS, MPEG-DASH Experience with monitoring/telemetry: Prometheus, Grafana, Alertmanager, Thanos; ELK; Jaeger; custom exporters Experience with CI/CD and automation: Jenkins, GitLab, Sonatype Nexus, Terraform, Ansible, Argo CD Experience with cloud platforms: AWS (primary), with GCP More ❯
storage infrastructure configuration and deployment. Develop Infrastructure-as-Code (IaC) solutions (e.g., using Terraform, Ansible) for scalable and repeatable storage provisioning. Integrate monitoring dashboards and alerting systems (e.g., Grafana, Prometheus, ELK) to ensure visibility into storage health and performance. Collaborate with infrastructure, platform, and cloud teams to align automation with operational goals. Ensure solutions meet enterprise standards for security , resilience More ❯
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
esure Group
and resilience. Qualifications What we’d love you to bring: Experience of AWS (particularly EC2, EKS, Lambda, S3, IAM, etc) Monitoring/alerting tools (for example we use Grafana, Prometheus, Loki, CloudWatch and Dynatrace) Knowledge of monitoring best practices for a variety of different platforms and technologies Docker and Kubernetes Git/Gitlab Jenkins/CI/CD/ArgoCD More ❯
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
esure Group
required. Qualifications What we’d love you to bring: Deep experience of AWS (particularly EC2, EKS, Lambda, S3, IAM, etc) Monitoring/alerting tools (for example we use Grafana, Prometheus, Loki, CloudWatch and Dynatrace) SME on monitoring best practices for a variety of different platforms and technologies Docker and Kubernetes Git/Gitlab Jenkins/CI/CD/ArgoCD More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Harnham - Data & Analytics Recruitment
Scale-up Experience Strong cloud skills (AWS, GCP, Azure) and containerisation (Docker, Kubernetes) Experience in automating deployments and orchestrating cloud environments Nice to have: Python (Jupyter, PyTorch), monitoring tools (Prometheus, Grafana), cloud databases (RDS, Aurora, Spanner), CI/CD tools (CircleCI), and data visualisation experience. This is a unique opportunity to join a visionary team redefining AI in 3D , with More ❯
Employment Type: Full-Time
Salary: £140,000 - £160,000 per annum, Inc benefits
london, south east england, united kingdom Hybrid / WFH Options
Searchability NS&D
with Kubernetes, Docker, Helm Proficient in Terraform, CI/CD Pipelines (Drone/GitLab) Excellent understanding of Kafka internals, stream processing, and secure Kafka deployments Strong experience across monitoring (Prometheus, Grafana, CloudWatch) Knowledge of security hardening, IAM, WAF, Shield, Vault Working knowledge of Agile, Infrastructure-as-Code, and DevSecOps practices UK*C or Enhanced DV (eDV) Clearance is a must More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Searchability NS&D
with Kubernetes, Docker, Helm Proficient in Terraform, CI/CD Pipelines (Drone/GitLab) Excellent understanding of Kafka internals, stream processing, and secure Kafka deployments Strong experience across monitoring (Prometheus, Grafana, CloudWatch) Knowledge of security hardening, IAM, WAF, Shield, Vault Working knowledge of Agile, Infrastructure-as-Code, and DevSecOps practices UK*C or Enhanced DV (eDV) Clearance is a must More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Searchability NS&D
with Kubernetes, Docker, Helm Proficient in Terraform, CI/CD Pipelines (Drone/GitLab) Excellent understanding of Kafka internals, stream processing, and secure Kafka deployments Strong experience across monitoring (Prometheus, Grafana, CloudWatch) Knowledge of security hardening, IAM, WAF, Shield, Vault Working knowledge of Agile, Infrastructure-as-Code, and DevSecOps practices UK*C or Enhanced DV (eDV) Clearance is a must More ❯
Docker, Kubernetes). Expertise in managing and optimizing cloud-based systems at scale. Preferred Skills Familiarity with Python (Jupyter) and ML frameworks (e.g., PyTorch). Knowledge of monitoring tools (Prometheus, Grafana). Experience with cloud-based databases (RDS, Aurora, Redshift, Cloud SQL, etc.) and visualization tools (QuickSight, Superset). Understanding of CI/CD pipelines and tools (e.g., CircleCI). More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Harnham - Data & Analytics Recruitment
observability, and cost optimisation Nice to Have Experience with ML tooling (MLflow, Kubeflow) Knowledge of FastAPI , Databricks, or Snowflake Exposure to SRE practices or cloud security certifications Familiarity with Prometheus , Grafana , or Datadog Interested? If you want to be part of a world-class AI team at an early stage-where your infrastructure decisions will directly shape the company's More ❯
and live data visualisation Collaborate with QA and DevOps to enhance automated testing and deployment pipelines Lead efforts in securing, scaling, and monitoring the frontend environment Use observability tools (Prometheus, Grafana, Loki) to monitor UI health and performance Drive UI architectural decisions, performance benchmarking, and best practice implementation Skills and Experience Required Degree in Computer Science, Engineering, or a related More ❯
issues Support Kubernetes/OpenShift environments and application deployments Enable developers through onboarding and technical support Maintain and improve CI/CD pipelines (Tekton, Argo CD) Monitor systems using Prometheus, Grafana, Splunk, Loki, and EFK Automate infrastructure provisioning using scripting and IaC tools Collaborate with vendors and internal teams for issue resolution What You'll Bring Strong Linux (Red Hat More ❯
the production environment runs smoothly. Develops maintenance requirements and procedures. Monitoring and Observability: Monitors servers, applications and clusters for failures, system crashes and resource usage, etc using tools like Prometheus, Grafana or Elastic Stack (Elastic Search, Logstash and Kibana). FURTHER DUTIES WILL INCLUDE: Improve monitoring on our application servers which we are currently lacking. Re-implement high availability on … development CI/CD tools such as Version Control Systems (SVN or Git), Jira, GitLab, or Jenkins Experience in using configuration management, monitoring and logging tools such as Ansible, Prometheus, Grafana or Elastic Stack (Elastic Search, Logstash and Kibana). Extensive experience with Windows and Linux operating system environments Experience with infrastructure scripting solutions such as Linux and/or More ❯
NW10, Middlesex, Greater London, United Kingdom Hybrid / WFH Options
ITH Pharma
the production environment runs smoothly. Develops maintenance requirements and procedures. Monitoring and Observability: Monitors servers, applications and clusters for failures, system crashes and resource usage, etc using tools like Prometheus, Grafana or Elastic Stack (Elastic Search, Logstash and Kibana). FURTHER DUTIES WILL INCLUDE: Improve monitoring on our application servers which we are currently lacking. Re-implement high availability on … development CI/CD tools such as Version Control Systems (SVN or Git), Jira, GitLab, or Jenkins Experience in using configuration management, monitoring and logging tools such as Ansible, Prometheus, Grafana or Elastic Stack (Elastic Search, Logstash and Kibana). Extensive experience with Windows and Linux operating system environments Experience with infrastructure scripting solutions such as Linux and/or More ❯
their integration with Azure DevOps, Monday. com, Teams etc. Integrating Azure SSO/RBAC into all the collaboration and development tools. Implement monitoring, logging, and alerting using tools like Prometheus, Grafana, or ELK Manage containerised services using Docker and Kubernetes (EKS) Candidate would also be required to add features and support existing system. Skills: Proven hands-on experience with Microsoft … Azure DevOps, GitHub Actions, etc.) Git-based workflow (PR, Merges, Jira status, CI and then CD). IaC (Terraform, ARM) Scripting (Python, PowerShell, Bash) Monitoring and alerting tools (e.g. Prometheus, Grafana, Azure Monitor). Collaboration tools (Jira and Monday. com). Integration tools like Power Automate, Slack etc. Good to have Programming language (C#/.Net/Python More ❯
Position Summary We are looking for an experienced Systems Engineer with strong Linux and Kubernetes experience to join our Group Engineering - Systems team. You will help design, build and operate modern infrastructure platforms that support continually evolving applications and services. More ❯