healing systems etc.) Database administration Infrastructure provisioning Process automation Respond to change requests Skills & Experience Oracle DB Docker (with Docker Swarm) Elastic Stack Typescript/React/Node Go Prometheus/Grafana ESRI Maps Ansible Windows & Linux Jenkins Automation skills: Automation is a key skill domain for this role. Specific automation skills are: Continuous Integration - Skilled in the tooling and More ❯
Azure Kubernetes Service (AKS), Azure Synapse Analytics, or Azure Cognitive Services Azure certifications, such as Azure Solutions Architect Expert or DevOps Engineer Expert Experience with infrastructure monitoring tools like Prometheus, Grafana, or Azure Monitor at scale Background in implementing disaster recovery and high-availability solutions for critical systems Qualifications Bachelor's or Master's degree in Computer Science, Information Technology More ❯
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Netcompany UK Limited
Azure Kubernetes Service (AKS), Azure Synapse Analytics, or Azure Cognitive Services Azure certifications, such as Azure Solutions Architect Expert or DevOps Engineer Expert Experience with infrastructure monitoring tools like Prometheus, Grafana, or Azure Monitor at scale Background in implementing disaster recovery and high-availability solutions for critical systems Qualifications Bachelor's or Master's degree in Computer Science, Information Technology More ❯
Gloucester, Gloucestershire, United Kingdom Hybrid / WFH Options
Navtech, Inc
and Git for version control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS Cloudwatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, Cloudbees, Jenkins, GitHub Actions), containerization with Docker, and monitoring/logging tools. Demonstrated experience More ❯
Preferred Qualifications Experience in hybrid cloud environments and integration with on-premise systems. Background in DevOps, SRE, or Infrastructure Engineering. Knowledge of monitoring/logging tools (e.g., CloudWatch, Datadog, Prometheus, ELK). Experience with enterprise security and compliance frameworks (e.g., ISO 27001, SOC 2, GDPR). Familiarity with cost modeling and optimization strategies in AWS. More ❯
pipelines and lead Infrastructure as Code (Terraform, CloudFormation). Implement DevSecOps best practices to meet HIPAA, SOC 2, and ISO 27001 requirements. Monitor system performance and availability using CloudWatch, Prometheus, Grafana, and related tooling. Collaborate with engineering, security, and product teams to drive end to end reliability. Qualifications Experience 6+ years of DevOps/SRE experience in cloud environments (AWS More ❯
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
British Veterinary Association
using Terraform Build and maintain CI/CD pipelines (e.g., GitLab CI, GitHub Actions, Jenkins) Operate and scale containerised workloads with Docker and Kubernetes (EKS) Implement observability solutions including Prometheus , Grafana , and ELK Drive platform consistency across multiple regions and teams Collaborate with engineers, security, and product managers to deliver robust systems Promote automation and the "platform as a product More ❯
Sheffield, South Yorkshire, Yorkshire, United Kingdom
Experis
Familiarity with CI/CD tools (e.g., Jenkins, GitLab CI, GitHub Actions). Experience with Linux systems, networking, and containerization (Docker). Understanding of monitoring and tracing tools (e.g., Prometheus, Jaeger, Grafana). Excellent problem-solving and communication skills. Preferred Qualifications: Experience contributing to OpenTelemetry or other open-source observability projects. Familiarity with enterprise environments (e.g., VMware, bare-metal servers More ❯
Familiarity with CI/CD tools (eg, Jenkins, GitLab CI, GitHub Actions). Experience with Linux systems, networking, and containerization (Docker). Understanding of monitoring and tracing tools (eg, Prometheus, Jaeger, Grafana). Excellent problem-solving and communication skills. Preferred Qualifications: Experience contributing to OpenTelemetry or other open-source observability projects. Familiarity with enterprise environments (eg, VMware, bare-metal Servers More ❯
handsworth, yorkshire and the humber, united kingdom
Experis
Familiarity with CI/CD tools (e.g., Jenkins, GitLab CI, GitHub Actions). Experience with Linux systems, networking, and containerization (Docker). Understanding of monitoring and tracing tools (e.g., Prometheus, Jaeger, Grafana). Excellent problem-solving and communication skills. Preferred Qualifications: Experience contributing to OpenTelemetry or other open-source observability projects. Familiarity with enterprise environments (e.g., VMware, bare-metal servers More ❯
Groovy/Jenkins/Golang Provisioning software/frameworks (Elasticsearch/Spark/Hadoop/PostgreSQL) Infrastructure Management - CasC, IasC (Ansible, Terraform, Packer) Log and metric aggregation with Fluentd, Prometheus, Grafana, Alertmanager Public Cloud, primarily GCP, but also AWS and Azure More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with cloud platforms such as AWS, Azure, or GCP, including infrastructure as code tools like Terraform or CloudFormation. Strong scripting and automation skills, with More ❯
storage infrastructure configuration and deployment. Develop Infrastructure-as-Code (IaC) solutions (e.g., using Terraform, Ansible) for scalable and repeatable storage provisioning. Integrate monitoring dashboards and alerting systems (e.g., Grafana, Prometheus, ELK) to ensure visibility into storage health and performance. Collaborate with infrastructure, platform, and cloud teams to align automation with operational goals. Ensure solutions meet enterprise standards for security , resilience More ❯
to low-code platforms (e.g., Retool) for rapid application development. Experience in DevOps practices, including infrastructure-as-code (IaC), monitoring, alerting, and incident management. Familiarity with observability tools (Grafana, Prometheus) and APM tools (New Relic, Datadog). Knowledge of microservices architecture, event-driven design, and scalability best practices. Experience implementing data compliance standards (GDPR, ISO 27001). Find.co is an More ❯
for improvement Take pride in building and operating scalable, reliable, secure systems Are comfortable with ambiguity and rapid change Preferred skills and experience: Familiar with monitoring tools such as Prometheus, Grafana, or similar 5+ years building core infrastructure Experience running inference clusters at scale Experience operating orchestration systems such as Kubernetes at scale Benefits & perks (UK full-time employees): Generous More ❯
troubleshooting and scripting languages such as Python, Go, or Bash. Experience with Kubernetes security, including workload isolation, RBAC, and network policies, containerisation, orchestration, and Kubernetes observability tools (e.g., Falco, Prometheus, Grafana). Experience with infrastructure-as-code and configuration management tools (e.g., Terraform, Helm, ArgoCD). United Kingdom Security Vetting Developed Vetting (DV) clearance. Preferred qualifications: Certifications in Security (e.g. More ❯
Salford, Manchester, United Kingdom Hybrid / WFH Options
Lloyds Bank plc
SLOs, error budgets, and incident response. Experience with infrastructure as code (e.g., Terraform, Deployment Manager) and CI/CD pipelines. Proficiency in monitoring, logging, and observability tools (e.g., Stackdriver, Prometheus, Grafana). Knowledge of Linux systems, networking, and cloud security best practices. It would be great if you also had Experience working in DevOps environments, with a focus on automation More ❯
Newcastle Upon Tyne, Tyne And Wear, United Kingdom
Strive Gaming
Participate in on-call rotations and help troubleshoot production issues. Tech Requirements (must have): IAC - Infrastructure as Code (Terraform) AWS Argo Strong linux skills ELK/LGTM stack knowledge Prometheus DataDog Grafana Kubernetes Helm Docker Bash/shell scripting Git Strong security mindset Tech (nice to have) Crowdstrike OnPrem/ESXI Windows Server EntraID More ❯
shape how platform engineering is done as the team continues to scale. Tech stack AWS (Core services - EC2, RDS, S3, IAM, etc.) Configuration Management Ansible Monitoring and Observability Grafana, Prometheus Kubernetes (building and managing production clusters) Terraform (IaC provisioning) GitHub Actions (CI/CD pipelines) What They’re Looking For Experience in AWS cloud infrastructure (ideally in a regulated or More ❯
Birmingham, West Midlands (County), United Kingdom
Syntax Consultancy Ltd
automation skills. Thorough knowledge of Jenkins and pipeline using Groovy script. Experience with Docker containers and Amazon Linux 2023 AMI. Experience with system monitoring tools (e.g., Grafana, Alert Manager, Prometheus, and Node exporter). Experience with Git, Jira, Confluence, and ServiceNow for incident and change management. Desired Skills and Experience: Hands-on DevOps delivery experience working on Digital or Technology More ❯
Birmingham, West Midlands (County), United Kingdom
Syntax Consultancy Ltd
automation skills. Thorough knowledge of Jenkins and pipeline using Groovy script. Experience with Docker containers and Amazon Linux 2023 AMI. Experience with system monitoring tools (e.g. Grafana, Alert Manager, Prometheus, Node exporter ). Experience with Git, Jira, Confluence, and ServiceNow for incident and change management. Desired Skills and Experience: Hands-on DevOps delivery experience working on Digital or Technology projects More ❯
environments and deploying with Helm Strong skills in Terraform, CI/CD automation, GitOps, and containerization (e.g., Docker) Proficient in scripting (Bash, PowerShell, Python) and using monitoring tools like Prometheus and Azure Monitor Proven track record managing Azure landing zones with enterprise governance and security controls Solid understanding of cloud security frameworks (CIS, NIST) and Azure tools like Key Vault More ❯
/Desktop environments Proficiency in scripting with Bash, PowerShell, and Ansible; Python experience is a plus Familiarity with virtualisation platforms, containerisation, and orchestration tools Experience with monitoring stacks (e.g., Prometheus, Grafana, ELK/EFK) Ability to troubleshoot complex issues using a structured, methodical approach Excellent written and visual communication skills; able to produce clear documentation and diagrams Highly organised and More ❯