Working within cloud environments (AWS, Azure, GCP) for automation and infrastructure management. Exposure to security compliance frameworks (ISO 27001, CIS benchmarks, NIST). Experience with monitoring and observability tools (Prometheus, Grafana, ELK/EFK stacks). Integration of automation platforms with ticketing systems (ServiceNow, Jira). Hands-on work with container security scanning and remediation processes. Experience in disaster recovery
Oldham, Greater Manchester, North West, United Kingdom
Innovative Technology
Docker, Kubernetes) Familiarity with CI/CD systems (GitHub Actions, GitLab CI, Jenkins, etc.) Hands-on experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation) Knowledge of observability tools (Prometheus, Grafana, ELK stack, Datadog, etc.). Solid grasp of Linux systems and networking fundamentals Strong problem-solving and debugging skills Your Package & Perks: A competitive salary Flexible working hours
Liverpool, Lancashire, United Kingdom Hybrid / WFH Options
The Granite Group
Techniques - Hands-on experience with GitOps tools (e.g., ArgoCD, Flux). CI/CD - Skilled in building and managing pipelines using Azure DevOps, GitHub Actions, etc. Monitoring - Experience with Prometheus, Grafana, and other observability tools. Application Stack - Familiarity with .NET, Node.js, React, and web server technologies like Nginx. Relevant certifications or the ability to demonstrate equivalent experience, such as: Terraform
Sheffield, South Yorkshire, United Kingdom Hybrid / WFH Options
itecopeople
IaC) using Terraform and/or CloudFormation CI/CD tools such as GitHub Actions, GitLab CI, Jenkins, or CodePipeline Monitoring and logging with tools like CloudWatch, ELK Stack, Prometheus, Grafana, or similar Scripting in Python, Bash, or similar Good understanding of networking, security groups, load balancers, and general cloud security best practices Typical Tasks: As a key member of
/CD pipelines using tools such as GitLab CI. Significant demonstrable experience in designing and implementing best practices using IaC tools such as Terraform, and monitoring tools such as Prometheus & Grafana. To see the full job description, requirements, candidate pack and apply button please click the link.
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Netcompany UK Limited
Azure Kubernetes Service (AKS), Azure Synapse Analytics, or Azure Cognitive Services Azure certifications, such as Azure Solutions Architect Expert or DevOps Engineer Expert Experience with infrastructure monitoring tools like Prometheus, Grafana, or Azure Monitor at scale Background in implementing disaster recovery and high-availability solutions for critical systems Qualifications Bachelor's or Master's degree in Computer Science, Information Technology
Engineering role Hands-on expertise with AWS, Docker, Kubernetes, and Terraform Strong understanding of CI/CD tools and modern development workflows Experience implementing monitoring and observability tools (e.g., Prometheus, Grafana) Solid grasp of cloud security, IAM, and best practices for operational excellence Proactive problem solver with excellent troubleshooting skills Strong communication and collaboration skills, ideally within Agile teams Comfortable
Newcastle Upon Tyne, Tyne And Wear, United Kingdom
Strive Gaming
Participate in on-call rotations and help troubleshoot production issues. Tech Requirements (must have): IaC - Infrastructure as Code (Terraform) AWS Argo Strong Linux skills ELK/LGTM stack knowledge Prometheus Datadog Grafana Kubernetes Helm Docker Bash/shell scripting Git Strong security mindset Tech (nice to have) CrowdStrike OnPrem/ESXi Windows Server Entra ID
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Gamma Communications plc
with automation, IaC, and CI/CD principles. Understand Network concepts, Infrastructure, and common protocols. Able to write basic scripts for automation. Build dashboards in Grafana and understand Prometheus and PromQL. Knowledge of SDLC and experience integrating solutions into CI pipelines. Experience with cloud (AWS, GCP) is beneficial, but not essential. Able to self-manage Jira tickets and provide
by several microservices, also written in Python, utilising frameworks and libraries such as Celery, Eventlet, SQLAlchemy, etc. Additionally, GOV.UK Notify utilises AWS RDS (Postgres), AWS SQS, AWS ElastiCache, OpenTelemetry, Prometheus, Grafana and other related services. Concourse CI and Terraform are used to run build-pipelines and manage our infrastructure. For the frontend, we follow the GOV.UK Design System, making use of
Manchester, North West, United Kingdom Hybrid / WFH Options
Hays
Strong understanding of networking, virtualisation, and cloud security principles. Operate, maintain, and enhance the Azure Virtual Desktop (AVD) environment. Experience with monitoring and logging tools (e.g., Azure Monitor, CloudWatch, Prometheus). Expert in setting up and managing host pools, session hosts, user access, application layers, and FSLogix profiles. Strong knowledge of cloud architecture, design, and implementation principles and practices. Proficiency
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Hays Specialist Recruitment Limited
Strong understanding of networking, virtualisation, and cloud security principles. Operate, maintain, and enhance the Azure Virtual Desktop (AVD) environment. Experience with monitoring and logging tools (e.g., Azure Monitor, CloudWatch, Prometheus). Expert in setting up and managing host pools, session hosts, user access, application layers, and FSLogix profiles. Strong knowledge of cloud architecture, design, and implementation principles and practices. Proficiency
Bradford, Yorkshire, United Kingdom Hybrid / WFH Options
Yorkshire Building Society Group
in the following: Continuous Integration/Continuous Delivery pipelines - tools such as Jenkins & GitLab Scripting and automation capabilities Modern monitoring skills and best practices using tools such as Grafana, Prometheus, Kibana, Dynatrace Testing frameworks Knowledge of networks and routing. Knowledge of integrations of services utilising different technologies such as PL/SQL, .NET, C#, Java, Spring Boot, Spring Batch Experience of integrating
are ever the same. Essential Skills Solid Unix/Linux skills Experience with Bash, SQL, PHP Comfortable with Apache/Nginx, load balancers (HAProxy), and monitoring tools (Nagios, Grafana, Prometheus) Knowledge of log management (Graylog, Elasticsearch) Familiar with Ansible and GitLab CI/CD Experience using Git/SVN What Sets You Apart Passionate self-starter who loves problem-solving
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
DCS Recruitment
are ever the same. Essential Skills Solid Unix/Linux skills Experience with Bash, SQL, PHP Comfortable with Apache/Nginx, load balancers (HAProxy), and monitoring tools (Nagios, Grafana, Prometheus) Knowledge of log management (Graylog, Elasticsearch) Familiar with Ansible and GitLab CI/CD Experience using Git/SVN What Sets You Apart Passionate self-starter who loves problem-solving
Newcastle Upon Tyne, Tyne and Wear, England, United Kingdom
Nigel Wright Group
stakeholders, end-users and technologists ITIL (or similar) certification (or experience working within an ITIL framework) Strong understanding of application design, relational databases (SQL Server), monitoring and alerting tools (Grafana, Prometheus, VictoriaMetrics), scheduling tools (Control-M), operating systems (Windows/Linux), Kubernetes, cloud platforms (Azure), issue tracking and source control (JIRA, Git, Bitbucket). Interview Process: Coding Challenge – We would
such as IBM Netcool, Moogsoft, BigPanda, PagerDuty, ServiceNow AIOps. Proficiency in Python, and hands-on knowledge of Ansible Automation Platform. Other highly valued skills include: Knowledge of Observability Platforms: Prometheus, Grafana, ELK, Splunk. Experience with integration into ITSM platforms such as ServiceNow. Experience with Kafka. You may be assessed on the key critical skills relevant for success in role, such
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Lorien
modern technologies, with clear progression routes available. Key Requirements: Strong troubleshooting and fault-resolution experience across infrastructure and applications Hands-on experience with monitoring tools such as Instana, Splunk, Prometheus, Grafana, or SolarWinds Confident supporting both Windows and Linux operating systems Experience working in ITIL-aligned support environments Understanding of web hosting technologies (DNS, HTTP/S, SSL Certs, and
orchestration. Support multi-tenancy and environment rationalization to reduce duplication and inefficiency. Define and implement observability standards, including logging, metrics, tracing, and alerting. Use tools like New Relic, Prometheus, and Grafana, alongside building custom instrumentation for key platform services. Drive incident readiness and operational resilience by enabling actionable monitoring and alerting. Drive cloud cost visibility and optimization efforts across … and operating developer platforms and enablement frameworks. Experience with cloud-native technologies, Kubernetes, and Infrastructure as Code (Terraform, Helm, etc.). Strong understanding of observability tooling (especially New Relic, Prometheus, Grafana) and incident response best practices. Familiarity with FinOps, platform cost tracking, and infrastructure efficiency techniques. Excellent communication, leadership, and stakeholder management skills. Attract, hire, and develop talented platform engineers