Strong scripting skills in Python, Bash, or PowerShell for automation. Understanding of AWS networking concepts, including VPCs, subnets, and security groups. Experience with monitoring and logging tools such as Prometheus, Grafana, ELK Stack, or CloudWatch. Familiarity with Zero Trust security models and best practices for securing cloud workloads. Ability to troubleshoot complex infrastructure issues and optimise cloud deployments. Security Requirements
platform (IAM, secrets management, encryption) • Support compliance initiatives (ISO 27001, NIST, GDPR, MCERTS, etc.) • Manage network configuration, firewalls, and secure endpoints Monitoring & Reliability • Set up observability and monitoring tools (Prometheus, Grafana, Datadog, or CloudWatch) • Ensure high availability, scalability, and cost efficiency of cloud services • Define SLIs, SLOs, and SLAs for platform components • Troubleshoot production issues and coordinate incident response Collaboration
manual processes related to deployment, monitoring, and operations using scripting languages such as Python, Bash, or Go. * Monitoring & Logging: Implement and manage robust monitoring, logging, and alerting solutions (e.g., Prometheus, Grafana, ELK Stack) to proactively identify and resolve system issues. * Collaboration: Work closely with software developers, network engineers, and product managers to troubleshoot issues and optimize performance * Security: Integrate security
Birmingham, Leeds, Liverpool, London (Canary Wharf), United Kingdom Hybrid/Remote Options
UK Health Security Agency
Coding skills in programming/scripting languages such as Python, PowerShell or Bash Understanding of Linux/Unix & Windows systems, networking, and distributed systems Experience with observability tools (e.g., Prometheus, Grafana, Datadog) and alerting systems Understanding of infrastructure automation (e.g., Terraform, Ansible, PowerShell, Helm) Excellent communication and collaboration skills Experience with security best practices Possesses problem-solving skills and the
Docker, Kubernetes) Strong understanding of internet protocols: TCP/IP, DNS, HTTP Awareness of security best practices in web application deployment Experience with monitoring and logging tools (e.g., CloudWatch, Prometheus, Grafana) Experience with web application firewalls like AWS WAF, Cloudflare Desirable skills: Experience with PostgreSQL and MSSQL administration and replication Exposure to Azure cloud services Experience with KongHQ/AWS
Crewe, Cheshire, United Kingdom Hybrid/Remote Options
Applause IT Recruitment Ltd
administration skills Docker and Kubernetes experience Strong understanding of TCP/IP, DNS, HTTP Knowledge of security best practice for web application deployments Experience with monitoring & logging tools (CloudWatch, Prometheus, Grafana) Web application firewall experience (AWS WAF, Cloudflare) Desirable PostgreSQL and MSSQL administration Azure cloud services KongHQ/AWS API Gateway Azure DevOps CloudFront and other CDNs SSL certificate management
DevOps Engineer (AWS, C# .NET, IaC) C# .NET, NuGet, AWS, EC2, EKS, RDS, Aurora, IaC, TCP/IP, DNS, Docker, Kubernetes, CI/CD workflows, Terraform, CDK, CloudWatch, Prometheus, Grafana, AWS WAF Our client is looking for an experienced DevOps Engineer to come in and hit the ground running. You will be joining a small established team in their growing
networking. Collaborate with MLOps and Data Engineers to ensure reproducible, reliable, and auditable infrastructure across environments. Monitor system health, logging, and metrics using platform observability tools (e.g. Azure Monitor, Prometheus, Grafana where applicable). About You: As an experienced DevOps Engineer, your skills and experience will include: Azure Landing Zone principles, vNet design, RBAC, networking, and hybrid connectivity. CI/
with office-hours base in Stockport). Tech We Use Linux: Red Hat/CentOS Automation: Ansible, Terraform Containers: Docker, Kubernetes Cloud: AWS/Azure/GCP Monitoring: Nagios, Grafana, Prometheus Virtualisation: VMware Networking: Firewalls, switching fundamentals
Wigan, Lancashire, England, United Kingdom Hybrid/Remote Options
Searchability
tooling (Go or .NET preferred) * Cloud experience, ideally AWS, and knowledge of container orchestration (Kubernetes) and Infrastructure as Code (Terraform) * Experience with monitoring and observability tools such as Grafana, Prometheus or OpenTelemetry * Strong understanding of networking fundamentals and distributed systems * Ability to collaborate effectively with engineering, operations and product teams TO BE CONSIDERED: Please either apply through this advert or
Wigan, Greater Manchester, North West, United Kingdom Hybrid/Remote Options
Searchability (UK) Ltd
tooling (Go or .NET preferred) * Cloud experience, ideally AWS, and knowledge of container orchestration (Kubernetes) and Infrastructure as Code (Terraform) * Experience with monitoring and observability tools such as Grafana, Prometheus or OpenTelemetry * Strong understanding of networking fundamentals and distributed systems * Ability to collaborate effectively with engineering, operations and product teams TO BE CONSIDERED: Please either apply through this advert or
Knutsford, Cheshire, North West, United Kingdom Hybrid/Remote Options
Anson Mccade
SLOs, incident response, and reliability metrics. Experience supporting and optimising large-scale, high-availability environments. Desirable: Exposure to database standardisation and automation at scale. Familiarity with observability tools (Grafana, Prometheus, AppDynamics, or similar). Strong communication and collaboration skills, with a focus on continuous improvement and shared learning. What's on Offer Salary up to £62,000 plus bonus, medical
and deployment of these services all the way to production in a controlled and secure way. Tech stack - Java engineer needs experience with Spring Boot framework, TDD, Grafana and Prometheus for monitoring and alerting and understanding of the CI/CD process. All candidates must pass a BPSS. Immediate start. End March 2026. Weekly travel to Leeds/Newcastle/Manchester. £400 - £500 per
and deployment of these services all the way to production in a controlled and secure way. Tech stack - Java engineer needs experience with Spring Boot framework, TDD, Grafana and Prometheus for monitoring and alerting and understanding of the CI/CD process. All candidates must pass a BPSS. Immediate start. End March 2026. Weekly travel to Leeds/Newcastle/
Nelson, Lancashire, England, United Kingdom Hybrid/Remote Options
Lorien
on Azure or AWS. Driving Infrastructure as Code (IaC) practices using Terraform. Building and optimising CI/CD pipelines to accelerate delivery. Implementing and maintaining monitoring and observability with Prometheus and Grafana. Enabling team collaboration and incident response through Slack and other ChatOps tools. Leading, mentoring, and supporting engineers (or preparing to step into people management if you're progressing … hands-on experience with Azure or AWS. Solid background with Terraform and IaC. Proven use of CI/CD tools (Jenkins, GitHub Actions, GitLab CI, etc.). Knowledge of Prometheus and Grafana for monitoring. Familiarity with collaboration tools like Slack. Either: Prior management/team lead experience, or A Senior DevOps engineer ready to progress into a managerial role. (Bonus
Wigan, Lancashire, England, United Kingdom Hybrid/Remote Options
Searchability
events. SITE RELIABILITY ENGINEER ESSENTIAL SKILLS At least 2 years' experience working as an SRE Deep understanding of system reliability, scalability and performance tuning Experience with observability tools (Grafana, Prometheus, OpenTelemetry) Proficiency in a programming language such as Go or .NET for automation and debugging Hands-on experience with AWS or another major cloud platform Knowledge of Kubernetes, Terraform, and … to process and submit (subject to required skills) your application to our client in conjunction with this vacancy only. KEY SKILLS SRE, Site Reliability Engineering, AWS, Kubernetes, Terraform, Grafana, Prometheus, OpenTelemetry, Go, .NET, Cloud Infrastructure, Observability, CI/CD, DevOps, Automation, Performance Tuning, Incident Management
Wigan, Greater Manchester, United Kingdom Hybrid/Remote Options
Searchability (UK) Ltd
events. SITE RELIABILITY ENGINEER ESSENTIAL SKILLS At least 2 years' experience working as an SRE Deep understanding of system reliability, scalability and performance tuning Experience with observability tools (Grafana, Prometheus, OpenTelemetry) Proficiency in a programming language such as Go or .NET for automation and debugging Hands-on experience with AWS or another major cloud platform Knowledge of Kubernetes, Terraform, and … to process and submit (subject to required skills) your application to our client in conjunction with this vacancy only. KEY SKILLS SRE, Site Reliability Engineering, AWS, Kubernetes, Terraform, Grafana, Prometheus, OpenTelemetry, Go, .NET, Cloud Infrastructure, Observability, CI/CD, DevOps, Automation, Performance Tuning, Incident Management