in data migration between different storage systems Knowledge of blue-green deployment or other zero-downtime deployment strategies -3 years' monitoring and logging experience Experience with tools like Prometheus, Grafana, ELK stack -3 years' agile methodologies experience Experience working as a Scrum team member -3 years' soft skills experience Experience working with the Agile software development methodologies and collaborating between More ❯
including vulnerability management and compliance. Collaborate with development and operations teams to improve system performance and scalability. Maintain and improve logging, monitoring, and alerting systems using tools like Prometheus, Grafana, ELK Stack, or Datadog Support and optimize infrastructure for both Linux and Windows-based environments. Participate in incident management, problem resolution, and root cause analysis. Ensure documentation of infrastructure, processes More ❯
Southampton, Hampshire, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through More ❯
Hampshire, England, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through More ❯
Hedge End, England, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through More ❯
London, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through More ❯
London, England, United Kingdom Hybrid / WFH Options
Tes
environment. Security Best Practices: Strong understanding of security frameworks and compliance standards for cloud infrastructure and DevOps processes. Monitoring & Observability: Understanding of monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK) to ensure system performance and issue tracking. Skills CI/CD Tools: Hands-on experience with Jenkins, GitLab CI/CD, Travis CI, or similar tools for building CI More ❯
Integration services such as messaging and streams. Building RESTful API Services. Containerization, Kubernetes, serverless functions. Microservices and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). Automation Scripting (using Scripting languages such as Terraform, Ansible, etc.). Experience working with Continuous Integration (CI), Continuous Delivery (CD), and continuous testing tools. Experience working within an Agile More ❯
automating data processing tasks. Experience with CI/CD tools (GitHub Actions, Jenkins, AWS CodePipeline), and integrating data-centric workflows. Familiarity with monitoring and logging tools (e.g., Prometheus, Loki, Grafana) in application and data-intensive environments. Proficiency in Configuration Management tools (Chef, Puppet, Ansible) and data orchestration tools (e.g., Airflow, Prefect). Strong background in containerization using Docker and orchestration More ❯
field) - 5-7 years' experience as a cybersecurity network engineer (or similar) - Cloud certifications (e.g., AWS Certified Solutions Architect, Azure Administrator Associate). - Experience with monitoring tools (e.g., Prometheus, Grafana, CloudWatch). - Knowledge of serverless computing and microservices architecture. - Experience with hybrid cloud or multi-cloud environments. - Proficiency with at least one major cloud platform (AWS, Azure, GCP). - Experience More ❯
London, England, United Kingdom Hybrid / WFH Options
ZigZag Global
scripting and automation using languages like PowerShell, Bash, or Python. Hands-on experience with CI/CD tools like Azure DevOps, GitHub Actions or GitLab CI. Practical experience with Grafana, Prometheus and/or other monitoring tools. Solid understanding of networking, security, and compliance principles. Excellent problem-solving and troubleshooting skills. Strong communication and collaboration skills, with the ability to More ❯
Use Terraform, AWS CDK, or CloudFormation to automate cloud resource provisioning, enabling consistent and repeatable infrastructure deployments. Monitoring & Observability: Implement monitoring, logging, and alerting solutions using tools like Prometheus, Grafana, Loki, Datadog, or CloudWatch to ensure system health and performance. Security & Compliance: Implement security best practices for cloud infrastructure, including IAM policies, security groups, and VPC configurations, to ensure compliance … Go. Experience with CI/CD tools such as Jenkins, GitLab CI, or AWS CodePipeline for automated deployment and testing. Familiarity with monitoring and logging tools such as Prometheus, Grafana, Loki, or Datadog. Strong understanding of cloud security best practices and IAM management. Excellent problem-solving and troubleshooting skills with the ability to resolve complex infrastructure and application issues. Strong More ❯
Sheffield, England, United Kingdom Hybrid / WFH Options
Undisclosed
Integration services such as messaging and streams. Building RESTful API Services. Containerisation, Kubernetes, serverless functions. Microservices, and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). Automation scripting (using scripting languages such as Terraform, Ansible etc.). Experience of working with Continuous Integration (CI), Continuous Delivery (CD) and continuous testing tools. Experience working within an More ❯
Sheffield, Yorkshire, United Kingdom Hybrid / WFH Options
Experis - ManpowerGroup
Integration services such as messaging and streams. Building RESTful API Services. Containerisation, Kubernetes, serverless functions. Microservices, and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). Automation scripting (using scripting languages such as Terraform, Ansible etc.). Experience of working with Continuous Integration (CI), Continuous Delivery (CD) and continuous testing tools. Experience working within an More ❯
Sheffield, South Yorkshire, United Kingdom Hybrid / WFH Options
Experis
Integration services such as messaging and streams. Building RESTful API Services. Containerisation, Kubernetes, serverless functions. Microservices, and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). Automation scripting (using scripting languages such as Terraform, Ansible etc.). Experience of working with Continuous Integration (CI), Continuous Delivery (CD) and continuous testing tools. Experience working within an More ❯
Experience in migrating monolithic applications into microservices architectures. In-depth Linux/Unix experience, emphasizing system performance tuning and automation. Familiarity with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Loki, OTel, ELK stack) to ensure system reliability and performance. Experience in developing and working with backend applications technologies (e.g. Express, Django). Benefits we offer: 23 days’ holiday + More ❯
London, England, United Kingdom Hybrid / WFH Options
Quaisr Limited
or HashiCorp Nomad. Excellent problem-solving, communication, and collaboration skills. Nice to have: Experience managing distributed systems, microservices, and event-driven architectures. Knowledge of observability tools such as Prometheus, Grafana, ELK Stack, or Datadog. Experience with security best practices, monitoring, and incident response. Familiarity with DevSecOps and compliance frameworks (ISO 27001, SOC 2, GDPR). Exposure to big data processing More ❯
London, England, United Kingdom Hybrid / WFH Options
Global Screening Services
Take strategic direction and own end-to-end delivery of solutions. Expert knowledge of SRE fundamentals and a commitment to best practice. Fluency with common observability tooling like Prometheus, Grafana, OTel and CloudWatch. Experience analysing and building data telemetry, querying (PromQL), modelling, pipelines and dashboards to provide concise, focused insights and alerts for distributed systems. Strong experience with Python and More ❯
London, England, United Kingdom Hybrid / WFH Options
ZigZag Global
scripting and automation using languages like PowerShell, Bash, or Python Hands-on experience with CI/CD tools like Azure DevOps, GitHub Actions or GitLab CI Practical experience with Grafana, Prometheus and/or other monitoring tools Solid understanding of networking, security, and compliance principles Excellent problem-solving and troubleshooting skills Strong communication and collaboration skills, with the ability to More ❯
At Protegrity, we lead innovation by using AI and quantum-resistant cryptography to transform data protection across cloud-native, hybrid, on-premises, and open source environments. We leverage advanced cryptographic methods such as tokenization, format-preserving encryption, and quantum-resilient More ❯
infrastructure-as-code tools (e.g., Terraform). Ensure secure, stable environments through proper VPC design, IAM governance, and secret management. Build and maintain system metrics and alerts using Prometheus, Grafana, and Loki. Enforce GitHub repo and branching standards across development teams. Ensure cost-effective infrastructure usage through continuous monitoring, resource optimization, and cost control strategies across AWS and containerized deployments. … bonus. Solid scripting skills (Python, Bash, or equivalent). Hands-on experience with Docker, Kubernetes, Helm, and deployment automation. Familiar with monitoring and logging stacks; experience with Prometheus/Grafana is expected. Security-conscious and experienced in IAM, encryption, and secure system design. Able to monitor and optimize computing resources to maintain performance within budget constraints. Comfortable using AI tools More ❯