Isleworth, England, United Kingdom Hybrid / WFH Options
Sky
experience in networking and security standards, protocols and best practices Proven experience in logging systems (e.g. ELK stack ) Proven experience in monitoring systems (e.g. Prometheus ) Proven experience in tracing systems (e.g. OpenTelemetry , Jaeger) Experience in performance optimization and resource management Relevant certifications (AWS, Google) Understanding of Agile methodologies Ability to More ❯
compliance. Collaborate with development and operations teams to improve system performance and scalability. Maintain and improve logging, monitoring, and alerting systems using tools like Prometheus, Grafana, ELK Stack, or Datadog Support and optimize infrastructure for both Linux and Windows-based environments. Participate in incident management, problem resolution, and root cause More ❯
London, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
. Maintain configuration management using tools like Ansible, Chef, or Puppet. Monitor application performance, uptime, and logs using tools like Splunk, ELK Stack, or Prometheus/Grafana. Work with cloud platforms (Azure preferred, AWS or GCP a plus) to ensure scalable and secure environments. Ensure compliance with enterprise security, audit More ❯
Slough, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
. Maintain configuration management using tools like Ansible, Chef, or Puppet. Monitor application performance, uptime, and logs using tools like Splunk, ELK Stack, or Prometheus/Grafana. Work with cloud platforms (Azure preferred, AWS or GCP a plus) to ensure scalable and secure environments. Ensure compliance with enterprise security, audit More ❯
applications into microservices architectures. In-depth Linux/Unix experience, emphasizing system performance tuning and automation. Familiarity with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Loki, OTel, ELK stack) to ensure system reliability and performance. Experience in developing and working with backend applications technologies (e.g. Express, Django). Benefits More ❯
Delivery - Automic, Octopus Deploy, UrbanCode etc. • Containers - Docker, Kubernetes, Mesosphere etc. • Configuration Management - Ansible, Chef, Puppet etc. • Cloud - AWS, Azure, GCP etc. • Monitoring - ELK, Prometheus, Splunk etc. • Experience in one of the following scripting language: Java, Bash, Python, Powershell, Golang, etc. • Experience working with Linux and/or Windows systems More ❯
London, England, United Kingdom Hybrid / WFH Options
Tes
Strong understanding of security frameworks and compliance standards for cloud infrastructure and DevOps processes. Monitoring & Observability: Understanding of monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK) to ensure system performance and issue tracking. Skills CI/CD Tools: Hands-on experience with Jenkins, GitLab CI/CD, Travis CI More ❯
Grays, England, United Kingdom Hybrid / WFH Options
TES
Strong understanding of security frameworks and compliance standards for cloud infrastructure and DevOps processes. Monitoring & Observability: Understanding of monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK) to ensure system performance and issue tracking. Skills CI/CD Tools: Hands-on experience with Jenkins, GitLab CI/CD, Travis CI More ❯
networking security (NSGs, ASGs), and governance policies to ensure compliance and risk mitigation. Monitoring & Logging : Experience with Azure Monitor, Application Insights, Log Analytics, and Prometheus/Grafana for observability and performance monitoring. Scripting & Automation : Strong scripting skills in PowerShell, Bash, and Python , along with automation frameworks like Ansible . Collaboration More ❯
London, England, United Kingdom Hybrid / WFH Options
SAP SE
networking security (NSGs, ASGs), and governance policies to ensure compliance and risk mitigation. Monitoring & Logging : Experience with Azure Monitor, Application Insights, Log Analytics, and Prometheus/Grafana for observability and performance monitoring. Scripting & Automation : Strong scripting skills in PowerShell, Bash, and Python , along with automation frameworks like Ansible . Collaboration More ❯
networking security (NSGs, ASGs), and governance policies to ensure compliance and risk mitigation. Monitoring & Logging: Experience with Azure Monitor, Application Insights, Log Analytics, and Prometheus/Grafana for observability and performance monitoring. Scripting & Automation: Strong scripting skills in PowerShell, Bash, and Python, along with automation frameworks like Ansible. Collaboration & Problem More ❯
London, England, United Kingdom Hybrid / WFH Options
ZigZag Global
PowerShell, Bash, or Python Hands-on experience with CI/CD tools like Azure DevOps, GitHub Actions or GitLab CI Practical experience with Grafana, Prometheus and/0r other monitoring tools Solid understanding of networking, security, and compliance principles Excellent problem-solving and troubleshooting skills Strong communication and collaboration skills More ❯
platforms such as AWS, Azure, or GCP Manage infrastructure as code using tools like Terraform Monitor and maintain production systems using tools such as Prometheus, Grafana, or Datadog Collaborate with development and QA teams to improve deployment processes and system reliability Contribute to incident response, troubleshooting, and root cause analysis More ❯
platforms such as AWS, Azure, or GCP Manage infrastructure as code using tools like Terraform Monitor and maintain production systems using tools such as Prometheus, Grafana, or Datadog Collaborate with development and QA teams to improve deployment processes and system reliability Contribute to incident response, troubleshooting, and root cause analysis More ❯
platforms such as AWS, Azure, or GCP Manage infrastructure as code using tools like Terraform Monitor and maintain production systems using tools such as Prometheus, Grafana, or Datadog Collaborate with development and QA teams to improve deployment processes and system reliability Contribute to incident response, troubleshooting, and root cause analysis More ❯
platforms such as AWS, Azure, or GCP Manage infrastructure as code using tools like Terraform Monitor and maintain production systems using tools such as Prometheus, Grafana, or Datadog Collaborate with development and QA teams to improve deployment processes and system reliability Contribute to incident response, troubleshooting, and root cause analysis More ❯
platforms such as AWS, Azure, or GCP Manage infrastructure as code using tools like Terraform Monitor and maintain production systems using tools such as Prometheus, Grafana, or Datadog Collaborate with development and QA teams to improve deployment processes and system reliability Contribute to incident response, troubleshooting, and root cause analysis More ❯
platforms such as AWS, Azure, or GCP Manage infrastructure as code using tools like Terraform Monitor and maintain production systems using tools such as Prometheus, Grafana, or Datadog Collaborate with development and QA teams to improve deployment processes and system reliability Contribute to incident response, troubleshooting, and root cause analysis More ❯
tasks. Experience with CI/CD tools (GitHub Actions, Jenkins, AWS CodePipeline), and integrating data-centric workflows. Familiarity with monitoring and logging tools (e.g., Prometheus, Loki, Grafana) in application and data-intensive environments. Proficiency in Configuration Management tools (Chef, Puppet, Ansible) and data orchestration tools (e.g., Airflow, Prefect). Strong More ❯
London, England, United Kingdom Hybrid / WFH Options
Quaisr Limited
problem-solving, communication, and collaboration skills. Nice to have: Experience managing distributed systems, microservices, and event-driven architectures. Knowledge of observability tools such as Prometheus, Grafana, ELK Stack, or Datadog. Experience with security best practices, monitoring, and incident response. Familiarity with DevSecOps and compliance frameworks (ISO 27001, SOC 2, GDPR More ❯
problem-solving, communication, and collaboration skills. Nice to have: Experience managing distributed systems, microservices, and event-driven architectures. Knowledge of observability tools such as Prometheus, Grafana, ELK Stack, or Datadog. Experience with security best practices, monitoring, and incident response. Familiarity with DevSecOps and compliance frameworks (ISO 27001, SOC 2, GDPR More ❯
SOA) Cloud: AWS CI/CD: Jenkins, GoCD Orchestration: Docker, Docker compose, Kubernetes Interservice communication: Rest, Rabbit MQ, HTTP Databases: PostgreSQL, MongoDB, Clickhouse Monitoring: Prometheus, Grafana Logging: ELK stack The Benefits Offered Paid Time Off - A minimum of 35 days of paid time off per year, inclusive of annual leave More ❯
London, England, United Kingdom Hybrid / WFH Options
ZigZag Global
PowerShell, Bash, or Python. Hands-on experience with CI/CD tools like Azure DevOps, GitHub Actions or GitLab CI. Practical experience with Grafana, Prometheus and/0r other monitoring tools. Solid understanding of networking, security, and compliance principles. Excellent problem-solving and troubleshooting skills. Strong communication and collaboration skills More ❯
Kafka, Docker, Redis, MongoDB. Experience with application clustering, load balancing, high availability, and reliability concepts and supporting technologies. Experience with monitoring systems such as Prometheus, Grafana, Splunk, or the ELK Stack. Clear written and verbal communication skills. Some level of participation in an on-call escalation path. A passion for More ❯