In-depth experience engineering and maintaining a private-cloud infrastructure: Bare-metal, vSphere, KVM, Kubernetes. Experience with tools like Ansible, Terraform, Docker, Kafka, Nexus Experience with observability platforms: InfluxDB, Prometheus, ELK, Jaeger, Grafana, Nagios, Zabbix Familiarity with Big Data tools: Hadoop, HDFS, Spark, HBase Ability to write code in Go, Python, Bash, or Perl for automation. Work Experience More ❯
In-depth experience engineering and maintaining a private-cloud infrastructure: Bare-metal, vSphere, KVM, Kubernetes. Experience with tools like Ansible, Terraform, Docker, Kafka, Nexus. Experience with observability platforms: InfluxDB, Prometheus, ELK, Jaeger, Grafana, Nagios, Zabbix. Familiarity with Big Data tools: Hadoop, HDFS, Spark, HBase. Ability to write code in Go, Python, Bash, or Perl for automation. Work Experience More ❯
Role: DevOps Engineer Location: Slough-Berkshire, UK Salary: £38000 - £40000 IT Global Consulting Limited is seeking an experienced and highly motivated DevOps Engineer responsible for development, maintenance, and ongoing support of the platform, acting as the internal focal point for More ❯
Role: DevOps Engineer Location: Slough-Berkshire, UK Salary: £38000 - £40000 IT Global Consulting Limited is seeking an experienced and highly motivated DevOps Engineer responsible for developing, maintaining, and supporting the platform. The role involves acting as the internal focal point More ❯
Role: DevOps Engineer Location: Slough-Berkshire, UK Salary: £38000 - £40000 IT Global Consulting Limited is seeking an experienced and highly motivated DevOps Engineer responsible for development, maintenance, and ongoing support of the platform. The role involves acting as the internal More ❯
Role: DevOps Engineer Location: Slough-Berkshire, UK Salary: £38000 - £40000 IT Global Consulting Limited is seeking an experienced and highly motivated DevOps Engineer responsible for the development, maintenance, and ongoing support of the platform. The role acts as the internal More ❯
Role: DevOps Engineer Location: Slough-Berkshire, UK Salary: £38000 - £40000 IT Global Consulting Limited is seeking an experienced and highly motivated DevOps Engineer responsible for development, maintenance, and ongoing support of the platform, acting as the internal focal point for More ❯
IT Global Consulting Limited is seeking an experienced and highly motivated DevOps Engineer responsible for the development, maintenance, and ongoing support of the platform, acting as the internal focal point for an enterprise platform. The ideal candidate should be self More ❯
Role: DevOps Engineer Location: Slough-Berkshire, UK Salary: £38000 - £40000 IT Global Consulting Limited is seeking an experienced and highly motivated DevOps Engineer responsible for the development, maintenance, and ongoing support of the platform. The role acts as the internal More ❯
Role: DevOps Engineer Location: Slough-Berkshire, UK Salary: £38000 - £40000 IT Global Consulting Limited is seeking an experienced and highly motivated DevOps Engineer responsible for developing, maintaining, and supporting the platform. The role acts as the internal focal point for More ❯
Role: DevOps Engineer Location: Slough-Berkshire, UK Salary: £38000 - £40000 IT Global Consulting Limited is seeking an experienced and highly motivated DevOps Engineer responsible for the development, maintenance, and ongoing support of the platform, acting as the internal focal point More ❯
Role: DevOps Engineer Location: Slough-Berkshire, UK Salary: £38000 - £40000 IT Global Consulting Limited is seeking an experienced and highly motivated DevOps Engineer responsible for platform development, maintenance, and support. The role involves acting as the internal focal point for More ❯
Role: DevOps Engineer Location: Slough-Berkshire, UK Salary: £38000 - £40000 IT Global Consulting Limited is seeking an experienced and highly motivated DevOps Engineer responsible for the development, maintenance, and ongoing support of the platform. The role involves acting as the More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Cognitive Group | Part of the Focus Cloud Group
with root cause analysis and preventive measures. Handle change requests, track recurring issues, and work on long-term fixes to improve system stability. Implement and maintain observability solutions using Prometheus, Grafana, and Splunk. Write PromQL queries for custom monitoring dashboards, alerting, and diagnostics. Manage and optimize CI/CD pipelines for automated testing, deployment, and rollback strategies. Develop and maintain … the DevOps Engineer level Incident, change & problem management experience. This role is heavily operation-oriented, including on-call requirements Strong background in setup & operation of enterprise observability tooling, specifically Prometheus, Grafana and Splunk, including usage of PromQL Proficient in one or more languages of Python, Go, Bash, SQL Familiar with GitHub/GitOps/container orchestration/Kubernetes operations Working More ❯
with root cause analysis and preventive measures. Handle change requests, track recurring issues, and work on long-term fixes to improve system stability. Implement and maintain observability solutions using Prometheus, Grafana, and Splunk. Write PromQL queries for custom monitoring dashboards, alerting, and diagnostics. Manage and optimize CI/CD pipelines for automated testing, deployment, and rollback strategies. Develop and maintain … the DevOps Engineer level Incident, change & problem management experience. This role is heavily operation-oriented, including on-call requirements Strong background in setup & operation of enterprise observability tooling, specifically Prometheus, Grafana and Splunk, including usage of PromQL Proficient in one or more languages of Python, Go, Bash, SQL Familiar with GitHub/GitOps/container orchestration/Kubernetes operations Working More ❯
South East London, England, United Kingdom Hybrid / WFH Options
Cognitive Group | Part of the Focus Cloud Group
with root cause analysis and preventive measures. Handle change requests, track recurring issues, and work on long-term fixes to improve system stability. Implement and maintain observability solutions using Prometheus, Grafana, and Splunk. Write PromQL queries for custom monitoring dashboards, alerting, and diagnostics. Manage and optimize CI/CD pipelines for automated testing, deployment, and rollback strategies. Develop and maintain … the DevOps Engineer level Incident, change & problem management experience. This role is heavily operation-oriented, including on-call requirements Strong background in setup & operation of enterprise observability tooling, specifically Prometheus, Grafana and Splunk, including usage of PromQL Proficient in one or more languages of Python, Go, Bash, SQL Familiar with GitHub/GitOps/container orchestration/Kubernetes operations Working More ❯
processes that enable continuous integration, continuous deployment, and efficient infrastructure management. Your expertise in using cutting-edge technologies such as GitHub, Azure, Kubernetes, ACR, YAML, Terraform, Azure DevOps, HELM, Prometheus, Grafana, PowerShell, and Jira will be crucial in driving innovation and ensuring the reliability of our software delivery pipeline. Key Responsibilities: Infrastructure Automation and Configuration Management: Design, build, and maintain … deploy, manage, and scale containerized applications. Work with Docker and Azure Container Registry (ACR) to create and manage container images efficiently. Monitoring and Performance Management: Implement monitoring solutions using Prometheus and Grafana to track system performance, application health, and resource utilization. Set up alerts and notifications to promptly respond to potential issues. Security and Compliance: Collaborate with the security team … manifests and Helm for managing Kubernetes applications. Experience with implementing and maintaining CI/CD pipelines using Azure DevOps or similar tools. Knowledge of monitoring and logging solutions like Prometheus and Grafana, or similar tools. Strong scripting skills, especially in PowerShell, for automating tasks and configurations. Excellent problem-solving skills and the ability to work well in a fast-paced More ❯
with root cause analysis and preventive measures. Handle change requests, track recurring issues, and work on long-term fixes to improve system stability. Implement and maintain observability solutions using Prometheus, Grafana, and Splunk. Write PromQL queries for custom monitoring dashboards, alerting, and diagnostics. Manage and optimize CI/CD pipelines for automated testing, deployment, and rollback strategies. Develop and maintain … the DevOps Engineer level Incident, change & problem management experience. This role is heavily operation-oriented, including on-call requirements Strong background in setup & operation of enterprise observability tooling, specifically Prometheus, Grafana and Splunk, including usage of PromQL Proficient in one or more languages of Python, Go, Bash, SQL Familiar with GitHub/GitOps/container orchestration/Kubernetes operations Working More ❯
GitHub Actions) Define and enforce platform standards across environments (dev, staging, prod) Collaborate with developers and DevOps on deployment tooling and security Enable platform observability using tools like Datadog, Prometheus, and CloudWatch Maintain Helm charts and Terraform modules for shared infrastructure Contribute to onboarding documentation and platform adoption practices Participate in incident response and postmortem analysis, where applicable Essential Skills … and secure image management Scripting or programming experience in Bash, Python, or TypeScript Strong understanding of GitOps practices and infrastructure lifecycle management Desirable Skills Experience with observability tooling (Datadog, Prometheus, Fluent Bit) Knowledge of admission controllers, OPA/Gatekeeper (optional for governance) Familiarity with cloud cost optimisation and Kubernetes scaling strategies Exposure to security scanning tools (tfsec, Trivy, Snyk) Interest More ❯
/CD pipelines using Jenkins, TeamCity, or similar tools. Establish and maintain key performance metrics through baseline vs. benchmarking analysis. Conduct in-depth performance analysis using tools like Grafana, Prometheus, AWS CloudWatch, and Azure Insights. Work closely with engineers and operational teams to diagnose and troubleshoot performance issues. What We Offer As well as a competitive salary and benefits package … recognised professional qualifications. Preservica is an equal opportunity employer. Desired Experience Expertise in performance testing tools, such as JMeter, Gatling, LoadRunner, etc. Hands-on use of monitoring tools, i.e. Prometheus, Grafana, AWS CloudWatch, or similar Experience with Jenkins, TeamCity, Kubernetes, Docker, RabbitMQ (or similar) Ability to script and modify solutions in Python, Bash, Java, etc. Experience with Playwright and TypeScript More ❯
the Ilkley office for occasional office attendance. VARIED DAY TO DAY RESPONSIBILITIES Ensuring system reliability, performance, and scalability through monitoring and automation Building and maintaining observability solutions using Grafana, Prometheus, Loki, OpenTelemetry Proactively identifying and resolving performance bottlenecks and infrastructure issues Automating infrastructure provisioning, configuration management, and deployments Implementing effective logging, monitoring, and alerting strategies Managing incident response and post … incident management, error budgets, and service-level objectives (SLOs) Experience designing and implementing robust observability, monitoring and logging solutions Strong proficiency with observability and monitoring tools such as Grafana, Prometheus, and Loki Strong experience with distributed tracing and telemetry tools such as OpenTelemetry An understanding of cloud networking architecture and load balancing techniques Experience with container orchestration platforms like Kubernetes More ❯
based on set targets will be expected. VARIED DAY TO DAY RESPONSIBILITIES Ensuring system reliability, performance, and scalability through monitoring and automation Building and maintaining observability solutions using Grafana, Prometheus, Loki, OpenTelemetry Proactively identifying and resolving performance bottlenecks and infrastructure issues Automating infrastructure provisioning, configuration management, and deployments Implementing effective logging, monitoring, and alerting strategies Managing incident response and post … incident management, error budgets, and service-level objectives (SLOs) Experience designing and implementing robust observability, monitoring and logging solutions Strong proficiency with observability and monitoring tools such as Grafana, Prometheus, and Loki Strong experience with distributed tracing and telemetry tools such as OpenTelemetry An understanding of cloud networking architecture and load balancing techniques Experience with container orchestration platforms like Kubernetes More ❯
Sheffield, England, United Kingdom Hybrid / WFH Options
KnowBe4
maintain environments to ensure high availability and security. Design and implement CI/CD pipelines to automate software delivery. Monitor and troubleshoot system performance issues, using observability tools like Prometheus, Grafana, or Datadog . Collaborate with development teams to align infrastructure efforts with project needs and timelines. Build and maintain infrastructure as code (IaC) solutions using tools like Terraform Manage … continuous delivery. AWS or Azure Cloud Expertise: Strong knowledge of AWS/Azure services, Infrastructure-as-Code: Proficiency in Terraform, Ansible, or similar tools. Monitoring and Observability: Experience with Prometheus, Grafana, Datadog, or other observability platforms. Automation and Scripting: Proficiency in Python, Bash, or other scripting languages to automate tasks. Incident Management: Ability to lead incident response efforts and conduct More ❯
YAML; Exposure to both Windows and Linux operating systems; Familiarity with standard DevOps tools such as Git, Jenkins, TeamCity etc; Use of monitoring tools such as Kibana, Nagios, Grafana, Prometheus, etc. The following experience would also be an advantage: Middleware technologies such as web or application platforms, message brokers, etc; Software-Defined Networking configuration, and familiarity with networking components such … YAML; Exposure to both Windows and Linux operating systems; Familiarity with standard DevOps tools such as Git, Jenkins, TeamCity etc; Use of monitoring tools such as Kibana, Nagios, Grafana, Prometheus, etc. The following experience would also be an advantage: Middleware technologies such as web or application platforms, message brokers, etc; Software-Defined Networking configuration, and familiarity with networking components such More ❯
pipelines for microservices and Kafka-related applications using tools like Drone Automate infrastructure provisioning using Terraform or Infrastructure-as-Code tools Build and maintain monitoring and alerting systems using Prometheus, Grafana, or AWS native monitoring tools like CloudWatch Collaborate with development and DevOps teams to design MSK and Kubernetes-based solutions Troubleshoot complex issues related to Kafka and container orchestration. … Infrastructure as Code (IaC) tools like Terraform Knowledge of container build and deployment automation using CI/CD pipelines Experience in observability tools for both MSK and Kubernetes, including Prometheus, Grafana, and AWS CloudWatch for metrics and logs Deep understanding of Kafka and Kubernetes security practices, including network policies and IAM roles Experience with Vault Strong analytical and troubleshooting skills More ❯