in data migration between different storage systems Knowledge of blue-green deployment or other zero-downtime deployment strategies -3 years' monitoring and logging experience Experience with tools like Prometheus, Grafana, ELK stack -3 years' agile methodologies experience Experience working as a Scrum team member -3 years' soft skills experience Experience working with the Agile software development methodologies and collaborating between More ❯
implementing DevOps solutions in cloud-native environments. • Proficiency in Infrastructure as Code (IaC) tools like Terraform, Ansible, Chef, or Puppet. • Familiarity with monitoring and logging tools such as Prometheus, Grafana, Splunk, or the ELK Stack. • Strong knowledge of scripting and programming languages like Python, Bash, Ruby, or Go. • Understanding of security principles and best practices in DevOps (e.g., DevSecOps). … Develop automated processes for managing cloud resources, including configuration management, server provisioning, and network setup. • Implement and maintain robust monitoring, logging, and alerting systems using tools such as Prometheus, Grafana, Splunk, or ELK Stack. • Ensure the performance, reliability, and security of DevOps environments by continuously assessing and optimizing both infrastructure and deployment pipelines. • Define key performance indicators (KPIs) for DevOps More ❯
field) -5-7 years' experience as a cybersecurity network engineer (or similar) -Cloud certifications (e.g., AWS Certified Solutions Architect, Azure Administrator Associate). -Experience with monitoring tools (e.g., Prometheus, Grafana, CloudWatch). -Knowledge of serverless computing and microservices architecture. -Experience with hybrid cloud or multi-cloud environments. -Proficiency with at least one major cloud platform (AWS, Azure, GCP). -Experience More ❯
experience with AWS cloud infrastructure Deep understanding of IaC tools: Terraform, Packer, CloudFormation Proven leadership in multidisciplinary delivery teams Skills in Databases: MongoDB/Atlas; Messaging: Kafka; Observability: Prometheus, Grafana, Splunk Experience working in a DevOps environment, favoring and implementing Continuous Integration & Deployment over manual processes Experience designing, implementing, securing, and supporting Unix/Linux-based platforms (ideally RHEL/ More ❯
Sheffield, Yorkshire, United Kingdom Hybrid / WFH Options
Experis - ManpowerGroup
Integration services such as messaging and streams. Building RESTful API Services. Containerisation, Kubernetes, serverless functions. Microservices, and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). Automation scripting (using scripting languages such as Terraform, Ansible etc.). Experience of working with Continuous Integration (CI), Continuous Delivery (CD) and continuous testing tools. Experience working within an More ❯
handsworth, yorkshire and the humber, united kingdom
Networker Global Limited
Integration services such as messaging and streams. Building RESTful API Services. Containerisation, Kubernetes, serverless functions. Microservices, and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). Automation scripting (using scripting languages such as Terraform, Ansible etc.). Experience of working with Continuous Integration (CI), Continuous Delivery (CD) and continuous testing tools. Experience working within an More ❯
or HashiCorp Nomad. Excellent problem-solving, communication, and collaboration skills. Nice to have: Experience managing distributed systems, microservices, and event-driven architectures. Knowledge of observability tools such as Prometheus, Grafana, ELK Stack, or Datadog. Experience with security best practices, monitoring, and incident response. Familiarity with DevSecOps and compliance frameworks (ISO 27001, SOC 2, GDPR). Exposure to big data processing More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Frontier Resourcing
or PowerShell Solid understanding of containerization and orchestration using Docker and Kubernetes Experience with AWS networking, including VPCs, subnets, and security groups Familiarity with monitoring and logging solutions: Prometheus, Grafana, ELK Stack, or AWS CloudWatch Knowledge of Zero Trust security models and best practices for securing cloud environments Strong troubleshooting and cloud performance optimization skills Excellent communication and teamwork skills More ❯
Newcastle Upon Tyne, Tyne and Wear, England, United Kingdom
Nigel Wright Group
to code Knowledge of Infrastructure as Code tools like Ansible, Terraform, or similar Experience with containerization (Docker) and orchestration (Kubernetes or Docker Swarm) Familiarity with monitoring tools like ELK, Grafana, or similar Experience with CI/CD tools like TeamCity Microsoft SQL Server expertise Windows and Linux administration experience Strong communication skills for interacting with clients and team members Financial More ❯
messaging and streams. o Building RESTful API Services. o Containerisation, Kubernetes, serverless functions. o Microservices, and distributed tracing. o Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). o Automation scripting (using scripting languages such as Terraform, Ansible etc.). • Experience of working with Continuous Integration (CI), Continuous Delivery (CD) and continuous testing tools. • Experience working within More ❯
experience with AWS cloud infrastructure • Deep understanding of IaC tools: Terraform, Packer, CloudFormation • Proven leadership in multidisciplinary delivery teams • Skills in Databases: MongoDB/Atlas, Messaging: Kafka, Observability: Prometheus, Grafana, Splunk • Experience of working in a DevOps environment - favouring and implementing Continuous Integration & Deployment over manual processes. • Experience of designing, implementing, securing and supporting Unix/Linux based platforms (ideally More ❯
GitLab CI, CircleCI, etc.). In-depth understanding of networking, storage, and compute resources in both cloud and on-prem environments. Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack, Splunk). Knowledge of Linux/Unix and/or Windows server administration and performance tuning. Proven ability to lead and mentor a team of engineers, facilitating knowledge More ❯
understanding of continuous integration and continuous deployment (CI/CD) pipelines. Familiarity with a scripting language like Python, Bash, or Go. Familiarity with monitoring tools such as Datadog, Prometheus, Grafana, ELK stack, or similar. Strong problem-solving skills, excellent communication skills, and the ability to work independently or in a team. Another thing we'd like to mention There's More ❯
Washington, Washington DC, United States Hybrid / WFH Options
ClearanceJobs
Automation: Enhance DevSecOps pipelines, automate deployments, and improve system resilience through tools like GitLab CI/CD, Jenkins, and Kubernetes. • Incident Response & Monitoring: Implement and manage monitoring solutions (Prometheus, Grafana, ELK Stack), respond to incidents, and conduct post-mortems. • Networking & Security: Configure and maintain VPCs, VPNs, security groups, and firewalls in AWS GovCloud, ensuring compliance with FedRAMP requirements. • GOV Production More ❯
practices, RBAC, IAM, networking security (NSGs, ASGs), and governance policies to ensure compliance and risk mitigation. Monitoring & Logging : Experience with Azure Monitor, Application Insights, Log Analytics, and Prometheus/Grafana for observability and performance monitoring. Scripting & Automation : Strong scripting skills in PowerShell, Bash, and Python , along with automation frameworks like Ansible . Collaboration & Problem-Solving : Ability to work closely with More ❯
as: Docker, OpenShift, Kubernetes etc. Infrastructure as Code and CI/CD paradigms and systems such as: Ansible, Terraform, Jenkins, Bamboo, Concourse etc. Monitoring utilising products such as: Prometheus, Grafana, ELK, filebeat etc. Observability - SRE Big Data solutions (ecosystems) and technologies such as: Apache Spark and the Hadoop Ecosystem Edge technologies e.g. NGINX, HAProxy etc. Excellent knowledge of YAML or More ❯
DevOps to optimize build times, parallelize tests, and reduce pipeline flakiness. Result Analysis & Root Cause • Analyze test outputs, system logs, and metrics (e.g., via ELK Stack or Prometheus/Grafana) to pinpoint failures and performance regressions. • Lead root-cause investigations for infrastructure incidents, producing clear post-mortem reports and remediation recommendations. Defect Management • Log, triage, and track defects in Jira More ❯
Gloucester, Gloucestershire, South West Hybrid / WFH Options
CGI
who can adapt to client problems as required. The Tech Stack used is: Java, Python, Javascript (Typescript), Vue, Bash, Jenkins, Ansible ,Cucumber, NiFi, Go, AWS, Gitlab, ELK stack, Terraform, Grafana, Sonarqube, Openshift, Linux Required qualifications to be successful in this role Proven experience in Site Reliability Engineering or a similar DevOps/SRE role supporting cloud-based applications. Strong scripting … tools like Terraform. Solid understanding of AWS services and cloud-native architecture. Strong troubleshooting skills with experience in Linux-based environments. Experience with monitoring and logging tools such as Grafana, ELK Stack, and SonarQube. Familiarity with container orchestration using OpenShift (or Kubernetes equivalent). Ability to support, maintain, and improve deployment environments, ensuring reliability and scalability. Comfortable with live service More ❯
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
IO Associates
like Terraform Containerisation knowledge (Docker) and orchestration platforms (Kubernetes, OpenShift, Docker Swarm) Skilled in managing CI/CD platforms such as Jenkins Experience with monitoring and logging tools (e.g., Grafana , Prometheus , InfluxDB ) Familiarity with message queue technologies (e.g., RabbitMQ, AMQP) Solid grasp of SQL and relational databases Linux systems expertise, including command line and shell scripting Understanding of network and More ❯
solutions • Knowledge of container security technologies with experience evaluating and mitigating or resolving vulnerability findings Nice If You Have: • Experience developing in Java or Python • Experience with Prometheus/Grafana and/or ElasticSearch/Kibana and FluentD • Experience with design, deployment, and management of Cloud environments, including AWS or Azure • Experience with using or migrating continuous integration (CI) and More ❯
. Strong understanding of distributed systems, microservices architecture, and RESTful API design. Hands-on experience with Kubernetes and container orchestration. Familiarity with monitoring, alerting, and logging tools (e.g., Prometheus, Grafana, ELK stack, or Datadog). Experience with Elastic will be highly helpful with this position. Hands-on experience with incident response, including designing and improving incident management processes. Expertise in More ❯
Guildford, Surrey, United Kingdom Hybrid / WFH Options
BAE Systems (New)
similar platforms. Operating Systems : Proficiency in Linux environments, including scripting in Bash and/or Python for automation and tooling. Open Source Technologies : Familiarity with tools like Kafka , Elasticsearch , Grafana , or Prometheus for logging, monitoring, and streaming use cases. Version Control : Proficient in using Git for source code management, including branching strategies and code reviews. Desirable Skills & Experience Candidates meeting More ❯