Southampton, Hampshire, South East, United Kingdom Hybrid / WFH Options
Spectrum It Recruitment Limited
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through
Actions) Work with cloud platforms such as AWS, Azure, or GCP Manage infrastructure as code using tools like Terraform Monitor and maintain production systems using tools such as Prometheus, Grafana, or Datadog Collaborate with development and QA teams to improve deployment processes and system reliability Contribute to incident response, troubleshooting, and root cause analysis Requirements Approximately 18 months of experience
Sheffield, South Yorkshire, United Kingdom Hybrid / WFH Options
Experis
Integration services such as messaging and streams. Building RESTful API Services. Containerisation, Kubernetes, serverless functions. Microservices, and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). Automation scripting (using scripting languages such as Terraform, Ansible etc.). Experience of working with Continuous Integration (CI), Continuous Delivery (CD) and continuous testing tools. Experience working within an
or HashiCorp Nomad. Excellent problem-solving, communication, and collaboration skills. Nice to have: Experience managing distributed systems, microservices, and event-driven architectures. Knowledge of observability tools such as Prometheus, Grafana, ELK Stack, or Datadog. Experience with security best practices, monitoring, and incident response. Familiarity with DevSecOps and compliance frameworks (ISO 27001, SOC 2, GDPR). Exposure to big data processing
solutions using AWS, Kubernetes, and associated DevOps practices. Champion DevOps culture by integrating CI/CD pipelines using Jenkins, GitLab, or similar tools. Leverage monitoring and observability tools like Grafana and Prometheus for system reliability. Enhance security practices and ensure compliance with stringent security and accreditation standards. What You’ll Bring: Active DV or eDV clearance is essential due to
for infrastructure as code. Solid understanding of CI/CD pipelines and tools (e.g., GitHub CI, GitLab CI, CircleCI, Jenkins). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack). Strong scripting skills (e.g., Bash, Python) for automation tasks. Excellent problem-solving skills and attention to detail. Strong communication and collaboration skills. Preferred Experience Experience with other
language), Bash/Shell, YAML including any Development frameworks Extensive experience and in-depth knowledge of the Linux operating system for effective troubleshooting activities Experience with Observability tools like Grafana, Prometheus, ELK, OCI Observability We highly value ownership and initiative with capabilities to drive projects independently Dealing with changes on a daily basis in a very dynamic work environment Good
practices, RBAC, IAM, networking security (NSGs, ASGs), and governance policies to ensure compliance and risk mitigation. Monitoring & Logging: Experience with Azure Monitor, Application Insights, Log Analytics, and Prometheus/Grafana for observability and performance monitoring. Scripting & Automation: Strong scripting skills in PowerShell, Bash, and Python, along with automation frameworks like Ansible. Collaboration & Problem-Solving: Ability to work closely with
as: Docker, OpenShift, Kubernetes etc. Infrastructure as Code and CI/CD paradigms and systems such as: Ansible, Terraform, Jenkins, Bamboo, Concourse etc. Monitoring utilising products such as: Prometheus, Grafana, ELK, Filebeat etc. Observability - SRE Big Data solutions (ecosystems) and technologies such as: Apache Spark and the Hadoop Ecosystem Edge technologies e.g. NGINX, HAProxy etc. Excellent knowledge of YAML or
Docker, Kubernetes etc. Extensive experience using Infrastructure as Code for configuration management and code implementation - Terraform etc. Experience setting up and using monitoring and alerting tools such as Dynatrace, Grafana, CloudWatch etc. Experience using configuration management tools like Puppet, Ansible, Packer, Chef. Experience with various testing tooling - Selenium, Cucumber etc. Experience in scripting - Bash/shell Ability to be flexible
DevOps to optimize build times, parallelize tests, and reduce pipeline flakiness. Result Analysis & Root Cause • Analyze test outputs, system logs, and metrics (e.g., via ELK Stack or Prometheus/Grafana) to pinpoint failures and performance regressions. • Lead root-cause investigations for infrastructure incidents, producing clear post-mortem reports and remediation recommendations. Defect Management • Log, triage, and track defects in Jira
of building and maintaining CI/CD pipelines using the likes of GitLab, Jenkins, CircleCI, CodeBuild etc. Familiarity with scripting (Bash or Python). Monitoring and alerting tools - Prometheus, Grafana or Splunk, ELK. We're looking for someone who wants to progress their career into the DevOps arena. Submit your CV now to be considered. IND_PC1 Carbon60, Lorien & SRG
releases). Monitoring, Logging & Alerting: Implement comprehensive monitoring, logging, and alerting systems to proactively identify and address performance issues, errors, and security threats. Use tools like Azure Monitor, Prometheus, Grafana, or similar to collect and analyse metrics, logs, and traces. Configure alerts and notifications to ensure timely responses to critical events. Security & Compliance: Implement security best practices and controls within
other CI tools; Maven, Gradle or other build tools; Ansible or other IT Automation/software provisioning tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus etc * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, NewRelic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude; A creative approach towards solving
and containerization, Linux, Relational and NoSQL databases, building RESTful API Services, Containerisation, Kubernetes, serverless functions, Microservices, and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). Automation scripting (using scripting languages such as Terraform, Ansible etc.). Strong understanding of security principles in cloud and enterprise systems. Familiarity with audit and compliance considerations in regulated
Agile teams using tools like Git, Jira, and Confluence Eligible for SC and NPPV3 clearance Desirable: Container orchestration with Kubernetes HashiCorp tools: Vault, Consul, Packer Monitoring and observability with Grafana, Prometheus, or similar Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for technology Strong problem-solving mindset Collaborative team
City of London, London, United Kingdom Hybrid / WFH Options
Amber Labs
Agile teams using tools like Git, Jira, and Confluence Eligible for SC and NPPV3 clearance Desirable: Container orchestration with Kubernetes HashiCorp tools: Vault, Consul, Packer Monitoring and observability with Grafana, Prometheus, or similar Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for technology Strong problem-solving mindset Collaborative team
South East London, England, United Kingdom Hybrid / WFH Options
Amber Labs
Agile teams using tools like Git, Jira, and Confluence Eligible for SC and NPPV3 clearance Desirable: Container orchestration with Kubernetes HashiCorp tools: Vault, Consul, Packer Monitoring and observability with Grafana, Prometheus, or similar Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for technology Strong problem-solving mindset Collaborative team
agile development methodologies such as Scrum or Kanban Experience with infrastructure as code (IaC) tools such as Terraform or CloudFormation Familiarity with monitoring and logging tools such as Prometheus, Grafana, or ELK Stack Experience with machine learning and artificial intelligence technologies Desirable Certifications Strong proficiency in at least one of the following AWS certifications: AWS Certified Solutions Architect - Associate AWS