Southampton, Hampshire, South East, United Kingdom Hybrid / WFH Options
Spectrum It Recruitment Limited
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through More ❯
Sheffield, South Yorkshire, United Kingdom Hybrid / WFH Options
Experis
Integration services such as messaging and streams. Building RESTful API Services. Containerisation, Kubernetes, serverless functions. Microservices, and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). Automation scripting (using scripting languages such as Terraform, Ansible etc.). Experience of working with Continuous Integration (CI), Continuous Delivery (CD) and continuous testing tools. Experience working within an More ❯
as: Docker, OpenShift, Kubernetes etc. Infrastructure as Code and CI/CD paradigms and systems such as: Ansible, Terraform, Jenkins, Bamboo, Concourse etc. Monitoring utilising products such as: Prometheus, Grafana, ELK, filebeat etc. Observability - SRE Big Data solutions (ecosystems) and technologies such as: Apache Spark and the Hadoop Ecosystem Edge technologies e.g. NGINX, HAProxy etc. Excellent knowledge of YAML or More ❯
DevOps to optimize build times, parallelize tests, and reduce pipeline flakiness. Result Analysis & Root Cause • Analyze test outputs, system logs, and metrics (e.g., via ELK Stack or Prometheus/Grafana) to pinpoint failures and performance regressions. • Lead root-cause investigations for infrastructure incidents, producing clear post-mortem reports and remediation recommendations. Defect Management • Log, triage, and track defects in Jira More ❯
of building and maintaining CI/CD pipelines using the likes of GitLab, Jenkins, CircleCI, CodeBuild etc. Familiarity with scripting (Bash or Python). Monitoring and alerting tools - Prometheus, Grafana or Splunk, ELK. We're looking for someone who wants to progress their career into the DevOps arena. Submit your CV now to be considered. IND_PC1 Carbon60, Lorien & SRG More ❯
and containerization, Linux, Relational and NoSQL databases, building RESTful API Services, Containerisation, Kubernetes, serverless functions, Microservices, and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (eg, ELK, Splunk, Prometheus, Grafana). Automation Scripting (using Scripting languages such as Terraform, Ansible etc.). Strong understanding of security principles in cloud and enterprise systems. Familiarity with audit and compliance considerations in regulated More ❯
Agile teams using tools like Git, Jira, and Confluence Eligible for SC and NPPV3 clearance Desirable: Container orchestration with Kubernetes HashiCorp tools: Vault, Consul, Packer Monitoring and observability with Grafana, Prometheus, or similar Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for technology Strong problem-solving mindset Collaborative team More ❯
South East London, England, United Kingdom Hybrid / WFH Options
Amber Labs
Agile teams using tools like Git , Jira , and Confluence Eligible for SC and NPPV3 clearance Desirable: Container orchestration with Kubernetes HashiCorp tools: Vault , Consul , Packer Monitoring and observability with Grafana , Prometheus , or similar Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for technology Strong problem-solving mindset Collaborative team More ❯
agile development methodologies such as Scrum or Kanban Experience with infrastructure as code (IaC) tools such as Terraform or CloudFormation Familiarity with monitoring and logging tools such as Prometheus, Grafana, or ELK Stack Experience with machine learning and artificial intelligence technologies Desirable Certifications Strong proficiency in at least one of the following AWS certifications: AWS Certified Solutions Architect - Associate AWS More ❯
Experience with Terraform, Kubernetes, Kafka, Docker, Redis, MongoDB. Experience with application clustering, load balancing, high availability, and reliability concepts and supporting technologies. Experience with monitoring systems such as Prometheus, Grafana, Splunk, or the ELK Stack. Clear written and verbal communication skills. Some level of participation in an on-call escalation path. A passion for providing excellent service to all internal More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Arm Limited
Skills and Experience: Experience in a GitOps solution such as ArgoCD, Flux or Fleet Implementation of the Security Development Lifecycle (SDL) in infrastructure Monitoring and observability using Prometheus and Grafana, ELK stack or equivalent Use of Kubernetes management systems such as Rancher Familiarity with open source project development cycles and contribution processes, particularly around CI/CD infrastructure Accommodations at More ❯
Perl, JAVA) and automation skills. Thorough knowledge of Jenkins and pipeline using Groovy script. Experience with Docker containers and Amazon Linux 2023 AMI. Experience with system monitoring tools (e.g., Grafana, Alert Manager, Prometheus, and Node exporter). Ability to analyse and resolve complex infrastructure resource and application deployment issues. Experience with Git, Jira, Confluence, and ServiceNow for incident and change More ❯
CI/CD pipelines Containerization and Orchestration: Knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes) Monitoring and Logging: Experience with monitoring and logging tools like DataDog, Prometheus, or Grafana Data Engineering Skills: Knowledge of event streaming platforms (e.g., Apache Kafka) and SQL database management Strong Communication and Collaboration: Excellent communication skills and the ability to work effectively in a More ❯
/AWX), automation pipelines (ArgoCD/Azure DevOps or similar), Infrastructure as Code (Terraform) and Version Control (git) Containerization and Orchestration: Docker/Kubernetes Monitoring and Logging: (Prometheus/Grafana/Elastic stack) Networking and Security: Virtual Private Cloud/Security Groups and Network ACLs/Identity and Access management Storage technologies (Object/File) - Implementing and managing storage accounts More ❯
South East London, England, United Kingdom Hybrid / WFH Options
Cognitive Group | Part of the Focus Cloud Group
root cause analysis and preventive measures. Handle change requests, track recurring issues, and work on long-term fixes to improve system stability. Implement and maintain observability solutions using Prometheus, Grafana, and Splunk. Write PromQL queries for custom monitoring dashboards, alerting, and diagnostics. Manage and optimize CI/CD pipelines for automated testing, deployment, and rollback strategies. Develop and maintain automation … DevOps Engineer level Incident, change & problem management experience. This role is heavily operation-oriented, including on-call requirements Strong background in setup & operation of enterprise observability tooling, specifically Prometheus, Grafana and Splunk, including usage of PromQL Proficient in one or more languages of Python, Go, Bash, SQL Familiar with GitHub/GitOps/container orchestration/Kubernetes operations Working configuration More ❯
Portsmouth, England, United Kingdom Hybrid / WFH Options
Trust In SODA
life cycle. Infrastructure-as-code Bash Delivery methods and techniques, including agile scrum experience. Desirable Skills: RedHat OpenShift Hashicorp (such as Terraform, Packer, Vault) Ansible Observability (such as Prometheus, Grafana, Splunk) Containerised services (such as Postgres, Redis, Kafka, Keycloak, Elk) Experience of doing all the above at OS or S level YAML based pipelines. Immutable infrastructure Experience with MOD delivery More ❯
Hertford, Hertfordshire, South East, United Kingdom
Halian Technology Limited
logistics, warehousing, or multi-region ecommerce platforms. Knowledge of container orchestration (e.g., Kubernetes). Certifications in cloud (AWS, Azure), Linux, or infrastructure automation tools. Familiarity with tools like Prometheus, Grafana, ELK Stack, or Splunk. What We Offer: A visible leadership role with direct access to senior leadership Opportunity to shape infrastructure strategy for global operations A dynamic, fast-paced environment More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
blue/green & canary releases, and automated rollbacks. Proficiency with Docker, Kubernetes, and related cloud-native orchestration patterns. Proven track record building dashboards and visualizations across platforms such as Grafana, Datadog, and AWS. Experience with instrumentation tools like Prometheus and managing time-series stores such as Graphite and VictoriaMetrics. Solid understanding of networking, security, and compliance in cloud environments. Excellent More ❯
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
Twinstream Limited
Code expertise with tools like Terraform Config management with Ansible, Chef, or similar Docker/container orchestration (e.g. Kubernetes, OpenShift) CI/CD experience (e.g. Jenkins) Monitoring tools like Grafana, Prometheus, or InfluxDB Event-driven architecture with RabbitMQ or other AMQP tools Strong Linux, scripting, and networking fundamentals Experience with AWS (EC2, RDS, S3, Lambda) Desirables: Coding in Java, Go More ❯
BS1, Bristol, City of Bristol, United Kingdom Hybrid / WFH Options
Twinstream Limited
Code expertise with tools like Terraform Config management with Ansible, Chef, or similar Docker/container orchestration (e.g. Kubernetes, OpenShift) CI/CD experience (e.g. Jenkins) Monitoring tools like Grafana, Prometheus, or InfluxDB Event-driven architecture with RabbitMQ or other AMQP tools Strong Linux, scripting, and networking fundamentals Experience with AWS (EC2, RDS, S3, Lambda) Desirables: Coding in Java, Go More ❯
Employment Type: Permanent
Salary: £80000 - £110000/annum Hybrid, Great Benefits
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Twinstream Limited
Code expertise with tools like Terraform Config management with Ansible , Chef , or similar Docker/container orchestration (e.g. Kubernetes , OpenShift ) CI/CD experience (e.g. Jenkins ) Monitoring tools like Grafana , Prometheus , or InfluxDB Event-driven architecture with RabbitMQ or other AMQP tools Strong Linux , scripting, and networking fundamentals Experience with AWS (EC2, RDS, S3, Lambda) Desirables: Coding in Java , Go More ❯
Perl, JAVA) and automation skills. Thorough knowledge of Jenkins and pipeline using Groovy script. Experience with Docker containers and Amazon Linux 2023 AMI. Experience with system monitoring tools (e.g., Grafana, Alert Manager, Prometheus, and Node exporter). Ability to analyse and resolve complex infrastructure resource and application deployment issues. Experience with Git, Jira, Confluence, and ServiceNow for incident and change More ❯