London, England, United Kingdom Hybrid / WFH Options
Your Next Hire
S3, VPC, etc. IaC: AWS CDK, Terraform, or CloudFormation CI/CD pipelines + scripting (Python, Bash, PowerShell) Containerized applications (Docker + ECS) Observability tooling like New Relic, CloudWatch, Prometheus, Datadog Who we’re looking for: Proven SRE or platform engineering experience in a high-availability environment Passion for reliability, automation, and system performance Strong problem-solving mindset and solid More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with cloud platforms such as AWS, Azure, or GCP, including infrastructure as code tools like Terraform or CloudFormation. Strong scripting and automation skills, with More ❯
tools. Proficient in Linux operating systems and shell scripting Strong understanding of CI/CD pipelines and tools (e.g., Jenkins, GitLab). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana). Soft Skills: Excellent problem-solving and troubleshooting abilities. Strong communication and collaboration skills, with the ability to work effectively with cross-functional teams. Ability to manage multiple priorities More ❯
tools. Proficient in Linux operating systems and shell scripting Strong understanding of CI/CD pipelines and tools (e.g., Jenkins, GitLab). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana). Soft Skills: Excellent problem-solving and troubleshooting abilities. Strong communication and collaboration skills, with the ability to work effectively with cross-functional teams. Ability to manage multiple priorities More ❯
tools. Proficient in Linux operating systems and shell scripting Strong understanding of CI/CD pipelines and tools (e.g., Jenkins, GitLab). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana). Soft Skills: Excellent problem-solving and troubleshooting abilities. Strong communication and collaboration skills, with the ability to work effectively with cross-functional teams. Ability to manage multiple priorities More ❯
Cloud experience and good understanding of Kubernetes and OpenShift Hands on experience deploying, testing, and building CI/CD pipelines Experience working with Monitoring and Logging systems, particularly Splunk, Prometheus & Grafana Excellent analysis, debugging, root-cause identification, and troubleshooting skills Hands-on experience with Oracle Databases and willingness to increase expertise (OCA or OCP certification is a plus) Strong experience More ❯
London, England, United Kingdom Hybrid / WFH Options
CATCHES
features and migrations. REQUIREMENTS Extensive experience orchestrating infrastructure at scale across cloud and baremetal. SRE & Kubernetes expertise (GKE/AKS/EKS) and container-native observability stacks (Datadog/Prometheus/Grafana). Proven ownership of CI/CD pipelines (GitHub Actions, Cloud Build, Azure DevOps, etc.) and release automation. Proven experience with multiplatform scripting languages (Python, bash, PowerShell). More ❯
server administration, at an advanced level (Ubuntu/RHEL). Working knowledge of at least one scripting language, e.g. Python, Bash. Monitoring and logging of systems using tools like Prometheus, Grafana, or ELK. Experience with IaC tools (ideally Ansible). Working knowledge of version control methodologies and practices. Desirable: Experience with air gapped environments. Understanding of cross domain solutions. Working More ❯
Advocacy for CI/CD, building pipelines in GitHub Actions, GitLab CI or CircleCI with automated tests and security gates. An observability and SRE mindset, using tools such as Prometheus, Grafana, Loki or ELK and OpenTelemetry. A security-first but pragmatic approach, covering secrets management, image provenance and zero-trust networking. Proficiency in at least one systems language (Go, Python More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Stealth AI Startup
Advocacy for CI/CD, building pipelines in GitHub Actions, GitLab CI or CircleCI with automated tests and security gates. An observability and SRE mindset, using tools such as Prometheus, Grafana, Loki or ELK and OpenTelemetry. A security-first but pragmatic approach, covering secrets management, image provenance and zero-trust networking. Proficiency in at least one systems language (Go, Python More ❯
automation skills. Thorough knowledge of Jenkins and pipeline using Groovy script. Experience with Docker containers and Amazon Linux 2023 AMI. Experience with system monitoring tools (e.g., Grafana, Alert Manager, Prometheus, and Node exporter). Ability to analyse and resolve complex infrastructure resource and application deployment issues. Experience with Git, Jira, Confluence, and ServiceNow for incident and change management. Knowledge of More ❯
Skills Experience managing GPU servers, containerised workloads, and Kubernetes clusters. Familiarity with teleoperation systems, game streaming, or low-latency video/control pipelines. Experience monitoring infrastructure with tools like Prometheus, Grafana, Netdata, or similar. Ability to write basic scripts in Python, Bash, or similar for automation and monitoring. Strong documentation and communication skills for cross-functional collaboration. Benefits High competitive More ❯
Skills Experience managing GPU servers, containerised workloads, and Kubernetes clusters. Familiarity with teleoperation systems, game streaming, or low-latency video/control pipelines. Experience monitoring infrastructure with tools like Prometheus, Grafana, Netdata, or similar. Ability to write basic scripts in Python, Bash, or similar for automation and monitoring. Strong documentation and communication skills for cross-functional collaboration. Benefits High competitive More ❯
scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). Expertise in setting up and maintaining monitoring systems (e.g., Prometheus, Grafana). Some other highly valued skills may include: Experience with cloud platforms (e.g., AWS, Azure, Google Cloud). Knowledge of containerization and orchestration tools (e.g., Docker, Kubernetes). Ability More ❯
scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). Expertise in setting up and maintaining monitoring systems (e.g., Prometheus, Grafana). Some other highly valued skills may include: Experience with cloud platforms (e.g., AWS, Azure, Google Cloud). Knowledge of containerization and orchestration tools (e.g., Docker, Kubernetes). Ability More ❯
Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef)./li li Expertise in setting up and maintaining monitoring systems (e.g., Prometheus, Grafana)./li/ul p Some other highly valued skills may include:/p ul li Experience with cloud platforms (e.g., AWS, Azure, Google Cloud)./li More ❯
etc.) Database administration Infrastructure provisioning Process automation Respond to change requests Skills & Experience Oracle DB Powershell SQL Docker (with Docker Swarm) Elastic Stack Typescript/React/Node Go Prometheus/Grafana ESRI Maps Ansible Windows & Linux Jenkins Automation skills: Automation is a key skill domain for this role. Specific automation skills are: Continuous Integration - Skilled in the tooling and More ❯
healing systems etc.) Database administration Infrastructure provisioning Process automation Respond to change requests Skills & Experience Oracle DB Docker (with Docker Swarm) Elastic Stack Typescript/React/Node Go Prometheus/Grafana ESRI Maps Ansible Windows & Linux Jenkins Automation skills: Automation is a key skill domain for this role. Specific automation skills are: Continuous Integration - Skilled in the tooling and More ❯
Databases - Postgres, MariaDB, MongoDB, ClickHouse, Redis, JupyterLab, Metabase Data Engineering & Orchestration - Python, Airflow, Kafka, DataHub Cloud & Infrastructure - AWS, K8s DevOps & CI/CD - Git, GitLab CI, DBS, Grafana, ELK, Prometheus, Docker, Docker Compose Why join us? Shape the future of a data business at the forefront of global payments insights A chance to work with a vibrant, friendly team in More ❯
Blair West have been retained to support One utility Bill's search for a DevOps Engineer. This is a fantastic time to join a fast-growing and successful tech business that simplifies household bill management through a single, fixed monthly More ❯
Crawley, Sussex, United Kingdom Hybrid / WFH Options
Thales Group
Location: Crawley, United Kingdom In fast changing markets, customers worldwide rely on Thales. Thales is a business where brilliant people from all over the world come together to share ideas and inspire each other. In aerospace, transportation, defence, security and More ❯
Bedford, Bedfordshire, United Kingdom Hybrid / WFH Options
with AWS and containerisation using Kubernetes (EKS) Expert-level proficiency in Terraform for building Infrastructure as Code Solid scripting skills (Bash, Python, etc.) Experience with monitoring tools such as Prometheus, Grafana , etc. More ❯