such as Azure, AWS or GCP Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar. Proven track record of maintaining highly-available and performant production environments. Ability to identify and implement effective mitigation strategies and operational playbooks. Useful/Bonus Skills More ❯
Oldham, Greater Manchester, North West, United Kingdom
Innovative Technology
CI/CD systems (GitHub Actions, GitLab CI, Jenkins, etc.) Hands-on experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation) Knowledge of observability tools (Prometheus, Grafana, ELK stack, Datadog, etc.). Solid grasp of Linux systems and networking fundamentals Strong problem-solving and debugging skills Your Package & Perks: A competitive salary Flexible working hours 32 days holiday, (pro rata More ❯
CI/CD systems (GitHub Actions, GitLab CI, Jenkins, etc.) Hands-on experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation) Knowledge of observability tools (Prometheus, Grafana, ELK stack, Datadog, etc.). Solid grasp of Linux systems and networking fundamentals Strong problem-solving and debugging skills Your Package & Perks: A competitive salary Flexible working hours 32 days holiday, (pro rata More ❯
Manage cloud infrastructure (OCI, AWS, Azure, or GCP) using Infrastructure as Code tools like Terraform or Serverless Functions. Monitor system health and performance using tools like Prometheus, Grafana, or Datadog or NewRelic. Collaborate closely with development teams to automate builds, performance tests, and deployments. Ensure system security, compliance, and best practices are followed in deployment pipelines. Ensure network security with More ❯
Gloucester, Gloucestershire, United Kingdom Hybrid / WFH Options
Navtech, Inc
Liquibase) and Git for version control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS Cloudwatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, Cloudbees, Jenkins, GitHub Actions), containerization with Docker, and monitoring/logging tools. Demonstrated More ❯
Cardiff, South Glamorgan, United Kingdom Hybrid / WFH Options
Navtech, Inc
Liquibase) and Git for version control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS Cloudwatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, Cloudbees, Jenkins, GitHub Actions), containerization with Docker, and monitoring/logging tools. Demonstrated More ❯
. Preferred Qualifications Experience in hybrid cloud environments and integration with on-premise systems. Background in DevOps, SRE, or Infrastructure Engineering. Knowledge of monitoring/logging tools (e.g., CloudWatch, Datadog, Prometheus, ELK). Experience with enterprise security and compliance frameworks (e.g., ISO 27001, SOC 2, GDPR). Familiarity with cost modeling and optimization strategies in AWS. More ❯
Proficiency in scripting and automation using Python, Bash, or Go. Experience with Infrastructure as Code (Terraform, CloudFormation, or Ansible). Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, Datadog, ELK, etc.). Strong understanding of networking concepts (VPC, Load Balancers, DNS, Firewalls). Experience with DevOps methodologies, CI/CD pipelines, and GitOps practices. Experience with high-performance and More ❯
Edinburgh, Midlothian, United Kingdom Hybrid / WFH Options
Aberdeen
tech talks to share knowledge and promote adoption of tools and practices. About the Candidate The ideal candidate will possess the following: Experience with observability tools (eg, Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security first mindset. Strong programming skills in languages such as .Net, JavaScript, Python or similar. Experience with CI/CD More ❯
tech talks to share knowledge and promote adoption of tools and practices. About the Candidate The ideal candidate will possess the following: Experience with observability tools (e.g., Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineeringwith a security first mindset. Strong programming skills in languages such as .Net, JavaScript, Python or similar. Experience with CI/CD tools More ❯
ARM templates) Proficiency with container technologies like Docker and orchestration (Kubernetes, ECS, AKS, etc.) Strong scripting skills in Python, Bash, or PowerShell Experience with monitoring and logging tools (CloudWatch, Datadog, Prometheus, ELK stack, etc.) Familiarity with CI/CD tools (GitLab CI, Jenkins, GitHub Actions, etc.) The successful candidate must hold and maintain a high level of Security Clearance. Preferred More ❯
ARM templates) Proficiency with container technologies like Docker and orchestration (Kubernetes, ECS, AKS, etc.) Strong scripting skills in Python, Bash, or PowerShell Experience with monitoring and logging tools (CloudWatch, Datadog, Prometheus, ELK stack, etc.) Familiarity with CI/CD tools (GitLab CI, Jenkins, GitHub Actions, etc.) The successful candidate must hold and maintain a high level of Security Clearance. Preferred More ❯
and feature delivery. Experience with Azure Data technologies, such as Azure Data Factory (ADF), to support data integration and pipeline automation. Experience with observability and monitoring tools such as Datadog, Grafana, or the ELK Stack. In-depth knowledge of networking, security protocols, and firewall configurations. Experience with database management and performance optimisation strategies. Familiarity with software development methodologies, including Agile More ❯
orchestration (ECS, EKS, or Kubernetes) Experience setting up CI/CD pipelines using GitHub Actions or similar tools Familiarity with monitoring and alerting tools (e.g. Prometheus, Grafana, CloudWatch, Sentry, DataDog) A security-first mindset when designing and managing infrastructure Nice to Haves Experience working in regulated or high-trust environments Knowledge of zero-downtime deployment patterns and rollback strategies Exposure More ❯
as GitLab , GitHub Actions, or CircleCI Strong testing capabilities using JUnit , RestAssured , or similar frameworks Proactive with monitoring, observability, and system health Desirable Skills: Exposure to monitoring platforms like Datadog, Grafana, Prometheus , or PagerDuty Familiarity with Python scripting Experience with Kubernetes and deployment tools such as Helm Why Join H&B Tech? Help define the future of digital health & wellness More ❯
tools and container orchestration (Docker, ECS, or Kubernetes) Solid understanding of system/network security, IAM, VPC, and secure cloud configurations Familiarity with monitoring and logging tools (e.g., CloudWatch, Datadog, Prometheus, Sentry) Experience with Postgres, Redis, and scalable backend systems Bonus: Exposure to fintech or regulated environments, GDPR/data compliance, or SOC2 setup A little about us Our founders More ❯
needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
CircleCI also welcome Proficiency in testing frameworks like JUnit and RestAssured A passion for monitoring, observability , and maintaining resilient systems Desirable Skills: Experience with monitoring and alerting tools like Datadog, Prometheus, Grafana, or PagerDuty Exposure to Python scripting Familiarity with deployment platforms such as Kubernetes and tools like Helm Why Join H&B Tech? Be part of a fast-moving More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Harnham - Data & Analytics Recruitment
optimisation Nice to Have Experience with ML tooling (MLflow, Kubeflow) Knowledge of FastAPI , Databricks, or Snowflake Exposure to SRE practices or cloud security certifications Familiarity with Prometheus , Grafana , or Datadog Interested? If you want to be part of a world-class AI team at an early stage-where your infrastructure decisions will directly shape the company's success-apply today More ❯
skills — and a passion for building better together Nice to Have (We’ll Support Learning Too) Frontend development experience (especially with Angular) Experience with Kubernetes, Docker, GitHub Actions, or Datadog Familiarity with BDD (Gherkin, SpecFlow), observability tooling, and secure development practices Experience working in highly regulated or enterprise-scale environments What’s In It for You Be at the forefront More ❯
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Uniting Ambition
skills — and a passion for building better together Nice to Have (We’ll Support Learning Too) Frontend development experience (especially with Angular) Experience with Kubernetes, Docker, GitHub Actions, or Datadog Familiarity with BDD (Gherkin, SpecFlow), observability tooling, and secure development practices Experience working in highly regulated or enterprise-scale environments What’s In It for You Be at the forefront More ❯
test: Containerisation (e.g. Docker), Virtualisation and Provisioning, Workload and job scheduling (e.g. Kubernetes, Ray) on high core-count machines and rack-scale installations, Management and Observability (e.g. Prometheus, OpenTelemetry, DataDog, Splunk, etc.). 10+ years of relevant experience related to quality assurance/testing teams. Experience with the Atlassian suite and CI/CD platforms such as Jenkins; GitHub or More ❯