Oxford, Oxfordshire, United Kingdom Hybrid / WFH Options
Nominet
control (Git) and testing practices (integration, automation). Problem-solving, collaboration, and growth mindset. Nice to have: Containerisation and orchestration (Docker, Kubernetes). Infrastructure as Code (Terraform, Ansible). Observability tools (Prometheus, Grafana, Databricks). What To Expect Next: 1st stage: Introduction call with a member of the TA team (30 mins). 2nd stage: Hiring manager interview (60 mins). What ...
and also with another public cloud provider such as AWS or GCP. Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar. Proven track record of maintaining highly available and performant production environments. Ability to identify and implement effective mitigation strategies and ...
London (City of London), South East England, United Kingdom
BGC Group
built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging ...
Edinburgh, Midlothian, United Kingdom Hybrid / WFH Options
Aberdeen Group
internal workshops, brown bags, or tech talks to share knowledge and promote adoption of tools and practices. About the Candidate: The ideal candidate will possess the following: Experience with observability tools (e.g., Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security-first mindset. Strong programming skills in languages such as .NET, JavaScript, Python or similar. ...
Manchester, England, United Kingdom Hybrid / WFH Options
Suits Me
implementing AWS infrastructure and services using IaC (e.g. Terraform, CDK). Owning and improving CI/CD pipelines (e.g. GitHub Actions, Jenkins) to streamline secure, automated deployments. Building and managing observability tooling (e.g. CloudWatch, Grafana, OpenTelemetry) for proactive system monitoring and alerting. Developing event-driven containerised and serverless systems using Lambda, ECS and EKS. Championing reliability and security, embedding best practices ...
Warrington, Cheshire, North West England, United Kingdom Hybrid / WFH Options
Suits Me
implementing AWS infrastructure and services using IaC (e.g. Terraform, CDK). Owning and improving CI/CD pipelines (e.g. GitHub Actions, Jenkins) to streamline secure, automated deployments. Building and managing observability tooling (e.g. CloudWatch, Grafana, OpenTelemetry) for proactive system monitoring and alerting. Developing event-driven containerised and serverless systems using Lambda, ECS and EKS. Championing reliability and security, embedding best practices ...
Bolton, Greater Manchester, North West England, United Kingdom Hybrid / WFH Options
Suits Me
implementing AWS infrastructure and services using IaC (e.g. Terraform, CDK). Owning and improving CI/CD pipelines (e.g. GitHub Actions, Jenkins) to streamline secure, automated deployments. Building and managing observability tooling (e.g. CloudWatch, Grafana, OpenTelemetry) for proactive system monitoring and alerting. Developing event-driven containerised and serverless systems using Lambda, ECS and EKS. Championing reliability and security, embedding best practices ...
Edinburgh, Midlothian, United Kingdom Hybrid / WFH Options
Aberdeen
internal workshops, brown bags, or tech talks to share knowledge and promote adoption of tools and practices. About the Candidate: The ideal candidate will possess the following: Experience with observability tools (e.g., Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security-first mindset. Strong programming skills in languages such as .NET, JavaScript, Python or similar. ...
AI-enhanced automation. Build and maintain CI/CD (Jenkins, GitLab CI, GitHub Actions, ArgoCD). Cloud infrastructure (AWS, Azure, GCP), container orchestration (Kubernetes, Docker). Logging, monitoring, and observability (Prometheus, Grafana, ELK/EFK), including AI-driven log analysis and incident prediction. Experience supporting MLOps: deploying ML workflows, ensuring model traceability and compliance. Use of AI assistants and workflow ...
as Terraform or CloudFormation. Implement and manage CI/CD pipelines, enabling continuous integration and deployment of mission-critical applications. Monitor and optimise system performance, availability, and security, applying observability best practices. Collaborate in an Agile environment, engaging with stakeholders to gather requirements and deliver iterative improvements. This role allows you to apply your expertise to challenging problems while shaping ...
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Fruition Group
DynamoDB, S3, IAM, and RDS. Understanding of DevOps practices, including CI/CD pipelines and automation. Strong knowledge of cloud security best practices, IAM policies, and networking. Experience with observability tools like CloudWatch, Prometheus, or Grafana. Preferred: Experience mentoring junior team members and promoting DevOps practices. Familiarity with multi-cloud environments (e.g., GCP, Azure). Knowledge of database performance optimisation. ...
Derbyshire, Burton upon Trent, Staffordshire, United Kingdom
Amtis Professional Ltd
CloudFormation or ARM templates. Scripting & Automation - Proficient in PowerShell, Bash, or Python. Infrastructure as Code (IaC) - Hands-on experience with Terraform, Bicep, or ARM. Certified: Terraform Associate preferred. Monitoring & Observability - Familiarity with tools like Azure Monitor, AWS CloudWatch, Prometheus, Grafana. Security & Compliance - Strong understanding of IAM, cloud security, compliance frameworks. Cloud Platform Expertise: Proven experience with AWS and Azure cloud ...
Burton-On-Trent, Staffordshire, West Midlands, United Kingdom
Amtis Professional Ltd
CloudFormation or ARM templates. Scripting & Automation - Proficient in PowerShell, Bash, or Python. Infrastructure as Code (IaC) - Hands-on experience with Terraform, Bicep, or ARM. Certified: Terraform Associate preferred. Monitoring & Observability - Familiarity with tools like Azure Monitor, AWS CloudWatch, Prometheus, Grafana. Security & Compliance - Strong understanding of IAM, cloud security, compliance frameworks. Cloud Platform Expertise: Proven experience with AWS and Azure cloud ...
London (City of London), South East England, United Kingdom
Humanoid
and core infrastructure - from development and deployment to monitoring and continuous improvement. Build and maintain robust CI/CD pipelines for both software and ML workflows. Ensure reliability, scalability, observability, and security of production systems and ML infrastructure. Automate deployment, orchestration, and environment management using modern DevOps tooling. Collaborate closely with software engineers, data scientists, and product teams to bring ...