London, South East, England, United Kingdom Hybrid / WFH Options
Become
Azure, or GCP) and containerisation (e.g., Docker, Kubernetes) Experience with Infrastructure as Code tools (e.g., Terraform, Ansible, CloudFormation) Familiarity with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK, Datadog) Experience working in regulated environments such as banking, fintech, or insurance Prior experience working in or contributing to a Centre of Excellence team Strong scripting skills (e.g., Bash, Python) and More ❯
Manage cloud infrastructure (OCI, AWS, Azure, or GCP) using Infrastructure as Code tools like Terraform or Serverless Functions. Monitor system health and performance using tools like Prometheus, Grafana, or Datadog or NewRelic. Collaborate closely with development teams to automate builds, performance tests, and deployments. Ensure system security, compliance, and best practices are followed in deployment pipelines. Ensure network security with More ❯
Oldham, Greater Manchester, North West, United Kingdom
Innovative Technology
CI/CD systems (GitHub Actions, GitLab CI, Jenkins, etc.) Hands-on experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation) Knowledge of observability tools (Prometheus, Grafana, ELK stack, Datadog, etc.). Solid grasp of Linux systems and networking fundamentals Strong problem-solving and debugging skills Your Package & Perks: A competitive salary Flexible working hours 32 days holiday, (pro rata More ❯
CI/CD systems (GitHub Actions, GitLab CI, Jenkins, etc.) Hands-on experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation) Knowledge of observability tools (Prometheus, Grafana, ELK stack, Datadog, etc.). Solid grasp of Linux systems and networking fundamentals Strong problem-solving and debugging skills Your Package & Perks: A competitive salary Flexible working hours 32 days holiday, (pro rata More ❯
standards and conventions Deep desire and practice maintaining uniformity and cleanliness in a large codebases and infrastructure projects Desirable Skills & Experience Hands on experience monitoring large production infrastructure using DataDog and CloudWatch Previously owned end-to-end responsibility in a service, including development and production support Experience using configuration management tools such as Chef, Ansible or Puppet Proficient writing code More ❯
. Preferred Qualifications Experience in hybrid cloud environments and integration with on-premise systems. Background in DevOps, SRE, or Infrastructure Engineering. Knowledge of monitoring/logging tools (e.g., CloudWatch, Datadog, Prometheus, ELK). Experience with enterprise security and compliance frameworks (e.g., ISO 27001, SOC 2, GDPR). Familiarity with cost modeling and optimization strategies in AWS. More ❯
. Knowledge of networking concepts and security best practices. Familiarity with SRE activities and best practices. Familiarity with DevOps practices and tools. Experience with monitoring and logging tools (e.g., DataDog, Coralogix, AWS CloudWatch, Azure Monitor). Excellent problem-solving and stakeholder management skills. Strong written and oral communication skills. Experience collaborating across multiple topics in parallel. Responsibilities: Collaborating with technical More ❯
Proficiency in scripting and automation using Python, Bash, or Go. Experience with Infrastructure as Code (Terraform, CloudFormation, or Ansible). Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, Datadog, ELK, etc.). Strong understanding of networking concepts (VPC, Load Balancers, DNS, Firewalls). Experience with DevOps methodologies, CI/CD pipelines, and GitOps practices. Experience with high-performance and More ❯
or Windows administration, with the ability to architect secure, performant, and highly available cloud solutions. Proficiency with monitoring and log analytics tools such as AWS CloudWatch, ELK Stack, Prometheus, Datadog, or New Relic, to maintain observability and ensure operational excellence. Demonstrated leadership skills in managing complex, high-pressure situations and guiding teams through incident resolution. Exceptional communication and presentation skills More ❯
deploying PHP systems Experience with Amazon ECS, RDS, ElastiCache, S3, Lambda Previously worked on migration projects Used Ansible or other configuration management tools Implemented or used monitoring tools (e.g. Datadog, Sentry, Prometheus) to solve performance problems Understanding of security principles and confident assisting with security issues These skills are preferable, training and development opportunities will be available for any areas More ❯
Gloucester, Gloucestershire, United Kingdom Hybrid / WFH Options
Navtech, Inc
Liquibase) and Git for version control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS Cloudwatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, Cloudbees, Jenkins, GitHub Actions), containerization with Docker, and monitoring/logging tools. Demonstrated More ❯
ARM templates) Proficiency with container technologies like Docker and orchestration (Kubernetes, ECS, AKS, etc.) Strong scripting skills in Python, Bash, or PowerShell Experience with monitoring and logging tools (CloudWatch, Datadog, Prometheus, ELK stack, etc.) Familiarity with CI/CD tools (GitLab CI, Jenkins, GitHub Actions, etc.) The successful candidate must hold and maintain a high level of Security Clearance. Preferred More ❯
ARM templates) Proficiency with container technologies like Docker and orchestration (Kubernetes, ECS, AKS, etc.) Strong scripting skills in Python, Bash, or PowerShell Experience with monitoring and logging tools (CloudWatch, Datadog, Prometheus, ELK stack, etc.) Familiarity with CI/CD tools (GitLab CI, Jenkins, GitHub Actions, etc.) The successful candidate must hold and maintain a high level of Security Clearance. Preferred More ❯
configuration management tools (e.g., Ansible, Puppet, Chef). Knowledge of infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation). Experience with monitoring and logging tools (e.g., Prometheus, ELK Stack, Datadog). Passion for continuous learning and professional development. IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive More ❯
as GitLab , GitHub Actions, or CircleCI Strong testing capabilities using JUnit , RestAssured , or similar frameworks Proactive with monitoring, observability, and system health Desirable Skills: Exposure to monitoring platforms like Datadog, Grafana, Prometheus , or PagerDuty Familiarity with Python scripting Experience with Kubernetes and deployment tools such as Helm Why Join H&B Tech? Help define the future of digital health & wellness More ❯
and feature delivery. Experience with Azure Data technologies, such as Azure Data Factory (ADF), to support data integration and pipeline automation. Experience with observability and monitoring tools such as Datadog, Grafana, or the ELK Stack. In-depth knowledge of networking, security protocols, and firewall configurations. Experience with database management and performance optimisation strategies. Familiarity with software development methodologies, including Agile More ❯
needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
/CD tools such as GitlabCI, CircleCI, Github Actions, and GitOps using ArgoCD, FluxCD Troubleshooting and debugging applications using Observability tooling across microservices and serverless applications such as Splunk, DataDog Managing ephemeral secrets and credentials using Hashicorp Vault Managing least privileged access to cloud resources using TPAM solutions such as Hashicorp Boundary Bonus Points for experience with: Production experience architecting More ❯
CircleCI also welcome Proficiency in testing frameworks like JUnit and RestAssured A passion for monitoring, observability , and maintaining resilient systems Desirable Skills: Experience with monitoring and alerting tools like Datadog, Prometheus, Grafana, or PagerDuty Exposure to Python scripting Familiarity with deployment platforms such as Kubernetes and tools like Helm Why Join H&B Tech? Be part of a fast-moving More ❯
containerization (Docker, Kubernetes), and CI/CD practices. Familiarity with Guidewire Cloud architecture models, deployment automation, and support practices. Experience integrating cloud infrastructure with DevOps, Monitoring (e.g., CloudWatch, Prometheus, Datadog), and Logging tools (ELK, Splunk). Solid understanding of cloud security, compliance (including regulatory needs in insurance), and networking. Knowledge of data migration, analytics integration, and insurance data models is More ❯
containerization (Docker, Kubernetes), and CI/CD practices. Familiarity with Guidewire Cloud architecture models, deployment automation, and support practices. Experience integrating cloud infrastructure with DevOps, Monitoring (e.g., CloudWatch, Prometheus, Datadog), and Logging tools (ELK, Splunk). Solid understanding of cloud security, compliance (including regulatory needs in insurance), and networking. Knowledge of data migration, analytics integration, and insurance data models is More ❯
Our stack AWS as our cloud compute platform Kubernetes (EKS) for container runtime and orchestration RDS (PostgreSQL, MySQL), Kafka, Redis Terraform for infrastructure as code Lambda and Step Functions Datadog for Observability Github actions for CICD Frontend is React Backend services are developed in NodeJS (TypeScript) As we are an international team, please submit your application and CV in English. More ❯
Our stack AWS as our cloud compute platform Kubernetes (EKS) for container runtime and orchestration RDS (PostgreSQL, MySQL), Kafka, Redis Terraform for infrastructure as code Lambda and Step Functions Datadog for Observability Github actions for CICD Frontend is React Backend services are developed in NodeJS (TypeScript) As we are an international team, please submit your application and CV in English. More ❯