London, South East, England, United Kingdom Hybrid / WFH Options
Become
Azure, or GCP) and containerisation (e.g., Docker, Kubernetes) Experience with Infrastructure as Code tools (e.g., Terraform, Ansible, CloudFormation) Familiarity with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK, Datadog) Experience working in regulated environments such as banking, fintech, or insurance Prior experience working in or contributing to a Centre of Excellence team Strong scripting skills (e.g., Bash, Python) and More ❯
Manage cloud infrastructure (OCI, AWS, Azure, or GCP) using Infrastructure as Code tools like Terraform or Serverless Functions. Monitor system health and performance using tools like Prometheus, Grafana, Datadog, or New Relic. Collaborate closely with development teams to automate builds, performance tests, and deployments. Ensure system security, compliance, and best practices are followed in deployment pipelines. Ensure network security with More ❯
standards and conventions Deep desire and practice maintaining uniformity and cleanliness in large codebases and infrastructure projects Desirable Skills & Experience Hands-on experience monitoring large production infrastructure using Datadog and CloudWatch Previously owned end-to-end responsibility in a service, including development and production support Experience using configuration management tools such as Chef, Ansible or Puppet Proficient writing code More ❯
. Preferred Qualifications Experience in hybrid cloud environments and integration with on-premise systems. Background in DevOps, SRE, or Infrastructure Engineering. Knowledge of monitoring/logging tools (e.g., CloudWatch, Datadog, Prometheus, ELK). Experience with enterprise security and compliance frameworks (e.g., ISO 27001, SOC 2, GDPR). Familiarity with cost modeling and optimization strategies in AWS. More ❯
. Knowledge of networking concepts and security best practices. Familiarity with SRE activities and best practices. Familiarity with DevOps practices and tools. Experience with monitoring and logging tools (e.g., Datadog, Coralogix, AWS CloudWatch, Azure Monitor). Excellent problem-solving and stakeholder management skills. Strong written and oral communication skills. Experience collaborating across multiple topics in parallel. Responsibilities: Collaborating with technical More ❯
distributed systems, microservices architecture, and RESTful API design. • Hands-on experience with Kubernetes and container orchestration. • Familiarity with monitoring, alerting, and logging tools (e.g., Prometheus, Grafana, ELK stack, or Datadog). Experience with Elastic will be highly helpful with this position. • Hands-on experience with incident response, including designing and improving incident management processes. • Expertise in Observability practices, including metrics More ❯
Proficiency in scripting and automation using Python, Bash, or Go. Experience with Infrastructure as Code (Terraform, CloudFormation, or Ansible). Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, Datadog, ELK, etc.). Strong understanding of networking concepts (VPC, Load Balancers, DNS, Firewalls). Experience with DevOps methodologies, CI/CD pipelines, and GitOps practices. Experience with high-performance and More ❯
or Windows administration, with the ability to architect secure, performant, and highly available cloud solutions. Proficiency with monitoring and log analytics tools such as AWS CloudWatch, ELK Stack, Prometheus, Datadog, or New Relic, to maintain observability and ensure operational excellence. Demonstrated leadership skills in managing complex, high-pressure situations and guiding teams through incident resolution. Exceptional communication and presentation skills More ❯
deploying PHP systems Experience with Amazon ECS, RDS, ElastiCache, S3, Lambda Previously worked on migration projects Used Ansible or other configuration management tools Implemented or used monitoring tools (e.g. Datadog, Sentry, Prometheus) to solve performance problems Understanding of security principles and confident assisting with security issues These skills are preferable, training and development opportunities will be available for any areas More ❯
Gloucester, Gloucestershire, United Kingdom Hybrid / WFH Options
Navtech, Inc
Liquibase) and Git for version control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS CloudWatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, CloudBees, Jenkins, GitHub Actions), containerization with Docker, and monitoring/logging tools. Demonstrated More ❯
Cardiff, South Glamorgan, United Kingdom Hybrid / WFH Options
Navtech, Inc
Liquibase) and Git for version control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS CloudWatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, CloudBees, Jenkins, GitHub Actions), containerization with Docker, and monitoring/logging tools. Demonstrated More ❯
e.g., Terraform) Experienced with modern API design and implementations, preferably in Java or Golang. Experienced in cloud infrastructure cost management or FinOps. Experience using monitoring tools like Splunk, Datadog, Prometheus, Grafana Exceptional customer focus, ownership, and drive. ABOUT BUSINESS UNIT IBM Software infuses core business operations with intelligence, from machine learning to generative AI, to help make organizations more More ❯
ARM templates) Proficiency with container technologies like Docker and orchestration (Kubernetes, ECS, AKS, etc.) Strong scripting skills in Python, Bash, or PowerShell Experience with monitoring and logging tools (CloudWatch, Datadog, Prometheus, ELK stack, etc.) Familiarity with CI/CD tools (GitLab CI, Jenkins, GitHub Actions, etc.) The successful candidate must hold and maintain a high level of Security Clearance. Preferred More ❯
tools and container orchestration (Docker, ECS, or Kubernetes) Solid understanding of system/network security, IAM, VPC, and secure cloud configurations Familiarity with monitoring and logging tools (e.g., CloudWatch, Datadog, Prometheus, Sentry) Experience with Postgres, Redis, and scalable backend systems Bonus: Exposure to fintech or regulated environments, GDPR/data compliance, or SOC2 setup A little about us Our founders More ❯
Proficiency with cloud providers like Azure, AWS, or GCP. Experience with Infrastructure as Code tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring and logging tools like Datadog, Prometheus, Grafana, or similar. A track record of maintaining highly available and performant production environments. Ability to develop effective mitigation strategies and operational playbooks. Useful/Bonus Skills: Experience with More ❯
using Kubernetes or similar tools in production deployments Experience with: AWS security best practices including IAM, security groups, encryption, and compliance frameworks Monitoring tools such as CloudWatch, Prometheus, Grafana, Datadog, or New Relic Infrastructure as Code using Terraform Containerised CI/CD solutions Linux system administration, including shell scripting and system optimisation Desirable Skills Experience with AWS services such as SQS More ❯
configuration management tools (e.g., Ansible, Puppet, Chef). Knowledge of infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation). Experience with monitoring and logging tools (e.g., Prometheus, ELK Stack, Datadog). Passion for continuous learning and professional development. IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive More ❯
as GitLab, GitHub Actions, or CircleCI Strong testing capabilities using JUnit, RestAssured, or similar frameworks Proactive with monitoring, observability, and system health Desirable Skills: Exposure to monitoring platforms like Datadog, Grafana, Prometheus, or PagerDuty Familiarity with Python scripting Experience with Kubernetes and deployment tools such as Helm Why Join H&B Tech? Help define the future of digital health & wellness More ❯
software applications and optimizing fleet utilization - Strong understanding of network fundamentals (DNS, DHCP, TCP/IP, routing, load balancing, load shedding) and experience with monitoring frameworks (such as CloudWatch, Datadog, Grafana, Elastic or similar) - Experience scripting operating system tasks in Bash, Python, etc. and with Infrastructure as Code (such as CDK, CloudFormation, Puppet, Chef, Ansible, or similar) - Experience operating services More ❯
Washington, Washington DC, United States Hybrid / WFH Options
Epsilon Inc
Kubernetes), and cloud-native development practices Advanced knowledge of configuration management tools (Ansible, Puppet, Chef), version control systems (Git), and infrastructure automation frameworks Experience with monitoring and logging tools (Datadog, Splunk, ELK Stack), application performance monitoring solutions, and security scanning tools for vulnerability management Proficiency in programming and scripting languages including Java, Python, PowerShell, Bash, and experience with API development More ❯
and feature delivery. Experience with Azure Data technologies, such as Azure Data Factory (ADF), to support data integration and pipeline automation. Experience with observability and monitoring tools such as Datadog, Grafana, or the ELK Stack. In-depth knowledge of networking, security protocols, and firewall configurations. Experience with database management and performance optimisation strategies. Familiarity with software development methodologies, including Agile More ❯