Bristol, England, United Kingdom Hybrid / WFH Options
Canada Life Assurance Europe plc
Observability Designing, implementing and day-to-day use of logging and monitoring tools to capture data for alerting and issue identification and resolution using DataDog, App Insights or similar tools. Designing applications and infrastructure for observability, security, and reliability. Networking & Security Monitor and enhance network performance, ensuring high levels of More ❯
Gloucester, England, United Kingdom Hybrid / WFH Options
Navtech, Inc
control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS Cloudwatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, Cloudbees, Jenkins, GitHub Actions), containerization with Docker, and More ❯
to cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Curiosity : A hunger to learn and grow your skills. Problem solving: Strong analytical problem-solving skills and attention to detail. You have More ❯
to cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Curiosity : A hunger to learn and grow your skills. Problem solving: Strong analytical problem-solving skills and attention to detail. You have More ❯
Pipelines and AWS including EKS Lambda and CloudFormation Infrastructure as Code and GitOps : Terraform Bicep Pulumi ArgoCD and FluxCD Observability : Prometheus Grafana OpenTelemetry and Datadog Security and Compliance : HashiCorp Vault Azure Key Vault AWS KMS OPA Gatekeeper and Drata or similar ? Interested in exploring this further This is a high More ❯
Pipelines and AWS including EKS Lambda and CloudFormation Infrastructure as Code and GitOps : Terraform Bicep Pulumi ArgoCD and FluxCD Observability : Prometheus Grafana OpenTelemetry and Datadog Security and Compliance : HashiCorp Vault Azure Key Vault AWS KMS OPA Gatekeeper and Drata or similar ? Interested in exploring this further This is a high More ❯
scalable systems in an object-oriented language. Experience deploying to cloud platforms (AWS, GCP, Azure), containerization (Docker), infrastructure-as-code (Terraform), and observability tools (Datadog, Grafana). Curiosity and eagerness to learn. Strong problem-solving skills and attention to detail. Excellent communication skills for technical and non-technical stakeholders. We More ❯
and Provisioning, Workload and job scheduling (e.g. Kubernetes, Ray) on high core-count machines and rack-scale installations, Management and Observability (e.g. Prometheus, OpenTelemetry, DataDog, Splunk, etc.). 10+ years of relevant experience related to quality assurance/testing teams. Experience with the Atlassian suite and CI/CD platforms More ❯
and Provisioning, Workload and job scheduling (e.g. Kubernetes, Ray) on high core-count machines and rack-scale installations, Management and Observability (e.g. Prometheus, OpenTelemetry, DataDog, Splunk, etc.). 10+ years of relevant experience related to quality assurance/testing teams. Experience with the Atlassian suite and CI/CD platforms More ❯
networking (DNS, DHCP, TCP/IP, firewalls, routing). Experience with Windows and Linux server environments. Experience with monitoring and notification tools (e.g., Cloudwatch, Datadog, Zabbix, Solarwinds, Nagios, PRTG, Opsgenie or Pagerduty). Scripting skills (PowerShell, JavaScript, Bash, Python preferred). Knowledge of backup and recovery tools (e.g., Veeam, Azure More ❯
and Provisioning, Workload and job scheduling (e.g. Kubernetes, Ray) on high core-count machines and rack-scale installations, Management and Observability (e.g. Prometheus, OpenTelemetry, DataDog, Splunk, etc.). 10+ years of relevant experience related to quality assurance/testing teams. Experience with the Atlassian suite and CI/CD platforms More ❯
experience in observability such as white and black box monitoring, service level objectives, alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc. Ability to communicate data-based solutions with complex reporting and visualization methods Recognized as an active contributor of the engineering community Continues to More ❯
high-quality contributions. Software Releases & Deployment: Comfortable managing releases, versioning strategies, and troubleshooting issues in live environments. Log Monitoring & Observability: Familiarity with tools like Datadog, Splunk, or ELK Stack for real-time monitoring, troubleshooting, and performance optimization. Problem-Solving & Maintenance: Analytical mindset with the ability to assess, improve, and sustain More ❯
monitoring and observability tools to proactively manage system health. Skills and Strengths: AWS (Amazon Web Services) Auto Scaling Fargate Route53 Observability tools (New Relic, DataDog, Splunk) Scripting (Ansible, Bash, Python, GO) CI/CD Primary Job Responsibilities: Design and support EC2/ECS/EKS/Fargate environments for high More ❯
monitoring and observability tools to proactively manage system health. Skills and Strengths: AWS (Amazon Web Services) Auto Scaling Fargate Route53 Observability tools (New Relic, DataDog, Splunk) Scripting (Ansible, Bash, Python, GO) CI/CD Primary Job Responsibilities: Design and support EC2/ECS/EKS/Fargate environments for high More ❯
monitoring and observability tools to proactively manage system health. Skills and Strengths: AWS (Amazon Web Services) Auto Scaling Fargate Route53 Observability tools (New Relic, DataDog, Splunk) Scripting (Ansible, Bash, Python, GO) CI/CD Primary Job Responsibilities: Design and support EC2/ECS/EKS/Fargate environments for high More ❯
monitoring and observability tools to proactively manage system health. Skills and Strengths: AWS (Amazon Web Services) Auto Scaling Fargate Route53 Observability tools (New Relic, DataDog, Splunk) Scripting (Ansible, Bash, Python, GO) CI/CD Primary Job Responsibilities: Design and support EC2/ECS/EKS/Fargate environments for high More ❯
monitoring and observability tools to proactively manage system health. Skills and Strengths: AWS (Amazon Web Services) Auto Scaling Fargate Route53 Observability tools (New Relic, DataDog, Splunk) Scripting (Ansible, Bash, Python, GO) CI/CD Primary Job Responsibilities: Design and support EC2/ECS/EKS/Fargate environments for high More ❯
monitoring and observability tools to proactively manage system health. Skills and Strengths: AWS (Amazon Web Services) Auto Scaling Fargate Route53 Observability tools (New Relic, DataDog, Splunk) Scripting (Ansible, Bash, Python, GO) CI/CD Primary Job Responsibilities: Design and support EC2/ECS/EKS/Fargate environments for high More ❯
Bath, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
people to enjoy them). Perform penetration testing services and infrastructure testing ethically. Extend security automation and monitoring with tools like CircleCI, GitHub Actions, DataDog, AWS Security Hub, etc. Harden everything from container runtimes to APIs to artifact pipelines. Write secure code, review others' code, and help everyone improve their More ❯
Cheltenham, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
get people to enjoy them) Pen-test services and infra (ethically, please). Extend security automation and monitoring with tools like CircleCI, GitHub Actions, DataDog, AWS Security Hub, etc. Harden everything from container runtimes to APIs to artifact pipelines. Write secure code, review other people’s code, and help everyone More ❯
models, focusing on aspects like optimisation, scalability & efficiency You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber Key Responsibilities Partner with customers to identify and address their ML deployment needs Implement and optimise ML solutions using Python More ❯
models, focusing on aspects like optimisation, scalability & efficiency You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber Key Responsibilities Partner with customers to identify and address their ML deployment needs Implement and optimise ML solutions using Python More ❯
Gloucester, Gloucestershire, UK Hybrid / WFH Options
Few&Far
models, focusing on aspects like optimisation, scalability & efficiency You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber Key Responsibilities Partner with customers to identify and address their ML deployment needs Implement and optimise ML solutions using Python More ❯
models, focusing on aspects like optimisation, scalability & efficiency You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber Key Responsibilities Partner with customers to identify and address their ML deployment needs Implement and optimise ML solutions using Python More ❯