AKS API Management and DevOps Pipelines and AWS including EKS Lambda and CloudFormation Infrastructure as Code and GitOps : Terraform Bicep Pulumi ArgoCD and FluxCD Observability : Prometheus Grafana OpenTelemetry and Datadog Security and Compliance : HashiCorp Vault Azure Key Vault AWS KMS OPA Gatekeeper and Drata or similar ? Interested in exploring this further This is a high impact role in a fast More ❯
back-end development with reliable, scalable systems in an object-oriented language. Experience deploying to cloud platforms (AWS, GCP, Azure), containerization (Docker), infrastructure-as-code (Terraform), and observability tools (Datadog, Grafana). Curiosity and eagerness to learn. Strong problem-solving skills and attention to detail. Excellent communication skills for technical and non-technical stakeholders. We recognize that few candidates check More ❯
Bolton, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
reliable systems with an object-oriented programming . Cloud- technologies: Experience in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), and observability platforms (e.g., Datadog or Grafana). Curiosity: A hunger to learn and grow your skills. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down More ❯
in Computer Science, Management Information Systems, or related fields is desirable but not essential. Nice to have but not essential: Service monitoring and graphing tools (Prometheus + Grafana, Nagios, Datadog) Elastic Stack Repository solutions (JFrog Artifactory, JFrog Bintray) OpenVPN SQL Databases (MongoDB, PostgreSQL, MySQL) Our Values: We work together We believe in people We won't accept the "way it More ❯
AKS API Management and DevOps Pipelines and AWS including EKS Lambda and CloudFormation Infrastructure as Code and GitOps : Terraform Bicep Pulumi ArgoCD and FluxCD Observability : Prometheus Grafana OpenTelemetry and Datadog Security and Compliance : HashiCorp Vault Azure Key Vault AWS KMS OPA Gatekeeper and Drata or similar ? Interested in exploring this further This is a high impact role in a fast More ❯
AKS API Management and DevOps Pipelines and AWS including EKS Lambda and CloudFormation Infrastructure as Code and GitOps : Terraform Bicep Pulumi ArgoCD and FluxCD Observability : Prometheus Grafana OpenTelemetry and Datadog Security and Compliance : HashiCorp Vault Azure Key Vault AWS KMS OPA Gatekeeper and Drata or similar ? Interested in exploring this further This is a high impact role in a fast More ❯
Lisburn, Northern Ireland, United Kingdom Hybrid / WFH Options
Camlin Ltd
e.g., Docker, Kubernetes, Terraform, Ansible, Helm). Familiarity with continuous integration and deployment tools (e.g., GitLab CI, Argo workflow, Argo CD). Experience with monitoring/logging solutions (e.g., DataDog, ELK, Prometheus). Good understanding of concepts related to computer architecture, data structures and programming practices. Solid understanding of networking, databases, and security principles. Our Values: We work together We More ❯
administration experience. Strong programming experience in at least one language (.Net, C#, node, java, go, c/C++, ruby). Experience with logging and monitoring tools such as ELK, DataDog or NewRelic, LogEntries, SumoLogic, etc. BENEFITS: Competitive Base Salary Employee Performance Related Bonus Clear Career Progression Flexible Working Hours Matched Employee Pension Generous Holiday Package Secure Company Hardware More ❯
test: Containerisation (e.g. Docker), Virtualisation and Provisioning, Workload and job scheduling (e.g. Kubernetes, Ray) on high core-count machines and rack-scale installations, Management and Observability (e.g. Prometheus, OpenTelemetry, DataDog, Splunk, etc.). 10+ years of relevant experience related to quality assurance/testing teams. Experience with the Atlassian suite and CI/CD platforms such as Jenkins; GitHub or More ❯
network management. Proficiency in scripting and automation (e.g., Bash, Python, PowerShell). Familiarity with CI/CD pipelines and deployment automation. Experience with environment monitoring tools (e.g., Prometheus, Nagios, Datadog). Knowledge of security best practices and compliance standards in IT environments. Excellent problem-solving, troubleshooting, and analytical skills. Strong communication skills, with the ability to collaborate across technical and More ❯
test: Containerisation (e.g. Docker), Virtualisation and Provisioning, Workload and job scheduling (e.g. Kubernetes, Ray) on high core-count machines and rack-scale installations, Management and Observability (e.g. Prometheus, OpenTelemetry, DataDog, Splunk, etc.). 10+ years of relevant experience related to quality assurance/testing teams. Experience with the Atlassian suite and CI/CD platforms such as Jenkins; GitHub or More ❯
Familiarity with Infrastructure as Code and DevOps practices. Knowledge of Hyper-V management. Understanding of networking, security, and system administration (Linux/Windows). Experience with monitoring tools (e.g., DataDog, CloudWatch, Azure Monitor). Strong communication and collaboration skills. Responsibilities: Deploying and managing Kubernetes clusters, including networking, storage, and security. Collaborating with development and platform teams to deliver scalable, secure More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
RS Group plc
The Tech You’ll Be Using AWS serverless technologies (e.g. Lambda, API Gateway, SNS, SQS, DynamoDB) Docker (deployed to Hashicorp Nomad for Java and JavaScript applications) Kong (API Gateway) Datadog (logging, monitoring and thresholds) Vault (secrets management) Terraform (deploying infrastructure) Gitlab + Gitlab CI (git repo and continuous integration) Plus anything else the team finds useful About You You will More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Couchbase
DSA concepts Proficiency with Linux operating systems Exposure to any one of the CSPs like AWS/Azure/GCP Working experience in Grafana, Prometheus, Thanos and/or DataDog Strong debugging skills to mitigate a production issue Preferred Skills And Qualifications Experience with on-call rotations & incident management Experience in developing and managing Kubernetes clusters both self-managed (vanilla More ❯
years of experience Any of the following would be a plus: Experience with Observability across multiple domains (APM, Infrastructure, Synthetics, Logs, etc ) within cloud and on-premise environments using Datadog, Azure Monitor and Application Insights. NewRelic and Grafana Experience working in B2B SaaS companies Experience with cloud containers, specifically Kubernetes Responsibilities & Duties Develop: Architecture, strategy and implementations to enable or More ❯
Dundee, Scotland, United Kingdom Hybrid / WFH Options
Scopely
Join to apply for the Build and Release Engineer role at Scopely Scopely is seeking a Build & Release Engineer to join our talented team in Ireland, Spain, Portugal, or the UK on a hybrid/remote basis. At Scopely, we More ❯
Join to apply for the Performance Test Lead role at Companies House 2 weeks ago Be among the first 25 applicants Join to apply for the Performance Test Lead role at Companies House Get AI-powered advice on this job More ❯
Slough, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
major transformation programme. This role goes beyond traditional SRE – you’ll champion best practices across product teams, drive observability strategy, and work hands-on with cutting-edge tools like Datadog and AWS. Key Responsibilities: Lead the SRE function and promote observability-first thinking across development and operations teams. Define and implement the observability roadmap across product domains in collaboration with … the client. Be hands-on with Datadog for infrastructure and application-level monitoring. Guide and review daily operations and improvements across observability platforms. Partner with engineering squads to deliver on observability requirements in an agile, demand-led way. Core Skills & Experience: Proven experience as a hands-on SRE Engineer. Deep understanding of observability and monitoring practices. Practical experience with DatadogMore ❯
Azure DevOps, YAML-based) with security scanning and progressive delivery Supporting AKS clusters and Azure services (SQL, Cosmos DB, ADF, Functions, Logic Apps, etc.) Improving monitoring and alerting with Datadog, Grafana, ELK, and proactive failure detection Participating in the on-call rota and leading incident response workflows and blameless postmortems Coaching engineers, upskilling teams, and contributing to a culture of … DB, etc.) Strong Infrastructure as Code skills with Terraform (v1.7+) Experience with CI/CD pipelines, GitOps, and automation tools (PowerShell, Bash) Familiarity with observability and incident tools like Datadog, ELK, and synthetic monitoring Solid understanding of networking (TCP/IP, Load Balancing, DNS, Routing) Good knowledge of DevSecOps practices - including security scanning, IAM, and RBAC Experience with FinOps - tagging … Familiarity with security scanning tools (Trivy, tfsec) integrated into pipelines A proactive approach to problem-solving, documentation, and coaching Additional bonus skills include experience with Azure governance tools, advanced Datadog capabilities, Kubernetes autoscaling solutions, GitOps workflows, automated cost dashboards, compliance frameworks, and internal platform development. What You Can Expect: Competitive salary: £70,000 - £80,000 depending on experience 25 days More ❯
cloud architecture, and Java. Deep understanding of Kubernetes, Docker, and CI/CD tools. Strong scripting skills in Shell and Python. Experience with monitoring and metrics platforms such as Datadog and Prometheus. Knowledge of security best practices in cloud environments. Desirable: Experience with Nix, Crossplane, and autoscaling on AWS. Additional Technologies and Experience Nginx, IPv4 Networking, Test automation, Relational and More ❯
architecture through to production deployment and support. You'll work closely with experienced engineers and domain experts to deliver mission-critical services with a strong focus on scalability, observability (DataDog), and quality. You'll also contribute to architectural design, sequence diagrams, and flow mapping, ensuring robust documentation and testing standards are met. This is a full Agile environment, and you More ❯
architecture through to production deployment and support. You'll work closely with experienced engineers and domain experts to deliver mission-critical services with a strong focus on scalability, observability (DataDog), and quality. You'll also contribute to architectural design, sequence diagrams, and flow mapping, ensuring robust documentation and testing standards are met. This is a full Agile environment, and you More ❯
architecture through to production deployment and support. You'll work closely with experienced engineers and domain experts to deliver mission-critical services with a strong focus on scalability, observability (DataDog), and quality. You'll also contribute to architectural design, sequence diagrams, and flow mapping, ensuring robust documentation and testing standards are met. This is a full Agile environment, and you More ❯
Employment Type: Contract
Rate: £500 - £700/day Day Rate Contract | 6 months
DB etc.) Solid knowledge of networking (DNS, DHCP, TCP/IP, firewalls, routing). Experience with Windows and Linux server environments. Experience with monitoring and notification tools (e.g., Cloudwatch, Datadog, Zabbix, Solarwinds, Nagios, PRTG, Opsgenie or Pagerduty). Scripting skills (PowerShell, JavaScript, Bash, Python preferred). Knowledge of backup and recovery tools (e.g., Veeam, Azure Backup, AWS Backup). Understanding More ❯
architecture through to production deployment and support. You'll work closely with experienced engineers and domain experts to deliver mission-critical services with a strong focus on scalability, observability (DataDog), and quality. You'll also contribute to architectural design, sequence diagrams, and flow mapping, ensuring robust documentation and testing standards are met. This is a full Agile environment, and you More ❯