such as Azure, AWS or GCP Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar. Proven track record of maintaining highly-available and performant production environments. Ability to identify and implement effective mitigation strategies and operational playbooks. Useful/Bonus Skills More ❯
and other relevant tools. Security Best Practices: IAM, MFA, data encryption, firewall configurations. Programming/Scripting: Python, Terraform, or similar languages. Event-Driven Architectures: Kafka. Monitoring and Logging: Datadog, ELK Stack, Prometheus, etc. Experience in agile methodologies and DevOps practices. Location: Hybrid. Office located in London. (Hayes area). Office presence required: Yes. Frequency: 2-3 times a week at More ❯
needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
roles 3+ years' experience with an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Embarcaderomediagroup
Azure DevOps, YAML-based) with security scanning and progressive delivery Supporting AKS clusters and Azure services (SQL, Cosmos DB, ADF, Functions, Logic Apps, etc.) Improving monitoring and alerting with Datadog, Grafana, ELK, and proactive failure detection Participating in the on-call rota and leading incident response workflows and blameless postmortems Coaching engineers, upskilling teams, and contributing to a culture of … DB, etc.) Strong Infrastructure as Code skills with Terraform (v1.7+) Experience with CI/CD pipelines, GitOps, and automation tools (PowerShell, Bash) Familiarity with observability and incident tools like Datadog, ELK, and synthetic monitoring Solid understanding of networking (TCP/IP, Load Balancing, DNS, Routing) Good knowledge of DevSecOps practices - including security scanning, IAM, and RBAC Experience with FinOps - tagging … Familiarity with security scanning tools (Trivy, tfsec) integrated into pipelines A proactive approach to problem-solving, documentation, and coaching Additional bonus skills include experience with Azure governance tools, advanced Datadog capabilities, Kubernetes autoscaling solutions, GitOps workflows, automated cost dashboards, compliance frameworks, and internal platform development. What You Can Expect: Competitive salary: £70,000 - £80,000 depending on experience 25 days More ❯
Watford, Hertfordshire, United Kingdom Hybrid / WFH Options
Wickes
You'll have a deep understanding of modern cloud ecosystems, with extensive hands-on experience in Amazon Web Services (AWS). Familiarity with modern observability concepts and tools, including Datadog, and proven experience with the "platform as a product" model and driving adoption of internal tools. Strong familiarity with CI/CD principles and pipelines (e.g., Jenkins, GitLab CI, CircleCI More ❯
and optimize CI/CD pipelines using Azure DevOps, GitHub Actions, or Jenkins. Automate everything with Terraform, Bicep, and scripting (PowerShell, Bash, Python). Drive observability with tools like Datadog, LogicMonitor, CloudWatch, and Grafana. Champion cloud security, IAM, RBAC, and compliance best practices. Collaborate across teams, mentor peers, and contribute to a culture of continuous improvement. What You Bring: Proven More ❯
and optimize CI/CD pipelines using Azure DevOps, GitHub Actions, or Jenkins. Automate everything with Terraform, Bicep, and scripting (PowerShell, Bash, Python). Drive observability with tools like Datadog, LogicMonitor, CloudWatch, and Grafana. Champion cloud security, IAM, RBAC, and compliance best practices. Collaborate across teams, mentor peers, and contribute to a culture of continuous improvement. ?? What You Bring: Proven More ❯
Leeds, West Yorkshire, England, United Kingdom Hybrid / WFH Options
Anson McCade Ltd - IT and Finance Recruitment
pipelines (e.g., GitHub Actions, CircleCI). Bonus Skills That Impress Delivery into cloud platforms (AWS, Azure, GCP). Familiarity with relational and NoSQL databases. Experience with observability tools (e.g., DataDog, Prometheus). Test automation know-how. Exposure to open-source tools and community practices. Ready to build what matters? Apply now to shape the future of digital engineering in an More ❯
Leeds, England, United Kingdom Hybrid / WFH Options
Anson McCade
frameworks Desirable Experience Delivery of secure software in government, defence, or other regulated sectors Hands-on cloud-native development and deployment Knowledge of logging and monitoring tools such as DataDog, Prometheus, or StackDriver Experience working with product lifecycle tooling and engineering in complex domains If you’re looking to focus on real engineering work that drives meaningful outcomes and want More ❯
Leeds, West Yorkshire, England, United Kingdom Hybrid / WFH Options
Anson McCade Ltd - IT and Finance Recruitment
Nice to Have (But Not Essential) Cloud experience: AWS, Azure or GCP Solid grasp of databases and data modelling Familiarity with open-source tools and monitoring platforms (e.g., Prometheus, DataDog) Experience with test automation frameworks and performance tools More ❯
Portsmouth, Hampshire, United Kingdom Hybrid / WFH Options
Checkatrade
Senior Platform Engineer Experience in Cloud Native technologies? Come join us! Are you looking for a new role? We have an exciting opportunity at Checkatrade for a Senior Platform Engineer to join our mission of making home improvements easy by More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
Job description RemoteStar is looking to hire a Senior Site Reliability Engineering Manager on behalf of our client based in the UK with a fully remote work policy. About Client: The client building, the B2B marketplace for diamonds. It's More ❯
Watford, Hertfordshire, United Kingdom Hybrid / WFH Options
Wickes
Service Level Indicators (SLIs), driving initiatives to enhance reliability, performance, and scalability. You will design, implement, and manage observability solutions, including monitoring, logging, and tracing, with strong expertise in Datadog for proactive dashboards and alerts. Automate manual operational tasks to reduce toil and improve system resilience. Collaboration is key both with our Platform Engineers, to ensure we manage and improve More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
We're assembling a team of elite founding software engineers for a startup, building the future of e-commerce in MENA, bringing together community, shopping and entertainment. Location: Remote We are looking for engineers who are passionate about creating scalable More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Develop
and authentication modules Ensure secure, scalable deployment using Azure cloud-native tools Build and support systems using PostgreSQL, Java, and Spring Boot Integrate and monitor using observability tools like Datadog and BigPanda Collaborate closely with architects, DevOps, and security teams across the full SDLC Core Skills & Technologies Strong backend development in Java with Spring Boot Cloud migration experience, particularly Azure … Lift-and-Shift Familiarity with cloud infrastructure and deployment pipelines Exposure to PostgreSQL, authentication/security patterns Monitoring/observability tooling: Datadog, BigPanda Apply now to be considered. More ❯
Burton-on-Trent, Staffordshire, England, United Kingdom Hybrid / WFH Options
Crimson
secure, scalable AWS and Azure infrastructure Build and maintain CI/CD pipelines using Azure DevOps, GitHub Actions, or Jenkins. Set up monitoring, alerting, and logging with tools like Datadog, Logic Monitor, and Solarwinds. Strong grasp of DevOps principles; hands-on CI/CD experience. Microsoft Certified: DevOps Engineer Expert (AZ-400). Design and deploy containers on AKS/ More ❯
Bradford, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Anson Mccade
microservices. Experience with at least one cloud platform - Azure preferred, but AWS or GCP also welcome. Bonus Skills: Knowledge of database systems, test automation frameworks, and monitoring tools (e.g., DataDog, Prometheus). Passion for continuous learning and using open-source technologies. Why Join? No sales targets. No consulting fluff. Just great engineering. Paid overtime (up to 200 hours/year More ❯
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
in Azure (will also consider AWS or GCP experience) You have a deep understanding of cloud infrastructure and services including best practices around monitoring, scaling and security tools e.g. DataDog You have strong scripting skills with PowerShell (or Python) You have a good knowledge of basic networking, TCP/IP You have a good understanding of IaC, they use Pulumi More ❯
stage environments preferred. Nice to Have: Experience scaling engineering orgs across multiple geographies or domains (e.g., front-end, back-end, infrastructure). Familiarity with tools like Linear, Asana, GitHub, Datadog, DORA metrics, or similar performance/observability platforms. Background in organisational change management or engineering program management. What you can expect from us Competitive salary with substantial incentive schemes Generous More ❯
stage environments preferred. Nice to Have: Experience scaling engineering orgs across multiple geographies or domains (e.g., front-end, back-end, infrastructure). Familiarity with tools like Linear, Asana, GitHub, Datadog, DORA metrics, or similar performance/observability platforms. Background in organisational change management or engineering program management. What you can expect from us Competitive salary with substantial incentive schemes Generous More ❯
Birmingham, Staffordshire, United Kingdom Hybrid / WFH Options
CET Structures Limited
responsive, user-friendly interfaces and working with component libraries like Vuetify. Experience in writing unit and integration tests Experience working with the Azure stack is essential Experience working with DataDog or other observability platforms is desirable Interest in learning new technologies is desirable Additional Skills & Qualities Agile experience: Familiarity with Scrum, Kanban, or similar methodologies. A team player with strong More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
Job Overview: The Engineering IT group provides the high-performance compute environment that fuels product and solutions development for Arm's engineering community. Whether its high-performance compute (HPC) on Arm's on-prem infrastructure and/or in the More ❯