Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
in Azure (will also consider AWS or GCP experience). You have a deep understanding of cloud infrastructure and services, including best practices around monitoring, scaling and security tools, e.g. Datadog. You have strong scripting skills with PowerShell (or Python). You have a good knowledge of basic networking, TCP/IP. You have a good understanding of IaC; they use Pulumi More ❯
Kubernetes layers. Orchestrate post-incident reviews: document findings, define mitigation plans, and drive tickets to resolution. Reliability Engineering & Automation: Develop and maintain robust observability for front-end components: integrate Datadog for observability. Define SLIs/SLOs for page load times, Time to Interactive, and error rates; build alerting that balances sensitivity with noise reduction. Automate deployments via CI/CD More ❯
stage environments preferred. Nice to Have: Experience scaling engineering orgs across multiple geographies or domains (e.g., front-end, back-end, infrastructure). Familiarity with tools like Linear, Asana, GitHub, Datadog, DORA metrics, or similar performance/observability platforms. Background in organisational change management or engineering program management. What you can expect from us: Competitive salary with substantial incentive schemes. Generous More ❯
self-healing services that allow you to sleep through the night. Support the production systems on prem (legacy) and GCP which includes build, deployment (CI/CD), and monitoring (Datadog, PagerDuty). Deliver high quality code by following coding standards, performing code reviews and ensuring test coverage at all levels. Collaborate with engineers and other teams to elevate technology, follow More ❯
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency. You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber. Key Responsibilities: Partner with customers to identify and address their ML deployment needs. Implement and optimise ML solutions using Python, open-source tools & infrastructure. Collaborate More ❯
East London, London, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency. You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber. Key Responsibilities: Partner with customers to identify and address their ML deployment needs. Implement and optimise ML solutions using Python, open-source tools & infrastructure. Collaborate More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency. You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber. Key Responsibilities: Partner with customers to identify and address their ML deployment needs. Implement and optimise ML solutions using Python, open-source tools & infrastructure. Collaborate More ❯
Leeds, West Yorkshire, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency. You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber. Key Responsibilities: Partner with customers to identify and address their ML deployment needs. Implement and optimise ML solutions using Python, open-source tools & infrastructure. Collaborate More ❯
Bolton, Greater Manchester, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency. You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber. Key Responsibilities: Partner with customers to identify and address their ML deployment needs. Implement and optimise ML solutions using Python, open-source tools & infrastructure. Collaborate More ❯
Altrincham, Greater Manchester, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency. You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber. Key Responsibilities: Partner with customers to identify and address their ML deployment needs. Implement and optimise ML solutions using Python, open-source tools & infrastructure. Collaborate More ❯
Bury, Greater Manchester, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency. You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber. Key Responsibilities: Partner with customers to identify and address their ML deployment needs. Implement and optimise ML solutions using Python, open-source tools & infrastructure. Collaborate More ❯
Leigh, Greater Manchester, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency. You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber. Key Responsibilities: Partner with customers to identify and address their ML deployment needs. Implement and optimise ML solutions using Python, open-source tools & infrastructure. Collaborate More ❯
Central London / West End, London, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency. You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber. Key Responsibilities: Partner with customers to identify and address their ML deployment needs. Implement and optimise ML solutions using Python, open-source tools & infrastructure. Collaborate More ❯
Ashton-Under-Lyne, Greater Manchester, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency. You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber. Key Responsibilities: Partner with customers to identify and address their ML deployment needs. Implement and optimise ML solutions using Python, open-source tools & infrastructure. Collaborate More ❯
level production incidents. The Person: 5+ years in SRE, DevOps, or infrastructure engineering; strong experience with AWS, EKS/Kubernetes, and Terraform; familiar with Kafka and observability tools like Datadog or Grafana; able to troubleshoot issues across infrastructure and application layers. Reference number: BBBH259300. To apply for this role or to be considered for further roles, please click "Apply More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment
level production incidents. The Person: 5+ years in SRE, DevOps, or infrastructure engineering; strong experience with AWS, EKS/Kubernetes, and Terraform; familiar with Kafka and observability tools like Datadog or Grafana; able to troubleshoot issues across infrastructure and application layers. Reference number: BBBH(phone number removed). To apply for this role or to be considered for further roles More ❯
Employment Type: Permanent
Salary: £80000 - £90000/annum 38 Days Holiday, Healthcare, Pension
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
level production incidents. The Person: 5+ years in SRE, DevOps, or infrastructure engineering; strong experience with AWS, EKS/Kubernetes, and Terraform; familiar with Kafka and observability tools like Datadog or Grafana; able to troubleshoot issues across infrastructure and application layers. Reference number: BBBH259300. To apply for this role or to be considered for further roles, please click "Apply More ❯
and aligned. The skills: Proven experience as a DevOps Engineer, Site Reliability Engineer, Platform Engineer or similar role. Ideally in an enterprise-grade Experience with APM stacks such as Datadog, New Relic, Prometheus or similar. Experience with handling telemetry, tracing and logging data, at scale, in multiple different environments. Familiarity with low-level telemetry daemons and aggregators such as StatsD. More ❯
North West London, London, United Kingdom Hybrid / WFH Options
ByteHire
of infrastructure setup and management. Exposure to designing or building distributed systems, preferably in a cloud environment. Company Tech Stack: PHP, Laravel, ReactJS, TypeScript, Inertia, WordPress, MySQL, Redis, Elasticsearch, Datadog, AWS, Terraform, Docker. Benefits: Hybrid working 1-2 days per week in the London office. Collaborate directly with the founding team and take ownership of product features. Be part of More ❯
Walsall, Staffordshire, United Kingdom Hybrid / WFH Options
CET Structures Limited
responsive, user-friendly interfaces and working with component libraries like Vuetify. Experience in writing unit and integration tests. Experience working with the Azure stack is essential. Experience working with Datadog or other observability platforms is desirable. Interest in learning new technologies is desirable. Additional Skills & Qualities. Agile experience: Familiarity with Scrum, Kanban, or similar methodologies. A team player with strong More ❯
React Testing Library. Backend/Infrastructure: PHP 8 (Symfony), Kotlin, MySQL, RabbitMQ, AWS, Docker, Kubernetes. Tooling: ArgoCD and GitHub Actions; GitHub, Jira & Confluence for our code & product management processes; Datadog & Sentry for debugging and reporting; Figma for our design process. What we're looking for: You have a solid understanding of modern JavaScript with TypeScript. You have commercial experience More ❯
and technical perspectives. Experience mentoring engineers through pairing, code reviews, and knowledge-sharing. Familiarity with CI/CD pipelines, automated testing strategies, and observability tools (e.g., GitHub Actions, Sentry, Datadog). A mindset geared toward experimentation, measurement, and continuous improvement, especially within growth-driven product teams. Nice to Have: Previous experience working in a start-up/scale-up environment. More ❯
e.g. Stripe). Experience mentoring engineers through pairing, code reviews, and knowledge-sharing. Familiarity with CI/CD pipelines, automated testing strategies, and observability tools (e.g., GitHub Actions, Sentry, Datadog). A mindset geared toward experimentation, measurement, and continuous improvement, especially within product teams. Nice to Have: Previous experience working in a start-up/scale-up environment. Familiar with More ❯