Kubernetes layers. Orchestrate post-incident reviews: document findings, define mitigation plans, and drive tickets to resolution. Reliability Engineering & Automation Develop and maintain robust observability for front-end components: integrate Datadog for observability. Define SLIs/SLOs for page load times, Time to Interactive, and error rates; build alerting that balances sensitivity with noise reduction. Automate deployments via CI/CD More ❯
CI/CD tools such as Git, GHA and test automation frameworks such as BDD Experienced in API testing using Postman Experience with observability and monitoring tools such as Datadog Hands-on experience in test artefact creation, Test management and Defect Management testing tools Specific training or certifications in Quality Assurance are preferred Experience validating AI/ML data pipelines More ❯
AWS services Experience with modern CI/CD pipelines like GitLab, GitHub Actions, or CircleCI Awareness of CDNs, shell scripting, and common CLI tools Experience with monitoring tools like Datadog Familiarity with Kubernetes and deployment tools like Helm Why Join H&B Tech? Help shape the future of digital wellness in a purpose-led business Work in a forward-thinking More ❯
experience at leading tech companies, startups, and the enterprise software sphere. Our backers include Y Combinator, Index Ventures, and stellar angels such as the founders of Looker, GitHub, Mulesoft, Datadog and UiPath. More ❯
stage environments preferred. Nice to Have: Experience scaling engineering orgs across multiple geographies or domains (e.g., front-end, back-end, infrastructure). Familiarity with tools like Linear, Asana, GitHub, Datadog, DORA metrics, or similar performance/observability platforms. Background in organisational change management or engineering program management. What you can expect from us Competitive salary with substantial incentive schemes Generous More ❯
self-healing services that allow you to sleep through the night. Support the production systems on prem (legacy) and GCP which includes build, deployment (CI/CD), and monitoring (Datadog, PagerDuty). Deliver high quality code by following coding standards, performing code reviews and ensuring test coverage at all levels. Collaborate with engineers and other teams to elevate technology, follow More ❯
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber Key Responsibilities Partner with customers to identify and address their ML deployment needs Implement and optimise ML solutions using Python, open-source tools & infrastructure Collaborate More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber Key Responsibilities Partner with customers to identify and address their ML deployment needs Implement and optimise ML solutions using Python, open-source tools & infrastructure Collaborate More ❯
East London, London, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber Key Responsibilities Partner with customers to identify and address their ML deployment needs Implement and optimise ML solutions using Python, open-source tools & infrastructure Collaborate More ❯
Altrincham, Greater Manchester, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber Key Responsibilities Partner with customers to identify and address their ML deployment needs Implement and optimise ML solutions using Python, open-source tools & infrastructure Collaborate More ❯
Leigh, Greater Manchester, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber Key Responsibilities Partner with customers to identify and address their ML deployment needs Implement and optimise ML solutions using Python, open-source tools & infrastructure Collaborate More ❯
Leeds, West Yorkshire, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber Key Responsibilities Partner with customers to identify and address their ML deployment needs Implement and optimise ML solutions using Python, open-source tools & infrastructure Collaborate More ❯
Bury, Greater Manchester, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber Key Responsibilities Partner with customers to identify and address their ML deployment needs Implement and optimise ML solutions using Python, open-source tools & infrastructure Collaborate More ❯
Bolton, Greater Manchester, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber Key Responsibilities Partner with customers to identify and address their ML deployment needs Implement and optimise ML solutions using Python, open-source tools & infrastructure Collaborate More ❯
Central London / West End, London, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber Key Responsibilities Partner with customers to identify and address their ML deployment needs Implement and optimise ML solutions using Python, open-source tools & infrastructure Collaborate More ❯
Ashton-Under-Lyne, Greater Manchester, United Kingdom Hybrid / WFH Options
Few&Far
achieve optimal outcomes with their models, focusing on aspects like optimisation, scalability & efficiency You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub and Uber Key Responsibilities Partner with customers to identify and address their ML deployment needs Implement and optimise ML solutions using Python, open-source tools & infrastructure Collaborate More ❯
level production incidents The Person: 5+ years in SRE, DevOps, or infrastructure engineering Strong experience with AWS, EKS/Kubernetes, and Terraform Familiar with Kafka and observability tools like Datadog or Grafana Able to troubleshoot issues across infrastructure and application layers Reference number: BBBH259300 To apply for this role or to be considered for further roles, please click "Apply More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment
level production incidents The Person: 5+ years in SRE, DevOps, or infrastructure engineering Strong experience with AWS, EKS/Kubernetes, and Terraform Familiar with Kafka and observability tools like Datadog or Grafana Able to troubleshoot issues across infrastructure and application layers Reference number: BBBH(phone number removed) To apply for this role or to be considered for further roles More ❯
Employment Type: Permanent
Salary: £80000 - £90000/annum 38 Days Holiday, Healthcare, Pension
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
level production incidents The Person: 5+ years in SRE, DevOps, or infrastructure engineering Strong experience with AWS, EKS/Kubernetes, and Terraform Familiar with Kafka and observability tools like Datadog or Grafana Able to troubleshoot issues across infrastructure and application layers Reference number: BBBH259300 To apply for this role or to be considered for further roles, please click "Apply More ❯
and aligned. The skills Proven experience as a DevOps Engineer, Site Reliability Engineer, Platform Engineer or similar role. Ideally in an enterprise-grade Experience with APM stacks such as Datadog, New Relic, Prometheus or similar. Experience with handling telemetry, tracing and logging data, at scale, in multiple different environments. Familiarity with low-level telemetry daemons and aggregators such as StatsD. More ❯
Data/Credit Risk teams to define and evolve the platform roadmap Tech Stack TypeScript, Python, general scripting AWS (serverless & container-based stack) Terraform DynamoDB, PostgreSQL, MySQL GitHub Actions Datadog FCA Compliance & Consumer Duty At Onmo we all take collective responsibility for our individual roles in creating the best outcomes for our customers. In this role that involves; Identifying and More ❯
North West London, London, United Kingdom Hybrid / WFH Options
ByteHire
of infrastructure setup and management Exposure to designing or building distributed systems, preferably in a cloud environment Company Tech Stack PHP, Laravel, ReactJS, TypeScript, Inertia, WordPress MySQL, Redis, Elasticsearch, Datadog, AWS, Terraform, Docker Benefits Hybrid working 1-2 days per week in the London office. Collaborate directly with the founding team and take ownership of product features. Be part of More ❯
usage of text editors and supportive tools for efficient coding Nice-to-Have: Understanding Kubernetes, Helm charts, Azure, and AWS cloud services Experience in working with monitoring tools (New Relic, Datadog, Elastic) Experience with Big Data or ETL pipelines Interview Steps GT interview with Recruiter (45 minutes) Technical interview with test task (1 hour) Live coding interview (2 hours) Final interview More ❯