London, South East, England, United Kingdom Hybrid/Remote Options
Additional Resources Ltd
high-volume processing. Deploying and managing containerised workloads through Kubernetes, Helm, and Docker. Automating infrastructure using Infrastructure-as-Code tools such as Terraform and Ansible. Ensuring system reliability through observability, monitoring, and proactive issue resolution. Collaborating with cross-functional teams to align data solutions with wider business needs. Supporting the continuous improvement of processes, deployment, and data quality standards. What More ❯
FinOps practices. Experience with infrastructure-as-code tools (e.g., Terraform, Helm, Ansible). Familiarity with CI/CD pipelines and automation (e.g., GitHub Actions, ArgoCD, Jenkins). Experience on observability tools like Prometheus, Grafana Knowledge of Linux systems administration and networking fundamentals and experience with policy-as-code. Passion for platform engineering, developer experience, and site reliability UAL is a More ❯
london, south east england, united kingdom Hybrid/Remote Options
Mott MacDonald
production-grade products, and with product managers to shape roadmaps based on technical feasibility and user value. DevOps & CI/CD: Support cloud-native deployment pipelines, automated testing, and observability for everything we build. Champion software engineering excellence: Drive continuous improvement across software engineering culture, codebases, and development practices. What You'll Bring Clear communicator, with the ability to engage More ❯
and enhance a cloud-native platform used across the UK public sector. This role is heavily DevOps-focused — you’ll be deep in AWS operations, Lambda-based architectures, monitoring, observability, and automation, while still contributing to feature development and service improvements. Key Responsibilities Provide hands-on support for a large-scale serverless platform built on AWS (Lambda, API Gateway, CloudFormation More ❯
with Helm and ArgoCD Owning CI/CD pipelines across multiple environments (GitHub Actions, Jenkins, etc.) Working closely with software engineers to streamline delivery and performance Bringing structure and observability into their environments using tools like Prometheus, Grafana, and ELK Championing DevOps best practice, security, and reliability across the engineering teams What they’re looking for Proven experience in a More ❯
Milton Keynes, Buckinghamshire, England, United Kingdom Hybrid/Remote Options
REDTECH RECRUIT
sector aligned compliance Drive platform modernisation, customer migration tooling, integration frameworks and data integrity for large scale transitions Maintain engineering standards covering code quality, documentation, testing, CI/CD, observability and security Establish metrics driven engineering practices and lead initiatives to reduce lead time, improve deployment frequency and optimise reliability Ensure compliance with ISO27001, GDPR, PCI DSS and sector specific More ❯
Strong expertise in implementing Site Reliability Engineering (SRE) principles. Advanced knowledge of establishing observability using tools Dynatrace & Datadog (primary skills). Proficiency in automation & scripting using Python & Ansible (primary skills). Strong experience with cloud platforms AWS & Azure (primary skills). Solid understanding of containerization and orchestration tools like Docker and Kubernetes . Proficiency in cloud native distributed systems & microservices More ❯
native systems on the cloud (preferably Azure) using Terraform and Kubernetes. Manage CI/CD pipelines using GitHub Actions and ensure smooth delivery to production. Own monitoring, alerting, and observability, using tools like OpenTelemetry and Dynatrace. Security & Compliance: Champion secure coding practices and data protection across services. Collaboration & Mentoring: Work closely with product owners, engineering leads, and other stakeholders to More ❯
London, South East, England, United Kingdom Hybrid/Remote Options
F S People
SaaS applications. Experience with cloud platforms. (AWS, Google Cloud or Azure). Knowledge of containerisation (Docker or Kubernetes) Familiarity with SOC 2 readiness and security best practices. Experience with observability and monitoring. Background in fintech, lending or other regulated-data environments. Experience or interest in Rust or Go for performance-critical components. Experience designing or maintaining complex front-end state More ❯
scalability and reduce manual intervention. Operational Security, SRE & Assurance: Ensure security platforms are resilient, continuously monitored, and designed for 24x7 support and incident response readiness. Embed security telemetry and observability to enable proactive threat detection and automated response. Apply SRE principles to improve reliability, performance, and maintainability of security services. Lead platform health, patching automation, and vulnerability remediation workflows. Define More ❯
oxford district, south east england, united kingdom
Ellison Institute of Technology
our bioinformaticians and science teams. This role blends hands-on engineering (70–80%) with people leadership and a focus on engineering excellence, raising the bar on standards, security, reliability, observability, and quality. You'll collaborate closely within a cross-functional team, working with architects, platform engineers, and data engineers to deliver platform tools that manage the full data lifecycle, monitor More ❯
databases and retrieval strategies. Knowledge of recommender systems and ranking models. Familiarity with LLM evaluation tools (e.g., RAGAS, TruLens, LangSmith, Arize). Exposure to feature stores, data lineage, and observability stacks. Experience in e-commerce or retail environments. Demonstrable ability to weigh up build/build/configure decisions in the LLM space. Charlotte Tilbury is a fast-paced and More ❯
Bletchley, Buckinghamshire, United Kingdom Hybrid/Remote Options
RedTech Recruitment Ltd
sector aligned compliance Drive platform modernisation, customer migration tooling, integration frameworks and data integrity for large scale transitions Maintain engineering standards covering code quality, documentation, testing, CI/CD, observability and security Establish metrics driven engineering practices and lead initiatives to reduce lead time, improve deployment frequency and optimise reliability Ensure compliance with ISO27001, GDPR, PCI DSS and sector specific More ❯
dartford, south east england, united kingdom Hybrid/Remote Options
Europa Worldwide Group
or a related technical role. Strong, hands-on experience with Microsoft Azure, including services such as AKS and IaC tooling like Terraform. Solid understanding of cloud-native monitoring and observability tools Proficiency with infrastructure-as-code practices and tools (Terraform, HashiCorp ecosystem). Good knowledge of containerisation and container security, including Docker and Kubernetes. Relevant Azure certifications or equivalent experience. More ❯
london (city of london), south east england, united kingdom Hybrid/Remote Options
Gravitee
Helm Charts Cloud experience (AWS and/or Azure) Even better if you also have skills across: Certificate management (ZeroSSL, Let's Encrypt) Argo Workflows & ArgoCD Continuous Delivery tooling Observability tools (Grafana, Prometheus) ESSENTIAL SKILLS The right candidate will possess at least the following skills, if not more: 3+ years of professional experience in infrastructure management Fluent with creating and More ❯
london, south east england, united kingdom Hybrid/Remote Options
Our Future Health
testing, code reviews, design documentation, excellent debugging, troubleshooting skills. Experience with Azure (ideally), AWS or GCP, Docker, Kubernetes, and Helm. Experience of operationally managing software components once live, including; observability, logging, metrics, error reporting, debugging and live incident management. Experience with Microsoft Sentinel, Microsoft's Defender and Purview suites and Microsoft Entra. Experience of SOAR tooling and automating security capabilities More ❯
offs of architectural and design decisions. Experience with Sequelize or similar tools Knowledge of security, accessibility and performance best practices. Exposure to agile or lean delivery environments. Familiarity with observability tools. Contract Philip Boltt at Lorien Global IND_PC1 Guidant, Carbon60, Lorien & SRG - The Impellam Group Portfolio are acting as an Employment Business in relation to this vacancy. More ❯
and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance. Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics). Managing Infrastructure as Code (Terraform, Pulumi, CloudFormation) for scalable, repeatable deployments. Automating with PowerShell, Python, or Bash to drive efficiency. Supporting More ❯
Milton Keynes, Buckinghamshire, England, United Kingdom
Tank Recruitment
strategy and AI enablement Partner with Product, Security, Sales and Customer Experience teams to align delivery and ensure successful customer migrations Set standards for secure development, CI/CD, observability and compliance (ISO27001, GDPR, PCI/DSS) What You'll Bring Proven leadership in enterprise SaaS or major SaaS transformation programmes 8+ years in engineering, including 4+ years in senior More ❯