and shared infrastructure Identify and resolve architectural bottlenecks in the current data platform and propose improvements that reduce complexity and boost performance Drive initiatives that improve data quality, lineage, observability, and system reliability Influence and Collaborate Across Teams Act as a technical liaison between engineering, product, and analytics teams, ensuring alignment on architecture and data strategy Provide technical leadership and … workloads Familiarity with data governance, privacy, and compliance frameworks Background in customer-centric or product-driven environments (e.g., digital, eCommerce, SaaS) Experience with infrastructure-as-code and data platform observability (e.g., Terraform) What You Can Expect Interesting work - working in a fast-paced and ever-changing industry, new problems and exciting solutions are never too far away. There are always …
in between - ensuring our platform is resilient, efficient, secure and developer-friendly. Key Responsibilities: Design, build, and maintain platform services and infrastructure used by product engineering teams. Improve reliability, observability, and scalability of existing systems. Develop and maintain CI/CD pipelines to support software delivery. Build tooling and automation that supports self-service infrastructure and deployment. Ensure security best …
DevOps Manager responsibilities: Oversee the team that provides operational support for Linux servers, networks, and AWS cloud infrastructure. Manage security vulnerabilities and implement mitigations. Implement and maintain monitoring and observability solutions. Provision infrastructure for new projects and products. Support project delivery and provide infrastructure design expertise. Maintain and improve configuration management (Puppet) and DevOps processes. Lead the transition to containerization …
financial institution with soaring profits - my client is modernising platforms, embracing AI, and driving automation at scale. We're hiring a Lead Site Reliability Engineer (SRE) to drive reliability, observability, and performance across our Azure cloud infrastructure. You’ll work in a modern engineering environment where we live by "you build it, you run it", focused on automation, scale, and …
City of London, London, United Kingdom Hybrid / WFH Options
Premier Group
Code (IaC) using Terraform to automate infrastructure provisioning and management. Establish and maintain robust security controls across all cloud environments, ensuring compliance with relevant standards and regulations. Utilise advanced observability tools to monitor and optimise the performance of production services, proactively identifying and resolving issues. Design and optimise CI/CD pipelines using platforms such as GitLab or Jenkins, enabling …
in the industry. At the heart of the firm’s operations, you’ll design and implement automation for operations, deployments, monitoring and incident management, as well as owning the observability stack (metrics, logs, traces and alerting). You will also: apply core SRE principles (SLIs, SLOs, error budgets) to enhance system reliability; build, document, and improve high-performance system designs …
Zero Trust architecture Experience with healthcare compliance requirements (ISO27001, ISO13485, HIPAA) and security controls Proven track record implementing automated security testing and vulnerability management Strong knowledge of monitoring and observability tools (Azure Monitor, Application Insights) Experience implementing secure networking and identity management solutions in Azure Strong communication skills with ability to collaborate on security requirements across teams Track record of …
be: Comfortable working autonomously and taking independent decisions as well as having the ability to work cooperatively within a team, Experience working with microservice architectures and building monitoring/observability metrics, Understanding of cloud native landscapes (AWS or Azure or GCP), Knowledge of containerized environments would be beneficial (Docker or Kubernetes). Benefits we offer: 23 days' holiday + all …
Manchester, North West, United Kingdom Hybrid / WFH Options
We Are Dcoded Limited
Tech Snapshot: Languages & Frameworks: C#, .NET Core Cloud: AWS (cloud-native, not lift-and-shift) Containerisation: Kubernetes IaC & Pipelines: Terraform, CI/CD APIs: GraphQL, REST Practices: TDD, Monitoring, Observability What You'll Be Doing: Building and maintaining distributed services in a high-traffic, cloud-native environment Leading the development of new features and contributing to architectural decisions Collaborating with Product …
Greater Leeds Area, United Kingdom Hybrid / WFH Options
Anson McCade
CI/CD pipelines (e.g., GitHub Actions, CircleCI). 💡 Bonus Skills That Impress Delivery into cloud platforms (AWS, Azure, GCP). Familiarity with relational and NoSQL databases. Experience with observability tools (e.g., DataDog, Prometheus). Test automation know-how. Exposure to open-source tools and community practices. Ready to build what matters? Apply now to shape the future of digital …
Own and evolve incident management, SLAs, and service performance Diagnose complex system issues including networking, deployment, and application-level debugging Drive automation, self-service, and tooling improvements (ticket workflows, observability, etc.) Liaise closely with Engineering and Product teams, feeding technical insights into the roadmap What You’ll Bring 5+ years in technical support leadership, ideally in a SaaS, cloud-native …
London, England, United Kingdom Hybrid / WFH Options
Client Server
build and maintain secure, repeatable CI/CD pipelines using GitHub Actions and ArgoCD. You'll automate provisioning and scaling of core services including Redis, Kafka and PostgreSQL, implement observability and monitoring, integrate and manage IAM control frameworks, and apply infrastructure security best practices. You will support deployments in cloud (AWS) and controlled on-premise environments (Linux). Location …
South East London, England, United Kingdom Hybrid / WFH Options
Client Server
build and maintain secure, repeatable CI/CD pipelines using GitHub Actions and ArgoCD. You'll automate provisioning and scaling of core services including Redis, Kafka and PostgreSQL, implement observability and monitoring, integrate and manage IAM control frameworks, and apply infrastructure security best practices. You will support deployments in cloud (AWS) and controlled on-premise environments (Linux). Location …
teams. Drive technical delivery and ensure teams are focused on business goals, measuring success by business impact rather than lines of code. Champion engineering best practices, including TDD, IaC, observability, security by design, and continuous improvement. Contribute to architectural discussions and decisions, internalising the best ideas from across the organisation to set a compelling vision. Develop team members by teaching …
AWS infrastructure (EC2, S3, IAM) for scalable cloud solutions Test and debug APIs using Postman Work with graph databases (Neo4j) to enhance platform functionality Contribute to platform monitoring and observability solutions What's in it for You Gain hands-on experience with enterprise-level Java and Spring Boot development Contribute to an EdTech and Tech for Good start-up that …