Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid/Remote Options
WRK DIGITAL LTD
Lambda, CloudFront, RDS, etc.) and Azure (you don't need to be an expert, but being interested helps!) Promote strong engineering practices around code quality, automated testing, peer reviews, observability, and security, helping to instil a culture of quality and accountability in engineering Collaborate closely with designers, product managers, and QA to ensure solutions are user-focused, technically sound, and More ❯
Manchester, North West, United Kingdom Hybrid/Remote Options
Anson Mccade
GDS Service Standards, OAuth2.0/OIDC, Zero Trust principles and government accreditation requirements . Oversee software quality, engineering standards, testing strategies, CI/CD pipelines, IaC (Terraform/Ansible), observability and resilience . Work alongside product, delivery, user research, DevOps and data teams to align user needs, policy requirements and technical feasibility. Mentor engineering and architecture teams, fostering best-practice More ❯
best practices across the platform (IAM, secrets management, encryption) • Support compliance initiatives (ISO 27001, NIST, GDPR, MCERTS, etc.) • Manage network configuration, firewalls, and secure endpoints Monitoring & Reliability • Set up observability and monitoring tools (Prometheus, Grafana, Datadog, or CloudWatch) • Ensure high availability, scalability, and cost efficiency of cloud services • Define SLIs, SLOs, and SLAs for platform components • Troubleshoot production issues and More ❯
as Terraform or CloudFormation. Implement and manage CI/CD pipelines , enabling continuous integration and deployment of mission-critical applications. Monitor and optimise system performance, availability, and security, applying observability best practices. Collaborate in an Agile environment, engaging with stakeholders to gather requirements and deliver iterative improvements. This role allows you to apply your expertise to challenging problems while shaping More ❯
Liverpool, Merseyside, England, United Kingdom Hybrid/Remote Options
Broster Buchanan
scalability and resilience in applications handling large volumes of traffic and burst events. Work collaboratively with cross-functional teams, including DevOps, Infrastructure, and Product, to deliver robust systems. Leverage observability tools to monitor, alert, and troubleshoot application and integration health. Stay current on AI-driven software development practices (e.g., GPT-assisted development, Agentic AI workflows) and suggest practical implementations. Participate More ❯
Birmingham, Leeds, Liverpool, London (Canary Wharf), United Kingdom Hybrid/Remote Options
UK Health Security Agency
using strong problem-solving skills to identify bottlenecks with an engineering mindset. Ensure systems can handle current and future workloads through automation and capacity planning. Continuously improve services through observability, and identify ways to improve observability practices. Follow SRE principles. Guide and educate stakeholders to adopt implemented principles. Provide technical documentation for engineers. Providing training, where appropriate. Working closely with … Response & Troubleshooting Respond swiftly to production incidents, ensuring minimal downtime and quick restoration of services. Perform root cause analysis and postmortems, implementing lessons learned to prevent recurrence. Monitoring, Alerting & Observability Contribute to the design and implementation of effective monitoring and alerting systems using tools and dashboards. Improve observability of services, ensuring issues are identified and addressed before impacting users. Continuously … or similar role Coding skills in programming/scripting languages such as Python, PowerShell or Bash Understanding of Linux/Unix & Windows systems, networking, and distributed systems Experience with observability tools (e.g., Prometheus, Grafana, Datadog) and alerting systems Understanding of infrastructure automation (e.g., Terraform, Ansible, PowerShell, Helm) Excellent communication and collaboration skills Experience with security best practices Possesses problem solving More ❯
Nelson, Lancashire, England, United Kingdom Hybrid/Remote Options
Lorien
cloud infrastructure on Azure or AWS. Driving Infrastructure as Code (IaC) practices using Terraform. Building and optimising CI/CD pipelines to accelerate delivery. Implementing and maintaining monitoring and observability with Prometheus and Grafana. Enabling team collaboration and incident response through Slack and other ChatOps tools. Leading, mentoring, and supporting engineers (or preparing to step into people management if you More ❯
Newcastle Upon Tyne, Tyne and Wear, England, United Kingdom Hybrid/Remote Options
Lorien
cloud infrastructure on Azure or AWS. Driving Infrastructure as Code (IaC) practices using Terraform. Building and optimising CI/CD pipelines to accelerate delivery. Implementing and maintaining monitoring and observability with Prometheus and Grafana. Enabling team collaboration and incident response through Slack and other ChatOps tools. Leading, mentoring, and supporting engineers (or preparing to step into people management if you More ❯
Manchester, North West, United Kingdom Hybrid/Remote Options
Wilson Brown Limited
Salary : Up to £100,000 (DOE) + Equity Location : Fully remote (UK only) Stack : Python , TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling Youll join a remote first, high-trust engineering team working with a modern, cloud-native stack - with real influence over technical decisions from day one. Youll take More ❯
incidents. Flexible and adaptable to technical and business priorities. Nice-to-Have Experience supporting scientific or data-intensive applications. Background in post-mortem facilitation and follow-up. Enthusiasm for observability, performance tuning, and cost optimisation. More ❯
Beckwith, Yorkshire, United Kingdom Hybrid/Remote Options
The Bridge IT Recruitment
PaaS, governance, networking, and identity). AZ-104 and AZ-305 desirable. Skilled in scripting and automation (PowerShell required; Bicep/Terraform desirable). Experience with Azure monitoring and observability (Azure Monitor, Log Analytics, Datadog). Familiar with backup, disaster recovery, and business continuity tooling (Azure Backup, RSV, ASR). Strong working knowledge of networking concepts (VNets, VPNs, ExpressRoute, NSGs More ❯
Harrogate, North Yorkshire, Yorkshire, United Kingdom Hybrid/Remote Options
Morson Edge
PaaS, governance, networking, and identity). AZ-104 and AZ-305 desirable. Skilled in scripting and automation (PowerShell required; Bicep/Terraform desirable). Experience with Azure monitoring and observability (Azure Monitor, Log Analytics, Datadog). Familiar with backup, disaster recovery, and business continuity tooling (Azure Backup, RSV, ASR). Strong working knowledge of networking concepts (VNets, VPNs, ExpressRoute, NSGs More ❯
England, Beckwith, North Yorkshire, United Kingdom Hybrid/Remote Options
The Bridge IT Recruitment
PaaS, governance, networking, and identity). AZ-104 and AZ-305 desirable. Skilled in scripting and automation (PowerShell required; Bicep/Terraform desirable). Experience with Azure monitoring and observability (Azure Monitor, Log Analytics, Datadog). Familiar with backup, disaster recovery, and business continuity tooling (Azure Backup, RSV, ASR). Strong working knowledge of networking concepts (VNets, VPNs, ExpressRoute, NSGs More ❯
and contribute to key design decisions across teams. Lead by example, writing high-quality, maintainable code in Node.js and TypeScript. Design and optimise CI/CD pipelines, improving automation, observability, and release processes. Collaborate cross-functionally with product and platform teams to deliver robust services. Mentor and coach engineers, helping to raise the overall bar for backend development. Drive best More ❯
and contribute to key design decisions across teams. Lead by example, writing high-quality, maintainable code in Node.js and TypeScript. Design and optimise CI/CD pipelines, improving automation, observability, and release processes. Collaborate cross-functionally with product and platform teams to deliver robust services. Mentor and coach engineers, helping to raise the overall bar for backend development. Drive best More ❯
Single Sign-On (SSO)/MFA configuration and integration. SaaS Integration & API Usage Ability to integrate SaaS platforms using APIs, connectors, and automation tools (e.g., Power Platform). Monitoring & Observability Tools Experience with tools like Azure Monitor, Splunk or New Relic for proactive service monitoring and incident response. End-User Experience Optimization Understanding of digital experience monitoring and techniques to More ❯
Zero Trust models, and automation. Collaborate with managed service providers and internal teams to prioritise workloads effectively. AWS & Cloudflare Responsibilities Multi-account governance and secure AWS network design. Logging, observability, and security with CloudWatch, CloudTrail, GuardDuty, Security Hub, and Inspector, including CIS-aligned hardening. Resilience and DR planning: multi-AZ architecture, backup and restore strategies, and recovery testing. FinOps practices More ❯
Employment Type: Full-Time
Salary: £45,000 - £50,000 per annum, Inc benefits, OTE
Manchester, Lancashire, United Kingdom Hybrid/Remote Options
Morson Edge
for all aspects of the product they work on, from ideation through to supporting systems in production, so you should be confident in practices like system design, testing, deployment, observability and monitoring. This position would suit someone looking for a role with a high level of autonomy and the opportunity to work on complex technical challenges. Skills: TypeScript (Node, React More ❯
services/message buses and other architectural elements Deploy these applications using features such as containers to cloud leveraging CI/CD to support this process backed with good observability when running these in production Ensure quality through the creation of documentation and use of unit/integration/contract testing with a consideration of security/performance requirements More ❯
services/message buses and other architectural elements Deploy these applications using features such as containers to cloud leveraging CI/CD to support this process backed with good observability when running these in production Ensure quality through the creation of documentation and use of unit/integration/contract testing with a consideration of security/performance requirements. More ❯
solving approach Collaborate closely with cross-functional teams to deliver high-quality software Stay up to date with emerging technologies and industry best practices Contribute to application monitoring and observability, helping to identify and resolve system issues Participate in the full software lifecycle from design and development through deployment and maintenance Person Specification Experience Essential Relevant degree or qualification is More ❯
Leading and scaling a technically advanced team responsible for building and productionising a mission-critical backend platform. Architecting and maintaining high-availability, data-intensive systems across AWS with strong observability and monitoring foundations. Collaborating with cross-functional teams to integrate APIs and services, maintaining clean architecture principles. Driving technical quality through mentorship, test-driven development, and modern CI/CD More ❯
Leading and scaling a technically advanced team responsible for building and productionising a mission-critical backend platform. Architecting and maintaining high-availability, data-intensive systems across AWS with strong observability and monitoring foundations. Collaborating with cross-functional teams to integrate APIs and services, maintaining clean architecture principles. Driving technical quality through mentorship, test-driven development, and modern CI/CD More ❯
Newcastle upon Tyne, Tyne & Wear, United Kingdom Hybrid/Remote Options
The Bridge IT Recruitment
Technology Platform Delivery Oversee the delivery and lifecycle management of: Microsoft 365 and collaboration platforms Cloud platforms (design, automation, cost optimisation) Network and security operations (compliance, threat management) Monitoring, observability, and backup/recovery systems Ensure alignment with architectural standards and regulatory requirements (e.g., DORA, Cyber Essentials Plus). Stakeholder Engagement Act as the escalation point for unresolved issues across More ❯