Leeds, England, United Kingdom Hybrid/Remote Options
Fruition Group
DynamoDB, S3, IAM, and RDS. Understanding of DevOps practices, including CI/CD pipelines and automation. Strong knowledge of cloud security best practices, IAM policies, and networking. Experience with observability tools like CloudWatch, Prometheus, or Grafana. Preferred: Experience mentoring junior team members and promoting DevOps practices. Familiarity with multi-cloud environments (e.g., GCP, Azure). Knowledge of database performance optimisation. More ❯
Implement and manage CI/CD pipelines using Azure DevOps or GitHub Actions. Apply Infrastructure as Code (IaC) with AWS CDK, Terraform, or CloudFormation. Ensure application security, monitoring, and observability using AWS CloudWatch, X-Ray, and IAM best practices. Collaborate with cross-functional teams to deliver high-quality solutions. (Optional) Support front-end development using React.js or Blazor. Skills and More ❯
backend projects. • Knowledge of container orchestration (e.g., Kubernetes). • Experience with mobile application development (Android/iOS). • Knowledge of C# or other backend languages. • Familiarity with monitoring and observability tools (Grafana, Prometheus, etc.). • Experience with AI-assisted development tools (e.g., Copilot, ChatGPT integrations). Attributes & Behaviours • Clear, professional communication with customers and colleagues. • Strong problem-solving and troubleshooting More ❯
Leeds, West Yorkshire, United Kingdom Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
Leeds, England, United Kingdom Hybrid/Remote Options
RedRock Resourcing
in production (AKS): cluster operations, node pools, networking (CNI), RBAC and workload identity. Experience with GitOps, and container build pipelines (ACR, OPA policies, image scanning). Working knowledge of observability tooling (Azure Monitor, Log Analytics, Application Insights, Datadog/Grafana) and alerting/response workflows. Good understanding of the Microsoft Cloud Adoption Framework, Azure Landing Zones and the Well-Architected More ❯
Leeds, West Yorkshire, United Kingdom Hybrid/Remote Options
Wilson Brown
Salary : Up to £100,000 (DOE) + Equity Location : Fully remote (UK only) Stack : Python , TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling You’ll join a remote first, high-trust engineering team working with a modern, cloud-native stack - with real influence over technical decisions from day one. You More ❯
Beckwith, Yorkshire, United Kingdom Hybrid/Remote Options
The Bridge IT Recruitment
PaaS, governance, networking, and identity). AZ-104 and AZ-305 desirable. Skilled in scripting and automation (PowerShell required; Bicep/Terraform desirable). Experience with Azure monitoring and observability (Azure Monitor, Log Analytics, Datadog). Familiar with backup, disaster recovery, and business continuity tooling (Azure Backup, RSV, ASR). Strong working knowledge of networking concepts (VNets, VPNs, ExpressRoute, NSGs More ❯
England, Beckwith, North Yorkshire, United Kingdom Hybrid/Remote Options
The Bridge IT Recruitment
PaaS, governance, networking, and identity). AZ-104 and AZ-305 desirable. Skilled in scripting and automation (PowerShell required; Bicep/Terraform desirable). Experience with Azure monitoring and observability (Azure Monitor, Log Analytics, Datadog). Familiar with backup, disaster recovery, and business continuity tooling (Azure Backup, RSV, ASR). Strong working knowledge of networking concepts (VNets, VPNs, ExpressRoute, NSGs More ❯
establish team governance frameworks, and ensure adoption of deployment best practices across the team Infrastructure as Code: Infrastructure as Code with Terraform, ARM Templates or AWS CloudFormation/CDK Observability Strategy: DataDog, Application Insights or Amazon CloudWatch implementation with comprehensive observability strategy and cloud governance Cloud Strategy: Strategic SAAS using Azure (Functions, Queue, Blob Storage, VMs, etc) or AWS (Lambda … testing strategy, establish quality governance for the team, and measure team adoption with continuous improvement initiatives Application Security: Confident vulnerability management, thread modelling and tracking Production Support: Knowledge of observability and production support practices Essential Experience Requirements 6+ years in software engineering, minimum 1 year of formal people management experience or 2+ years of technical leadership with mentoring responsibilities Demonstrable More ❯
sheffield, south yorkshire, yorkshire and the humber, united kingdom
SBS
evolve our secure, scalable, and resilient cloud platform, the backbone that powers our products and development teams. You’ll be part of a collaborative engineering community that values automation, observability, and continuous improvement, helping us deliver reliable, high-performance services at scale. What will the role involve? Designing and implementing highly available, scalable infrastructure in AWS. Ensuring our cloud environments … Partnering with SRE and software teams to define shared reliability goals and deliver on them. Championing reliability practices such as SLOs, error budgets, and blameless post-incident reviews. Enhancing observability with robust metrics, logging, and tracing for better operational visibility and faster response times. Working closely with security and compliance teams to maintain safe, auditable environments aligned with regulatory standards. … experience. Experience with CI/CD tools such as AWS CodeCommit (or similar). Hands-on experience with Kubernetes (EKS, AKS, or equivalent) for container orchestration. Solid knowledge of observability stacks, monitoring, alerting, and self-healing systems. Experience enforcing reliability and security standards in cloud environments. Excellent collaboration and communication skills. A mindset focused on simplification, standardisation, and continuous improvement. More ❯
Leeds, West Yorkshire, United Kingdom Hybrid/Remote Options
Opus Recruitment Solutions
AWS | GCP | SRE | Site Reliability Engineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability Are you looking for a genuinely Remote opportunity? Somewhere you're part of something bigger, working on a global product within a close-knit SRE team? I've partnered a WebApp that provide an end to end event management for some of the … planet's biggest artists and they're now looking for a SRE. Someone that knows their way around classic Observability with Grafana, ELK stack, and cost optomisation for the product as they continue scaling. Working across the glove their multi-tenanted, AWS environments requires someone who is able to reverse engineer product faults, or post incident audits to ensure future … like to hear more, send over a CV to robin.shaw@opusrs.com or apply! AWS | GCP | SRE | Site Reliability Engineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | ObservabilityMore ❯
the identity, confidentiality and authenticity of trillions of dollars of transactions globally. We are looking for an SME in cyber security and cryptography who can help improve their HSM observability for their entire encryption estate. The existing HSM observability is ready to be taken to the next level, improving its resilience, increased monitoring capabilities and faster alerting. We specifically require … team members - as SME for HSMs, but also other HSMs. Work closely with stakeholders to understand requirement details. Write a design and test specification for your responsibility in the observability project Contribute to documentation of the project Help define the roadmap for continual improvements in the management of cryptographic services Flag potential issues timely, think outside the box and be More ❯