Liverpool, England, United Kingdom Hybrid/Remote Options
Love2shop
Cover on-call rotation for production support (1 week out of 6) As well as making improvements to: • Deployment automation and release management processes • Application and infrastructure monitoring and observability • Security scanning and vulnerability management in pipelines • Performance optimization and capacity planning • Development team productivity through tooling and automation What we would like from you • Strong experience with CI/ More ❯
Welwyn Garden City, England, United Kingdom Hybrid/Remote Options
PayPoint plc
Cover on-call rotation for production support (1 week out of 6) As well as making improvements to: • Deployment automation and release management processes • Application and infrastructure monitoring and observability • Security scanning and vulnerability management in pipelines • Performance optimization and capacity planning • Development team productivity through tooling and automation What we would like from you • Strong experience with CI/ More ❯
London, South East, England, United Kingdom Hybrid/Remote Options
Additional Resources Ltd
high-volume processing. Deploying and managing containerised workloads through Kubernetes, Helm, and Docker. Automating infrastructure using Infrastructure-as-Code tools such as Terraform and Ansible. Ensuring system reliability through observability, monitoring, and proactive issue resolution. Collaborating with cross-functional teams to align data solutions with wider business needs. Supporting the continuous improvement of processes, deployment, and data quality standards. What More ❯
Edinburgh, Midlothian, United Kingdom Hybrid/Remote Options
Aberdeen
internal workshops, brown bags, or tech talks to share knowledge and promote adoption of tools and practices. About the Candidate The ideal candidate will possess the following: Experience with observability tools (eg, Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security first mindset. Strong programming skills in languages such as .Net, JavaScript, Python or similar. More ❯
Mansfield, England, United Kingdom Hybrid/Remote Options
Future Talent Group
using Terraform. Implement and optimise CI/CD pipelines using GitHub Actions, Docker, and GitOps practices. Deploy, orchestrate, and manage Kubernetes (AKS/Container Apps) workloads. Configure monitoring and observability with Azure Monitor, Application Insights, Log Analytics, and OpenTelemetry. Partner with software engineering and infrastructure teams to drive DevOps best practices across the organisation. Manage security and compliance in Azure More ❯
Leeds, England, United Kingdom Hybrid/Remote Options
Fruition Group
DynamoDB, S3, IAM, and RDS. Understanding of DevOps practices, including CI/CD pipelines and automation. Strong knowledge of cloud security best practices, IAM policies, and networking. Experience with observability tools like CloudWatch, Prometheus, or Grafana. Preferred: Experience mentoring junior team members and promoting DevOps practices. Familiarity with multi-cloud environments (e.g., GCP, Azure). Knowledge of database performance optimisation. More ❯
Manchester, England, United Kingdom Hybrid/Remote Options
Suits Me
implementing AWS infrastructure and services using IaC (e.g. Terraform, CDK) Owning and improving CI/CD pipelines (e.g. GitHub Actions, Jenkins) to streamline secure, automated deployments Building and managing observability tooling (e.g. CloudWatch, Grafana, OpenTelemetry) for proactive system monitoring and alerting Developing event-driven containerised and serverless systems using Lambda, ECS and EKS Championing reliability and security, embedding best practices More ❯
Manchester, Lancashire, United Kingdom Hybrid/Remote Options
Datalex
Champion CI/CD and test automation practices across the team Performance, Caching & DevOps: Optimize performance with caching solutions (e.g., Redis, Memcached) Maintain stateless service architecture principles Contribute to observability with Prometheus, Grafana, and ELK Stack Collaborate closely with product, QA, DevOps, and platform teams Mentor junior engineers and support technical decision-making across sprints Advocate for engineering excellence, TDD More ❯
Bristol, Avon, South West, United Kingdom Hybrid/Remote Options
Sanderson Recruitment
Lead SRE/Observability Engineering Lead - (Outside IR35 Contract/Remote) Location: Bristol/London HQ - Largely Remote (Occasional Travel) Day Rate: Outside IR35 - £650 to £750 p/d Duration: 3-6 Months Initial - with intention to extend Payment Terms: Monthly Our client is a FTSE100 Wealth/Asset Management firm seeking to engage a Lead SRE Engineer (Observability … SME) to support the implementation and instrumentation of their new Observability solution. This role will be critical in delivering against our Digital OKRs by embedding observability best practices, frameworks, and tooling across digital platforms and engineering teams. Key Responsibilities: Strategy & Roadmap: Define and drive the observability roadmap in alignment with business priorities and digital platform objectives. Champion observability-by-design … manage SLIs, SLOs, and error budgets to track and improve system reliability. Support capacity and availability planning through real-time telemetry and predictive analytics. Instrumentation & Runbooks: Design and implement observability runbooks covering metrics, logs, traces, synthetics, and customer journey monitoring. Set standards for instrumentation, dashboards, alerting, and enable teams to self-serve their system metrics and traces. Implementation & Enablement: Assist More ❯
Birmingham, West Midlands, United Kingdom Hybrid/Remote Options
Inspire People
Edinburgh or Belfast. About the Role As a Senior Site Reliability Engineer, you will: - Build and scale DBT's product platform and services in AWS. - Provide development teams with observability, monitoring, CI/CD pipelines and service-level objectives. - Participate in an on-call rota (with allowance), helping to keep DBT services resilient and reliable. - Mentor junior engineers and contribute More ❯
Edinburgh, Midlothian, United Kingdom Hybrid/Remote Options
Aberdeen
Management (IAM) and Single Sign-On (SSO) solutions using tools like Azure AD, Okta and Oracle Identity Cloud Service. Establish and maintain CI/CD pipelines, test automation and observability practices using tools such as Azure DevOps, GitHub and Jenkins to streamline the development life cycle. Provide technical guidance and mentorship to junior engineers, participate in code reviews and collaborate More ❯
London, South East, England, United Kingdom Hybrid/Remote Options
Salt Search
non-functional requirements Deep understanding of microservices architecture , cloud-native applications, and API development Experience with distributed systems - managing workloads at scale using modern practices for availability, performance, and observability Knowledge and experience with Test-Driven Development (TDD) and automated testing frameworks Excellent collaboration and communication skills, with a track record of working effectively in Agile environments Familiarity with public More ❯
Belfast, Northern Ireland, United Kingdom Hybrid/Remote Options
Ocho
delivering high-quality, reusable services and components that scale across multiple projects. Guide prioritisation and decomposition of work, ensuring delivery efficiency without compromising quality. Champion test automation, measurement, and observability to support continuous improvement. Promote high-quality documentation and clear communication across engineering and product teams. Experience & Technical Skills Frontend Development Advanced proficiency in HTML5, CSS3, TypeScript, and modern JavaScript More ❯
Bristol, Gloucestershire, United Kingdom Hybrid/Remote Options
Hargreaves Lansdown PLC
Java 11+, Springboot, RDBMS and SQL Experience with unit, integration, and end-to-end testing tools and practices Experience with CI/CD and Trunk Based Development Advocate for observability, experienced in monitoring, logging, and tracing to ensure system reliability and performance Awareness of website performance implications, best practices and other non-functional requirements Proficient in collaborative code reviews, technical More ❯
Liverpool, Merseyside, England, United Kingdom Hybrid/Remote Options
Broster Buchanan
scalability and resilience in applications handling large volumes of traffic and burst events. Work collaboratively with cross-functional teams, including DevOps, Infrastructure, and Product, to deliver robust systems. Leverage observability tools to monitor, alert, and troubleshoot application and integration health. Stay current on AI-driven software development practices (e.g., GPT-assisted development, Agentic AI workflows) and suggest practical implementations. Participate More ❯
Bridgend, Wales, United Kingdom Hybrid/Remote Options
Socium - Teams Done Differently
Bring Deep knowledge of at least one major cloud provider (AWS, GCP, or Azure). Strong experience with Docker, Kubernetes, Terraform, Linux, and Bash. Familiarity with Cloudflare, SQL, and observability tools. Proficiency in at least one programming language for scripting. Excellent communication and collaboration skills when working with developers. Nice to Have: Knowledge of IoT/hardware systems. Understanding of More ❯
Warwick, England, United Kingdom Hybrid/Remote Options
Ocho
in Git, SQL optimisation, and async architecture. Excellent communicator who values clarity, documentation, and collaboration. Nice to Have Experience with Supabase , Kubernetes , Docker , Azure , GitHub Actions , vector databases , or observability tools like Prometheus , Grafana , and Langfuse . What Success Looks Like 3 months: You’ve established your 1:1 rhythm, shipped your first automation workflow, and built a trusted partnership More ❯
Leigh, Greater Manchester, United Kingdom Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
Altrincham, Greater Manchester, United Kingdom Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
Bolton, Greater Manchester, United Kingdom Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
Bury, Greater Manchester, United Kingdom Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
Leeds, West Yorkshire, United Kingdom Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯
Ashton-Under-Lyne, Greater Manchester, United Kingdom Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working More ❯