/CD pipelines (GitHub Actions, GitLab CI, Azure DevOps, Jenkins) Experience with configuration management tools such as Chef/Puppet Strong proficiency in scripting/programming (Python, Go, or similar) Experience with observability platforms (Datadog, New Relic, Prometheus/Grafana) Knowledge of microservices architecture and service mesh technologies Understanding of security best practices and compliance frameworks Comfortable with asynchronous collaboration tools (Slack, Teams …
Welwyn Garden City, England, United Kingdom Hybrid/Remote Options
PayPoint plc
Cover on-call rotation for production support (1 week out of 6) As well as making improvements to: • Deployment automation and release management processes • Application and infrastructure monitoring and observability • Security scanning and vulnerability management in pipelines • Performance optimization and capacity planning • Development team productivity through tooling and automation What we would like from you • Strong experience with CI/ …
Liverpool, England, United Kingdom Hybrid/Remote Options
Love2shop
Cover on-call rotation for production support (1 week out of 6) As well as making improvements to: • Deployment automation and release management processes • Application and infrastructure monitoring and observability • Security scanning and vulnerability management in pipelines • Performance optimization and capacity planning • Development team productivity through tooling and automation What we would like from you • Strong experience with CI/ …
London, South East, England, United Kingdom Hybrid/Remote Options
Additional Resources Ltd
high-volume processing. Deploying and managing containerised workloads through Kubernetes, Helm, and Docker. Automating infrastructure using Infrastructure-as-Code tools such as Terraform and Ansible. Ensuring system reliability through observability, monitoring, and proactive issue resolution. Collaborating with cross-functional teams to align data solutions with wider business needs. Supporting the continuous improvement of processes, deployment, and data quality standards. What …
Edinburgh, Midlothian, United Kingdom Hybrid/Remote Options
Aberdeen
internal workshops, brown bags, or tech talks to share knowledge and promote adoption of tools and practices. About the Candidate The ideal candidate will possess the following: Experience with observability tools (e.g., Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security-first mindset. Strong programming skills in languages such as .NET, JavaScript, Python or similar. …
Mansfield, England, United Kingdom Hybrid/Remote Options
Future Talent Group
using Terraform. Implement and optimise CI/CD pipelines using GitHub Actions, Docker, and GitOps practices. Deploy, orchestrate, and manage Kubernetes (AKS/Container Apps) workloads. Configure monitoring and observability with Azure Monitor, Application Insights, Log Analytics, and OpenTelemetry. Partner with software engineering and infrastructure teams to drive DevOps best practices across the organisation. Manage security and compliance in Azure …
Leeds, England, United Kingdom Hybrid/Remote Options
Fruition Group
DynamoDB, S3, IAM, and RDS. Understanding of DevOps practices, including CI/CD pipelines and automation. Strong knowledge of cloud security best practices, IAM policies, and networking. Experience with observability tools like CloudWatch, Prometheus, or Grafana. Preferred: Experience mentoring junior team members and promoting DevOps practices. Familiarity with multi-cloud environments (e.g., GCP, Azure). Knowledge of database performance optimisation. …
FinOps practices. Experience with infrastructure-as-code tools (e.g., Terraform, Helm, Ansible). Familiarity with CI/CD pipelines and automation (e.g., GitHub Actions, ArgoCD, Jenkins). Experience with observability tools like Prometheus and Grafana. Knowledge of Linux systems administration and networking fundamentals, and experience with policy-as-code. Passion for platform engineering, developer experience, and site reliability. UAL is a …
Manchester, England, United Kingdom Hybrid/Remote Options
Suits Me
implementing AWS infrastructure and services using IaC (e.g. Terraform, CDK) Owning and improving CI/CD pipelines (e.g. GitHub Actions, Jenkins) to streamline secure, automated deployments Building and managing observability tooling (e.g. CloudWatch, Grafana, OpenTelemetry) for proactive system monitoring and alerting Developing event-driven containerised and serverless systems using Lambda, ECS and EKS Championing reliability and security, embedding best practices …
Manchester, Lancashire, United Kingdom Hybrid/Remote Options
Datalex
Champion CI/CD and test automation practices across the team Performance, Caching & DevOps: Optimize performance with caching solutions (e.g., Redis, Memcached) Maintain stateless service architecture principles Contribute to observability with Prometheus, Grafana, and ELK Stack Collaborate closely with product, QA, DevOps, and platform teams Mentor junior engineers and support technical decision-making across sprints Advocate for engineering excellence, TDD …
/mentoring development teams in the correct application of DevOps principles It would be great if you also have: Hands-on experience with Cloud Security Experience using Monitoring/Observability, Logging, and Alerting platforms High-level understanding of networking fundamentals covering Load balancers, DNS, virtual networks and firewalls Experience delivering technical documentation for the environments you set up Experience with serverless …
Lead SRE/Observability Engineering Lead - (Outside IR35 Contract/Remote) Location: Bristol/London HQ - Largely Remote (Occasional Travel) Day Rate: Outside IR35 - £650 to £750 p/d Duration: 3-6 Months Initial - with intention to extend Payment Terms: Monthly Our client is a FTSE100 Wealth/Asset Management firm seeking to engage a Lead SRE Engineer (Observability … SME) to support the implementation and instrumentation of their new Observability solution. This role will be critical in delivering against our Digital OKRs by embedding observability best practices, frameworks, and tooling across digital platforms and engineering teams. Key Responsibilities: Strategy & Roadmap: Define and drive the observability roadmap in alignment with business priorities and digital platform objectives. Champion observability-by-design … manage SLIs, SLOs, and error budgets to track and improve system reliability. Support capacity and availability planning through real-time telemetry and predictive analytics. Instrumentation & Runbooks: Design and implement observability runbooks covering metrics, logs, traces, synthetics, and customer journey monitoring. Set standards for instrumentation, dashboards, alerting, and enable teams to self-serve their system metrics and traces. Implementation & Enablement: Assist …
Bristol, Avon, South West, United Kingdom Hybrid/Remote Options
Sanderson Recruitment
with Helm and ArgoCD Owning CI/CD pipelines across multiple environments (GitHub Actions, Jenkins, etc.) Working closely with software engineers to streamline delivery and performance Bringing structure and observability into their environments using tools like Prometheus, Grafana, and ELK Championing DevOps best practice, security, and reliability across the engineering teams What they’re looking for Proven experience in a …
GitHub Actions, or GitLab CI. Solid understanding of containerization technologies (Docker, Kubernetes). Working knowledge of Python and SQL for automation and data pipeline development. Familiarity with monitoring and observability tools (Grafana, Prometheus, CloudWatch). Strong grasp of data architecture principles and ETL design patterns. Financial services or regulated industry experience (desirable). …
Jenkins, Spinnaker). Security & Reliability: Embed secure-by-design principles from architecture to runtime, safeguarding colleagues and customers. Performance & Automation: Define and operate against SLOs/SLIs, implement actionable observability, performance tuning, and automation to reduce toil. API-First Enablement: Enable API-first delivery with reusable patterns and golden paths for API producers and consumers. Community & Mentorship: Mentor engineers, contribute …
Birmingham, West Midlands, United Kingdom Hybrid/Remote Options
Inspire People
Edinburgh or Belfast. About the Role As a Senior Site Reliability Engineer, you will: - Build and scale DBT's product platform and services in AWS. - Provide development teams with observability, monitoring, CI/CD pipelines and service-level objectives. - Participate in an on-call rota (with allowance), helping to keep DBT services resilient and reliable. - Mentor junior engineers and contribute …
a positive, organised and team-focused approach to solving complex problems. Key Responsibilities: Design, implement and maintain infrastructure as code for development and production environments. Build, maintain and improve observability and monitoring solutions, ensuring system health and uptime, including participation in on-call rotation. Design, implement and optimise continuous delivery systems, advising development teams on resiliency, operability and best practices. …
Edinburgh, Midlothian, United Kingdom Hybrid/Remote Options
Aberdeen
Management (IAM) and Single Sign-On (SSO) solutions using tools like Azure AD, Okta and Oracle Identity Cloud Service. Establish and maintain CI/CD pipelines, test automation and observability practices using tools such as Azure DevOps, GitHub and Jenkins to streamline the development life cycle. Provide technical guidance and mentorship to junior engineers, participate in code reviews and collaborate …
Implement and manage CI/CD pipelines using Azure DevOps or GitHub Actions. Apply Infrastructure as Code (IaC) with AWS CDK, Terraform, or CloudFormation. Ensure application security, monitoring, and observability using AWS CloudWatch, X-Ray, and IAM best practices. Collaborate with cross-functional teams to deliver high-quality solutions. (Optional) Support front-end development using React.js or Blazor. Skills and …
CI/CD pipelines to enable fast, reliable releases Implement cloud-native deployments with Kubernetes, Docker, and Helm Collaborate with developers to remove delivery bottlenecks Enhance monitoring, logging, and observability Support cloud security & ISO compliance best practices Key skills: Proven experience in DevOps engineering with automated CI/CD pipelines Strong knowledge of Terraform, Azure, Kubernetes, and Helm Hands-on …
SQL queries for relational databases. Integrate and manage applications in AWS cloud environments. Collaborate with cross-functional teams to ensure smooth delivery and integration of features. Implement monitoring and observability solutions (e.g., Datadog) for system health and performance tracking. Maintain high standards of code quality, reliability, and security. Primary Skills Strong programming skills in Java and Spring Boot. Hands-on …
Infrastructure as Code: Implement and manage cloud resources using Terraform and Helm templates. Cloud Platform Management: Administer and optimize Azure services including AKS, Storage, Networking, and Virtual Machines. Monitoring & Observability: Configure monitoring, logging, and alerting with Azure Monitor, Log Analytics, and Application Insights. Security & Compliance: Apply security best practices (RBAC, Key Vault, Managed Identities) and ensure compliance with organizational standards. …
Solid understanding of DevOps principles, tools, and CI/CD pipelines using Azure DevOps, Git, and GitHub. Experience with Docker and Kubernetes containerization and orchestration. Familiarity with monitoring and observability tools: Azure Monitor, Log Analytics, Application Insights. Knowledge of identity, access management, governance, and compliance in Azure. Strong troubleshooting and root-cause analysis skills across cloud and hybrid infrastructures. Excellent …