Demonstrated expertise in ICAP implementation and proxy server integration for content adaptation and security enforcement. Hands-on scripting and programming experience for automation (e.g., Python, Java). Exposure to observability practices, including monitoring, logging, metrics, and traces, to ensure operational excellence in deployments. Experience supporting U.K. Government customers, with knowledge of security clearance processes and compliance requirements. Work/Life …
pipelines handling millions of requests with low latency. Deploy and operate services on Kubernetes and Docker, leveraging AWS infrastructure such as EC2, S3, Lambda, and RDS. Implement monitoring and observability using tools like Grafana and Prometheus to track system performance. Collaborate with product, frontend, and analytics teams to deliver features that make a tangible impact on user experience. Contribute to …
strong problem-solving skills and attention to detail. Great communication and a collaborative mindset. Bonus Points: Experience with Node.js or frontend technologies like React. Familiarity with Grafana, TeamCity, or observability tooling. Interest or experience in financial services, compliance, or digital banking. If you're excited to work in a fast-paced environment where engineers lead innovation - we’d love to …
the delivery pipeline, introducing improvements to speed up deployment cycles and increase reliability. Work collaboratively with Engineering and Product teams to enhance developer experience and streamline release processes. Implement observability tooling and practices to ensure proactive monitoring, incident response, and system reliability. Ensure compliance with FinTech security and regulatory standards across infrastructure and platform services. Evaluate and introduce new DevOps …
development on AWS (e.g., Lambda, ECS/EKS, API Gateway, SQS/SNS). Experience integrating application services with CI/CD pipelines and contributing to application monitoring/observability using tools like Prometheus, Grafana, or Datadog. Experience with containerization (Docker) and a solid understanding of how application code runs within a Kubernetes or serverless environment. Leadership & Collaboration: Strong mentoring …
evolve existing topologies and connectivity solutions. Escalation point for connectivity issues with cloud systems and resources. Monitor and optimise performance across hybrid, cross-tenancy, and multi-cloud environments. Implement observability tooling. Ensure compliance with ISO 27001 standards, Zero Trust principles, and Well-Architected Frameworks. Maintain accurate documentation and mentor junior engineers. This role gives you the opportunity to deliver secure, modern …
City of London, London, United Kingdom Hybrid/Remote Options
Motilent
services, infrastructure automation, and cloud security best practices. Background in medical imaging, life sciences, or other regulated software environments (e.g. ISO 13485, IEC 62304). Familiarity with DevOps pipelines, observability, and automated deployment at scale. Demonstrated ability to mentor junior engineers, lead code reviews, and shape architectural decisions. Knowledge of DICOM/DICOMWeb standards or healthcare data integration workflows. Our …
/mentoring development teams in the correct application of DevOps principles. It would be great if you also have: Hands-on experience with Cloud Security. Experience using Monitoring/Observability, Logging, and Alerting platforms. High-level understanding of networking fundamentals covering load balancers, DNS, virtual networks and firewalls. Experience delivering technical documentation for the environments you set up. Experience with serverless …
Bristol, Avon, South West, United Kingdom Hybrid/Remote Options
Sanderson Recruitment
Lead SRE/Observability Engineering Lead - (Outside IR35 Contract/Remote). Location: Bristol/London HQ - Largely Remote (Occasional Travel). Day Rate: Outside IR35 - £650 to £750 p/d. Duration: 3-6 Months Initial - with intention to extend. Payment Terms: Monthly. Our client is a FTSE 100 Wealth/Asset Management firm seeking to engage a Lead SRE Engineer (Observability … SME) to support the implementation and instrumentation of their new Observability solution. This role will be critical in delivering against our Digital OKRs by embedding observability best practices, frameworks, and tooling across digital platforms and engineering teams. Key Responsibilities: Strategy & Roadmap: Define and drive the observability roadmap in alignment with business priorities and digital platform objectives. Champion observability-by-design … manage SLIs, SLOs, and error budgets to track and improve system reliability. Support capacity and availability planning through real-time telemetry and predictive analytics. Instrumentation & Runbooks: Design and implement observability runbooks covering metrics, logs, traces, synthetics, and customer journey monitoring. Set standards for instrumentation, dashboards, and alerting, and enable teams to self-serve their system metrics and traces. Implementation & Enablement: Assist …
pipelines. Drive platform modernisation. Manage a small team of engineers. Align DevOps capabilities with the wider business. Champion DevEx, reliability, and security. Embed operational excellence and incident response. Promote observability and performance optimisation. Lead DevOps Engineer Requirements: Proven line management experience. Cloud-native expertise (any cloud provider is fine: GCP, AWS or Azure). Knowledge of GitLab CI/CD, Terraform …
with Helm and ArgoCD. Owning CI/CD pipelines across multiple environments (GitHub Actions, Jenkins, etc.). Working closely with software engineers to streamline delivery and performance. Bringing structure and observability into their environments using tools like Prometheus, Grafana, and ELK. Championing DevOps best practice, security, and reliability across the engineering teams. What they’re looking for: Proven experience in a …
City of London, London, United Kingdom Hybrid/Remote Options
Experis UK
ARM templates. Experience with AKS, Docker, and container orchestration. Understanding of networking, security, and governance in Azure. CI/CD best practices and Git branching strategies. Monitoring and observability experience using Prometheus, Grafana, or Azure-native tools. Excellent communication and collaboration skills in a cross-functional Agile environment. Nice to Have: Experience with Azure DevSecOps and security scanning …
GitHub Actions, or GitLab CI. Solid understanding of containerization technologies (Docker, Kubernetes). Working knowledge of Python and SQL for automation and data pipeline development. Familiarity with monitoring and observability tools (Grafana, Prometheus, CloudWatch). Strong grasp of data architecture principles and ETL design patterns. Financial services or regulated industry experience (desirable).
Jenkins, Spinnaker). Security & Reliability: Embed secure-by-design principles from architecture to runtime, safeguarding colleagues and customers. Performance & Automation: Define and operate against SLOs/SLIs, implement actionable observability, performance tuning, and automation to reduce toil. API-First Enablement: Enable API-first delivery with reusable patterns and golden paths for API producers and consumers. Community & Mentorship: Mentor engineers, contribute …
Birmingham, West Midlands, United Kingdom Hybrid/Remote Options
Inspire People
Edinburgh or Belfast. About the Role: As a Senior Site Reliability Engineer, you will: - Build and scale DBT's product platform and services in AWS. - Provide development teams with observability, monitoring, CI/CD pipelines and service-level objectives. - Participate in an on-call rota (with allowance), helping to keep DBT services resilient and reliable. - Mentor junior engineers and contribute …
a positive, organised and team-focused approach to solving complex problems. Key Responsibilities: Design, implement and maintain infrastructure as code for development and production environments. Build, maintain and improve observability and monitoring solutions, ensuring system health and uptime, including participation in on-call rotation. Design, implement and optimise continuous delivery systems, advising development teams on resiliency, operability and best practices.
dozens of Trimble products and millions of users. Key Responsibilities: Develop and maintain infrastructure as code (IaC) using Terraform to ensure reliable and scalable cloud environments. Implement and enhance observability solutions using tools like New Relic, Datadog, Sumo Logic and Splunk for monitoring, logging, and alerting. Perform code deployments and manage CI/CD pipelines using Jenkins, GitHub, and related tooling …