services, infrastructure automation, and cloud security best practices. Background in medical imaging, life sciences, or other regulated software environments (e.g. ISO 13485, IEC 62304). Familiarity with DevOps pipelines, observability, and automated deployment at scale. Demonstrated ability to mentor junior engineers, lead code reviews, and shape architectural decisions. Knowledge of DICOM/DICOMWeb standards or healthcare data integration workflows. Our More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Motilent
services, infrastructure automation, and cloud security best practices. Background in medical imaging, life sciences, or other regulated software environments (e.g. ISO 13485, IEC 62304). Familiarity with DevOps pipelines, observability, and automated deployment at scale. Demonstrated ability to mentor junior engineers, lead code reviews, and shape architectural decisions. Knowledge of DICOM/DICOMWeb standards or healthcare data integration workflows. Our More ❯
/mentoring development teams in the correct application of DevOps principles It would be great if you also have: Hands-on experience with Cloud Security Experience using Monitoring/Observability, Logging, and Alerting platforms High level understanding of networking fundamentals covering Load balancers, DNS, virtual networks and firewalls Experience delivering technical documentation for the environments you setup Experience with serverless More ❯
/mentoring development teams in the correct application of DevOps principles It would be great if you also have: Hands-on experience with Cloud Security Experience using Monitoring/Observability, Logging, and Alerting platforms High level understanding of networking fundamentals covering Load balancers, DNS, virtual networks and firewalls Experience delivering technical documentation for the environments you setup Experience with serverless More ❯
Lead SRE/Observability Engineering Lead - (Outside IR35 Contract/Remote) Location: Bristol/London HQ - Largely Remote (Occasional Travel) Day Rate: Outside IR35 - £650 to £750 p/d Duration: 3-6 Months Initial - with intention to extend Payment Terms: Monthly Our client is a FTSE100 Wealth/Asset Management firm seeking to engage a Lead SRE Engineer (Observability … SME) to support the implementation and instrumentation of their new Observability solution. This role will be critical in delivering against our Digital OKRs by embedding observability best practices, frameworks, and tooling across digital platforms and engineering teams. Key Responsibilities: Strategy & Roadmap: Define and drive the observability roadmap in alignment with business priorities and digital platform objectives. Champion observability-by-design … manage SLIs, SLOs, and error budgets to track and improve system reliability. Support capacity and availability planning through real-time telemetry and predictive analytics. Instrumentation & Runbooks: Design and implement observability runbooks covering metrics, logs, traces, synthetics, and customer journey monitoring. Set standards for instrumentation, dashboards, alerting, and enable teams to self-serve their system metrics and traces. Implementation & Enablement: Assist More ❯
Bristol, Avon, South West, United Kingdom Hybrid/Remote Options
Sanderson Recruitment
Lead SRE/Observability Engineering Lead - (Outside IR35 Contract/Remote) Location: Bristol/London HQ - Largely Remote (Occasional Travel) Day Rate: Outside IR35 - £650 to £750 p/d Duration: 3-6 Months Initial - with intention to extend Payment Terms: Monthly Our client is a FTSE100 Wealth/Asset Management firm seeking to engage a Lead SRE Engineer (Observability … SME) to support the implementation and instrumentation of their new Observability solution. This role will be critical in delivering against our Digital OKRs by embedding observability best practices, frameworks, and tooling across digital platforms and engineering teams. Key Responsibilities: Strategy & Roadmap: Define and drive the observability roadmap in alignment with business priorities and digital platform objectives. Champion observability-by-design … manage SLIs, SLOs, and error budgets to track and improve system reliability. Support capacity and availability planning through real-time telemetry and predictive analytics. Instrumentation & Runbooks: Design and implement observability runbooks covering metrics, logs, traces, synthetics, and customer journey monitoring. Set standards for instrumentation, dashboards, alerting, and enable teams to self-serve their system metrics and traces. Implementation & Enablement: Assist More ❯
with Helm and ArgoCD Owning CI/CD pipelines across multiple environments (GitHub Actions, Jenkins, etc.) Working closely with software engineers to streamline delivery and performance Bringing structure and observability into their environments using tools like Prometheus, Grafana, and ELK Championing DevOps best practice, security, and reliability across the engineering teams What they’re looking for Proven experience in a More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Experis UK
ARM templates . Experience with AKS , Docker , and container orchestration. Understanding of networking , security , and governance in Azure. CI/CD best practices and Git branching strategies. Monitoring and observability experience using Prometheus , Grafana , or Azure-native tools . Excellent communication and collaboration skills in a cross-functional Agile environment. Nice to Have Experience with Azure DevSecOps and security scanning More ❯
ARM templates . Experience with AKS , Docker , and container orchestration. Understanding of networking , security , and governance in Azure. CI/CD best practices and Git branching strategies. Monitoring and observability experience using Prometheus , Grafana , or Azure-native tools . Excellent communication and collaboration skills in a cross-functional Agile environment. Nice to Have Experience with Azure DevSecOps and security scanning More ❯
GitHub Actions, or GitLab CI. Solid understanding of containerization technologies (Docker, Kubernetes). Working knowledge of Python and SQL for automation and data pipeline development. Familiarity with monitoring and observability tools (Grafana, Prometheus, CloudWatch). Strong grasp of data architecture principles and ETL design patterns. Financial services or regulated industry experience (desirable). More ❯
Jenkins, Spinnaker). Security & Reliability: Embed secure-by-design principles from architecture to runtime, safeguarding colleagues and customers. Performance & Automation: Define and operate against SLOs/SLIs, implement actionable observability, performance tuning, and automation to reduce toil. API-First Enablement: Enable API-first delivery with reusable patterns and golden paths for API producers and consumers. Community & Mentorship: Mentor engineers, contribute More ❯
Birmingham, West Midlands, United Kingdom Hybrid/Remote Options
Inspire People
Edinburgh or Belfast. About the Role As a Senior Site Reliability Engineer, you will: - Build and scale DBT's product platform and services in AWS. - Provide development teams with observability, monitoring, CI/CD pipelines and service-level objectives. - Participate in an on-call rota (with allowance), helping to keep DBT services resilient and reliable. - Mentor junior engineers and contribute More ❯
Implement and manage CI/CD pipelines using Azure DevOps or GitHub Actions. Apply Infrastructure as Code (IaC) with AWS CDK, Terraform, or CloudFormation. Ensure application security, monitoring, and observability using AWS CloudWatch, X-Ray, and IAM best practices. Collaborate with cross-functional teams to deliver high-quality solutions. (Optional) Support front-end development using React.js or Blazor. Skills and More ❯
CI/CD pipelines to enable fast, reliable releases Implement cloud-native deployments with Kubernetes , Docker , and Helm Collaborate with developers to remove delivery bottlenecks Enhance monitoring, logging, and observability Support cloud security & ISO compliance best practices Key skills: Proven experience in DevOps engineering with automated CI/CD pipelines Strong knowledge of Terraform , Azure , Kubernetes , and Helm Hands-on More ❯
SQL queries for relational databases. Integrate and manage applications in AWS cloud environments. Collaborate with cross-functional teams to ensure smooth delivery and integration of features. Implement monitoring and observability solutions (e.g., Datadog) for system health and performance tracking. Maintain high standards of code quality, reliability, and security. Primary Skills Strong programming skills in Java and Spring Boot. Hands-on More ❯
supporting gameplay, user management, platform and content management systems, collaborating with product and game teams to ensure alignment of features with backend architecture and with DevOps to ensure uptime, observability and deployment reliability. This is a senior role where you'll take ownership of complex systems and proactively address potential performance and scalability bottlenecks. Location/WFH: You can work More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Client Server
supporting gameplay, user management, platform and content management systems, collaborating with product and game teams to ensure alignment of features with backend architecture and with DevOps to ensure uptime, observability and deployment reliability. This is a senior role where you'll take ownership of complex systems and proactively address potential performance and scalability bottlenecks. Location/WFH: You can work More ❯
Actions, MLFlow, ZenML, or similar). Deep understanding of containerisation and orchestration tools (Docker, Kubernetes). Desirable Experience deploying AI inference engines (vLLM, Ray Serve, Triton). Familiarity with observability tools for LLMs (TruLens, Helicone, LangSmith). Understanding of AI safety and reliability frameworks (Guardrails AI). This is an exciting opportunity to help define the infrastructure powering the next More ❯
Actions, MLFlow, ZenML, or similar). Deep understanding of containerisation and orchestration tools (Docker, Kubernetes). Desirable Experience deploying AI inference engines (vLLM, Ray Serve, Triton). Familiarity with observability tools for LLMs (TruLens, Helicone, LangSmith). Understanding of AI safety and reliability frameworks (Guardrails AI). This is an exciting opportunity to help define the infrastructure powering the next More ❯
Strong experience with AWS (VPCs, EC2, ECS/EKS, RDS, S3, etc.) Solid understanding of database systems (Postgres, SQL Server) IaC mastery (Terraform, CloudFormation, Ansible) Passion for monitoring and observability (Grafana, Elastic, PagerDuty, etc.) Familiarity with configuration management tools (Puppet, etc.) Git, Docker, and scripting skills (bash or similar) A collaborative mindset and the ability to communicate technical concepts clearly More ❯
Solid understanding of DevOps principles, tools, and CI/CD pipelines using Azure DevOps, Git, and GitHub. Experience with Docker and Kubernetes containerization and orchestration. Familiarity with monitoring and observability tools: Azure Monitor, Log Analytics, Application Insights. Knowledge of identity, access management, governance, and compliance in Azure. Strong troubleshooting and root-cause analysis skills across cloud and hybrid infrastructures. Excellent More ❯
Strong experience with AWS (VPCs, EC2, ECS/EKS, RDS, S3, etc.) Solid understanding of database systems (Postgres, SQL Server) IaC mastery (Terraform, CloudFormation, Ansible) Passion for monitoring and observability (Grafana, Elastic, PagerDuty, etc.) Familiarity with configuration management tools (Puppet, etc.) Git, Docker, and scripting skills (bash or similar) A collaborative mindset and the ability to communicate technical concepts clearly More ❯
twins, and operational intelligence. Define and maintain asset hierarchies, semantic models, and metadata frameworks for contextualized industrial data. Implement CI/CD pipelines for data workflows and ensure lineage, observability, and compliance across environments. Collaborate with AI/ML teams to support model training, deployment, and monitoring using MLOps frameworks. Establish and enforce data governance policies, stewardship models, and metadata More ❯
City, London, United Kingdom Hybrid/Remote Options
client server
issues and optimising system performance and reliability You can code with Python or Golang You have a strong understanding of DevOps tooling, CI/CD, Gitlab, Jenkins and infrastructure observability tools such as Splunk, Prometheus and Grafana You have strong experience with containerisation technologies such as Docker and orchestration platforms like Kubernetes You have experience within a financial services or More ❯