Assurance: Conduct rigorous testing, including unit, integration, and user acceptance testing (UAT), to ensure reliability, performance, and scalability of data systems. Operational Excellence: Monitor and optimize production environments using observability tools and implement CI/CD pipelines for continuous delivery using tools like Azure DevOps. Qualifications Proven experience in a senior role combining software engineering and data engineering, with a More ❯
Assurance: Conduct rigorous testing, including unit, integration, and user acceptance testing (UAT), to ensure reliability, performance, and scalability of data systems. Operational Excellence: Monitor and optimize production environments using observability tools and implement CI/CD pipelines for continuous delivery using tools like Azure DevOps. Qualifications Proven experience in a senior role combining software engineering and data engineering, with a More ❯
Assurance: Conduct rigorous testing, including unit, integration, and user acceptance testing (UAT), to ensure reliability, performance, and scalability of data systems. Operational Excellence: Monitor and optimize production environments using observability tools and implement CI/CD pipelines for continuous delivery using tools like Azure DevOps. Qualifications Proven experience in a senior role combining software engineering and data engineering, with a More ❯
strong problem-solving skills and attention to detail Great communication and a collaborative mindset Bonus Points: Experience with Node.js or frontend technologies like React Familiarity with Grafana , TeamCity , or observability tooling Interest or experience in financial services , compliance, or digital banking If you're excited to work in a fast-paced environment where engineers lead innovation - we’d love to More ❯
strong problem-solving skills and attention to detail Great communication and a collaborative mindset Bonus Points: Experience with Node.js or frontend technologies like React Familiarity with Grafana , TeamCity , or observability tooling Interest or experience in financial services , compliance, or digital banking If you're excited to work in a fast-paced environment where engineers lead innovation - we’d love to More ❯
team development Unblock delivery challenges with pragmatic technical decision-making Champion engineering best practices, including TDD, CI/CD, code reviews, pair programming, and GitOps Promote sustainable approaches to observability, SRE, and DevSecOps in production systems Act as a trusted point of contact for both internal and client-facing technical discussions Key Skills & Experience Proficiency in Java, Python, Go, JavaScript More ❯
team development Unblock delivery challenges with pragmatic technical decision-making Champion engineering best practices, including TDD, CI/CD, code reviews, pair programming, and GitOps Promote sustainable approaches to observability, SRE, and DevSecOps in production systems Act as a trusted point of contact for both internal and client-facing technical discussions Key Skills & Experience Proficiency in Java, Python, Go, JavaScript More ❯
team development Unblock delivery challenges with pragmatic technical decision-making Champion engineering best practices, including TDD, CI/CD, code reviews, pair programming, and GitOps Promote sustainable approaches to observability, SRE, and DevSecOps in production systems Act as a trusted point of contact for both internal and client-facing technical discussions Key Skills & Experience Proficiency in Java, Python, Go, JavaScript More ❯
team development Unblock delivery challenges with pragmatic technical decision-making Champion engineering best practices, including TDD, CI/CD, code reviews, pair programming, and GitOps Promote sustainable approaches to observability, SRE, and DevSecOps in production systems Act as a trusted point of contact for both internal and client-facing technical discussions Key Skills & Experience Proficiency in Java, Python, Go, JavaScript More ❯
london (city of london), south east england, united kingdom
Damia Group
team development Unblock delivery challenges with pragmatic technical decision-making Champion engineering best practices, including TDD, CI/CD, code reviews, pair programming, and GitOps Promote sustainable approaches to observability, SRE, and DevSecOps in production systems Act as a trusted point of contact for both internal and client-facing technical discussions Key Skills & Experience Proficiency in Java, Python, Go, JavaScript More ❯
Burton-On-Trent, Staffordshire, West Midlands, United Kingdom
Amtis Professional Ltd
CloudFormation or ARM templates Scripting & Automation - Proficient in PowerShell, Bash, or Python Infrastructure as Code (IaC) - Hands-on experience with Terraform, Bicep, or ARM Certified: Terraform Associate preferred Monitoring & Observability - Familiarity with tools like Azure Monitor, AWS CloudWatch, Prometheus, Grafana Security & Compliance - Strong understanding of IAM, cloud security, compliance frameworks Cloud Platform Expertise: Proven experience with AWS and Azure cloud More ❯
the delivery pipeline, introducing improvements to speed up deployment cycles and increase reliability. Work collaboratively with Engineering and Product teams to enhance developer experience and streamline release processes. Implement observability tooling and practices to ensure proactive monitoring, incident response, and system reliability. Ensure compliance with FinTech security and regulatory standards across infrastructure and platform services. Evaluate and introduce new DevOps More ❯
the delivery pipeline, introducing improvements to speed up deployment cycles and increase reliability. Work collaboratively with Engineering and Product teams to enhance developer experience and streamline release processes. Implement observability tooling and practices to ensure proactive monitoring, incident response, and system reliability. Ensure compliance with FinTech security and regulatory standards across infrastructure and platform services. Evaluate and introduce new DevOps More ❯
with both SQL and NoSQL databases. Strong containerization background (Kubernetes, Docker) and associated tooling (Helm, Kustomize). Practical understanding of cloud security, networking protocols, and secure deployments. Experienced in observability stackslogging (e.g., ELK), monitoring (e.g., Prometheus), and tracing (e.g., Jaeger, OpenTelemetry). Cloud certifications (AWS, GCP, or similar) are advantageous. Excellent problem-solving skills, especially in high-availability environments (broadcasting More ❯
with both SQL and NoSQL databases. Strong containerization background (Kubernetes, Docker) and associated tooling (Helm, Kustomize). Practical understanding of cloud security, networking protocols, and secure deployments. Experienced in observability stacks—logging (e.g., ELK), monitoring (e.g., Prometheus), and tracing (e.g., Jaeger, OpenTelemetry). Cloud certifications (AWS, GCP, or similar) are advantageous. Excellent problem-solving skills, especially in high-availability environments More ❯
with both SQL and NoSQL databases. Strong containerization background (Kubernetes, Docker) and associated tooling (Helm, Kustomize). Practical understanding of cloud security, networking protocols, and secure deployments. Experienced in observability stacks—logging (e.g., ELK), monitoring (e.g., Prometheus), and tracing (e.g., Jaeger, OpenTelemetry). Cloud certifications (AWS, GCP, or similar) are advantageous. Excellent problem-solving skills, especially in high-availability environments More ❯
with both SQL and NoSQL databases. Strong containerization background (Kubernetes, Docker) and associated tooling (Helm, Kustomize). Practical understanding of cloud security, networking protocols, and secure deployments. Experienced in observability stacks—logging (e.g., ELK), monitoring (e.g., Prometheus), and tracing (e.g., Jaeger, OpenTelemetry). Cloud certifications (AWS, GCP, or similar) are advantageous. Excellent problem-solving skills, especially in high-availability environments More ❯
with both SQL and NoSQL databases. Strong containerization background (Kubernetes, Docker) and associated tooling (Helm, Kustomize). Practical understanding of cloud security, networking protocols, and secure deployments. Experienced in observability stacks—logging (e.g., ELK), monitoring (e.g., Prometheus), and tracing (e.g., Jaeger, OpenTelemetry). Cloud certifications (AWS, GCP, or similar) are advantageous. Excellent problem-solving skills, especially in high-availability environments More ❯
london (city of london), south east england, united kingdom
SoTalent
with both SQL and NoSQL databases. Strong containerization background (Kubernetes, Docker) and associated tooling (Helm, Kustomize). Practical understanding of cloud security, networking protocols, and secure deployments. Experienced in observability stacks—logging (e.g., ELK), monitoring (e.g., Prometheus), and tracing (e.g., Jaeger, OpenTelemetry). Cloud certifications (AWS, GCP, or similar) are advantageous. Excellent problem-solving skills, especially in high-availability environments More ❯
development on AWS (e.g., Lambda, ECS/EKS, API Gateway, SQS/SNS). Experience integrating application services with CI/CD pipelines and contributing to application monitoring/observability using tools like Prometheus, Grafana, or Datadog. Experience with containerization (Docker) and a solid understanding of how application code runs within a Kubernetes or serverless environment. Leadership & Collaboration Strong mentoring More ❯
development on AWS (e.g., Lambda, ECS/EKS, API Gateway, SQS/SNS). Experience integrating application services with CI/CD pipelines and contributing to application monitoring/observability using tools like Prometheus, Grafana, or Datadog. Experience with containerization (Docker) and a solid understanding of how application code runs within a Kubernetes or serverless environment. Leadership & Collaboration Strong mentoring More ❯
development on AWS (e.g., Lambda, ECS/EKS, API Gateway, SQS/SNS). Experience integrating application services with CI/CD pipelines and contributing to application monitoring/observability using tools like Prometheus, Grafana, or Datadog. Experience with containerization (Docker) and a solid understanding of how application code runs within a Kubernetes or serverless environment. Leadership & Collaboration Strong mentoring More ❯
development on AWS (e.g., Lambda, ECS/EKS, API Gateway, SQS/SNS). Experience integrating application services with CI/CD pipelines and contributing to application monitoring/observability using tools like Prometheus, Grafana, or Datadog. Experience with containerization (Docker) and a solid understanding of how application code runs within a Kubernetes or serverless environment. Leadership & Collaboration Strong mentoring More ❯
london (city of london), south east england, united kingdom
Reelables
development on AWS (e.g., Lambda, ECS/EKS, API Gateway, SQS/SNS). Experience integrating application services with CI/CD pipelines and contributing to application monitoring/observability using tools like Prometheus, Grafana, or Datadog. Experience with containerization (Docker) and a solid understanding of how application code runs within a Kubernetes or serverless environment. Leadership & Collaboration Strong mentoring More ❯
Lead SRE/Observability Engineering Lead - (Outside IR35 Contract/Remote) Location: Bristol/London HQ - Largely Remote (Occasional Travel) Day Rate: Outside IR35 - £650 to £750 p/d Duration: 3-6 Months Initial - with intention to extend Payment Terms: Monthly Our client is a FTSE100 Wealth/Asset Management firm seeking to engage a Lead SRE Engineer (Observability … SME) to support the implementation and instrumentation of their new Observability solution. This role will be critical in delivering against our Digital OKRs by embedding observability best practices, frameworks, and tooling across digital platforms and engineering teams. Key Responsibilities: Strategy & Roadmap: Define and drive the observability roadmap in alignment with business priorities and digital platform objectives. Champion observability-by-design … manage SLIs, SLOs, and error budgets to track and improve system reliability. Support capacity and availability planning through real-time telemetry and predictive analytics. Instrumentation & Runbooks: Design and implement observability runbooks covering metrics, logs, traces, synthetics, and customer journey monitoring. Set standards for instrumentation, dashboards, alerting, and enable teams to self-serve their system metrics and traces. Implementation & Enablement: Assist More ❯