swindon, wiltshire, south west england, united kingdom Hybrid/Remote Options
Humana
Become a part of our caring community and help us put health first Why Join Enterprise Observability Engineering? The Enterprise Observability Engineering team is a high-impact, high-autonomy group focused on building intelligent, scalable, and resilient observability solutions. We foster a culture of innovation, agility, and ownership—empowering engineers to solve complex problems, drive strategic initiatives, and shape the … challenges, and leading with technical excellence—this is the team for you. About the Role We're looking for a Lead Software Engineer with deep expertise in logging and observability engineering. You should be fluent in the principles of open telemetry, log ingestion, and event correlation across distributed systems. While familiarity with platforms like Splunk or Dynatrace is a plus … to design resilient, scalable logging solutions using the best-fit tools for the environment. As a Lead Software Engineer, you will drive the design, implementation, and evolution of our observability and logging platforms. You'll lead enterprise-scale initiatives, mentor engineers, and collaborate across disciplines to ensure our systems are reliable, scalable, and performant. Applying deep technical expertise to solve More ❯
City Of Westminster, London, United Kingdom Hybrid/Remote Options
Additional Resources
high-volume processing. Deploying and managing containerised workloads through Kubernetes, Helm, and Docker. Automating infrastructure using Infrastructure-as-Code tools such as Terraform and Ansible. Ensuring system reliability through observability, monitoring, and proactive issue resolution. Collaborating with cross-functional teams to align data solutions with wider business needs. Supporting the continuous improvement of processes, deployment, and data quality standards. What More ❯
Westminster, City of Westminster, Greater London, United Kingdom Hybrid/Remote Options
Additional Resources
high-volume processing. Deploying and managing containerised workloads through Kubernetes, Helm, and Docker. Automating infrastructure using Infrastructure-as-Code tools such as Terraform and Ansible. Ensuring system reliability through observability, monitoring, and proactive issue resolution. Collaborating with cross-functional teams to align data solutions with wider business needs. Supporting the continuous improvement of processes, deployment, and data quality standards. What More ❯
London, South East, England, United Kingdom Hybrid/Remote Options
Additional Resources Ltd
high-volume processing. Deploying and managing containerised workloads through Kubernetes, Helm, and Docker. Automating infrastructure using Infrastructure-as-Code tools such as Terraform and Ansible. Ensuring system reliability through observability, monitoring, and proactive issue resolution. Collaborating with cross-functional teams to align data solutions with wider business needs. Supporting the continuous improvement of processes, deployment, and data quality standards. What More ❯
Edinburgh, Midlothian, United Kingdom Hybrid/Remote Options
Aberdeen
internal workshops, brown bags, or tech talks to share knowledge and promote adoption of tools and practices. About the Candidate The ideal candidate will possess the following: Experience with observability tools (eg, Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security first mindset. Strong programming skills in languages such as .Net, JavaScript, Python or similar. More ❯
london, south east england, united kingdom Hybrid/Remote Options
Mercor
fault-tolerant microservices. Build and maintain CI/CD pipelines, deployment workflows, and infrastructure-as-code. Manage Kubernetes clusters, cloud infrastructure (AWS/GCP), and container orchestration. Implement monitoring, observability, and security best practices. Collaborate with backend and AI teams to optimize system performance and reliability. Continuously improve automation, deployment speed, and operational efficiency. Requirements 3+ years of experience in More ❯
queues (eg, Kafka, RabbitMQ) for Real Time data processing. Experience with automated testing frameworks and continuous delivery tools like Jenkins, GitLab CI, or CircleCI. Understanding of performance monitoring and observability tools such as CloudWatch, Prometheus, or Datadog. Interested? Please Apply! Golang Go AWS Kubernetes Docker Terraform Bank Banking Finance Financial Services Crypto Blockchain Web3 Trading Exchange Digital Assets Hybrid Flexible More ❯
DevOps & SRE Practices Experience implementing CI/CD pipelines and DevOps methodologies Knowledge of infrastructure monitoring (Datadog), log aggregation, and incident management Understanding of SLO/SLA definition and observability best practices Strategic & Business Acumen Ability to align technical initiatives with business objectives and articulate ROI Experience creating technical roadmaps and conducting cost-benefit analyses Track record presenting to C More ❯
london, south east england, united kingdom Hybrid/Remote Options
Mott MacDonald
production-grade products, and with product managers to shape roadmaps based on technical feasibility and user value. DevOps & CI/CD: Support cloud-native deployment pipelines, automated testing, and observability for everything we build. Champion software engineering excellence: Drive continuous improvement across software engineering culture, codebases, and development practices. What You'll Bring Clear communicator, with the ability to engage More ❯
high-quality, well-tested code using modern testing frameworks and patterns. Occasionally contributing to simple React/Next.js based UI screens for configuration or internal tools. Improving system reliability, observability, and developer efficiency through metrics, logging, and automation. Working closely with cross-functional teams in an agile environment, contributing to planning, delivery, and code reviews. Supporting continuous improvement of codebases More ❯
Lead SRE/Observability Engineering Lead - (Outside IR35 Contract/Remote) Location: Bristol/London HQ - Largely Remote (Occasional Travel) Day Rate: Outside IR35 - £650 to £750 p/d Duration: 3-6 Months Initial - with intention to extend Payment Terms: Monthly Our client is a FTSE100 Wealth/Asset Management firm seeking to engage a Lead SRE Engineer (Observability … SME) to support the implementation and instrumentation of their new Observability solution. This role will be critical in delivering against our Digital OKRs by embedding observability best practices, frameworks, and tooling across digital platforms and engineering teams. Key Responsibilities: Strategy & Roadmap: Define and drive the observability roadmap in alignment with business priorities and digital platform objectives. Champion observability-by-design … manage SLIs, SLOs, and error budgets to track and improve system reliability. Support capacity and availability planning through real-time telemetry and predictive analytics. Instrumentation & Runbooks: Design and implement observability runbooks covering metrics, logs, traces, synthetics, and customer journey monitoring. Set standards for instrumentation, dashboards, alerting, and enable teams to self-serve their system metrics and traces. Implementation & Enablement: Assist More ❯
Bristol, Avon, South West, United Kingdom Hybrid/Remote Options
Sanderson Recruitment
Lead SRE/Observability Engineering Lead - (Outside IR35 Contract/Remote) Location: Bristol/London HQ - Largely Remote (Occasional Travel) Day Rate: Outside IR35 - £650 to £750 p/d Duration: 3-6 Months Initial - with intention to extend Payment Terms: Monthly Our client is a FTSE100 Wealth/Asset Management firm seeking to engage a Lead SRE Engineer (Observability … SME) to support the implementation and instrumentation of their new Observability solution. This role will be critical in delivering against our Digital OKRs by embedding observability best practices, frameworks, and tooling across digital platforms and engineering teams. Key Responsibilities: Strategy & Roadmap: Define and drive the observability roadmap in alignment with business priorities and digital platform objectives. Champion observability-by-design … manage SLIs, SLOs, and error budgets to track and improve system reliability. Support capacity and availability planning through real-time telemetry and predictive analytics. Instrumentation & Runbooks: Design and implement observability runbooks covering metrics, logs, traces, synthetics, and customer journey monitoring. Set standards for instrumentation, dashboards, alerting, and enable teams to self-serve their system metrics and traces. Implementation & Enablement: Assist More ❯
pipelines Drive platform modernisation Manage a small team of engineers Align DevOps capabilities with the wider business Champion DevEx, reliability, and security Embed operational excellence and incident response Promote observability and performance optimisation Lead DevOps Engineer Requirements Proven line management experience Cloud-native expertise (any cloud provider is fine: GCP, AWS or Azure) Knowledge of GitLab CI/CD, Terraform More ❯
with Helm and ArgoCD Owning CI/CD pipelines across multiple environments (GitHub Actions, Jenkins, etc.) Working closely with software engineers to streamline delivery and performance Bringing structure and observability into their environments using tools like Prometheus, Grafana, and ELK Championing DevOps best practice, security, and reliability across the engineering teams What they’re looking for Proven experience in a More ❯
london, south east england, united kingdom Hybrid/Remote Options
FindErnest
virtualization (NFV) and service mesh (e.g., Istio). Exposure to service orchestration and management frameworks (ONAP, OSM). Contributions to open-source telecom projects are a plus. Knowledge of observability tools (Prometheus, Grafana, Jaeger, ELK stack). Linux scripting - Shell scripting, Python Knowledge and experience with Test Automation tools such as Jenkins, Robot or similar Has led a small team More ❯
Edinburgh, Midlothian, United Kingdom Hybrid/Remote Options
Aberdeen
Management (IAM) and Single Sign-On (SSO) solutions using tools like Azure AD, Okta and Oracle Identity Cloud Service. Establish and maintain CI/CD pipelines, test automation and observability practices using tools such as Azure DevOps, GitHub and Jenkins to streamline the development life cycle. Provide technical guidance and mentorship to junior engineers, participate in code reviews and collaborate More ❯
strongly typed programming language and one dynamic programming. ideally Rust & nodeJS Experience with Public Cloud providers, ideally AWS Experience with CI/CD tooling and pipelines Any experience with Observability platforms such as Grafana would be advantageous. Our Commitment to Diversity and Inclusion Build your job in a place that thrives on diversity, inclusion, and belonging. We believe in maintaining More ❯
Strong expertise in implementing Site Reliability Engineering (SRE) principles. Advanced knowledge of establishing observability using tools Dynatrace & Datadog (primary skills). Proficiency in automation & scripting using Python & Ansible (primary skills). Strong experience with cloud platforms AWS & Azure (primary skills). Solid understanding of containerization and orchestration tools like Docker and Kubernetes . Proficiency in cloud native distributed systems & microservices More ❯
london, south east england, united kingdom Hybrid/Remote Options
IO TECH SOLUTIONS LIMITED
skills (Python and/or Bash). Experience with infrastructure-as-code tooling (Terraform, Ansible). Nice-to-Have Containerization (Docker/Kubernetes/EKS) in production. Monitoring and observability tools (Prometheus, Grafana, ELK, Splunk). Experience managing vendor relationships or external cloud providers. Why Youll Love This Job Work in a fast-paced, cutting-edge crypto environment. Small, flat More ❯
GitHub Actions, or similar Knowledge of microservices architecture and containerization (Docker, Kubernetes, OpenShift Exposure to enterprise-scale distributed systems in the banking/financial domain. Familiarity with monitoring and observability tools (Grafana, Dynatrace, Splunk, etc. More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Client Server
supporting gameplay, user management, platform and content management systems, collaborating with product and game teams to ensure alignment of features with backend architecture and with DevOps to ensure uptime, observability and deployment reliability. This is a senior role where you'll take ownership of complex systems and proactively address potential performance and scalability bottlenecks. Location/WFH: You can work More ❯
Strong experience with AWS (VPCs, EC2, ECS/EKS, RDS, S3, etc.) Solid understanding of database systems (Postgres, SQL Server) IaC mastery (Terraform, CloudFormation, Ansible) Passion for monitoring and observability (Grafana, Elastic, PagerDuty, etc.) Familiarity with configuration management tools (Puppet, etc.) Git, Docker, and scripting skills (bash or similar) A collaborative mindset and the ability to communicate technical concepts clearly More ❯
Strong experience with AWS (VPCs, EC2, ECS/EKS, RDS, S3, etc.) Solid understanding of database systems (Postgres, SQL Server) IaC mastery (Terraform, CloudFormation, Ansible) Passion for monitoring and observability (Grafana, Elastic, PagerDuty, etc.) Familiarity with configuration management tools (Puppet, etc.) Git, Docker, and scripting skills (bash or similar) A collaborative mindset and the ability to communicate technical concepts clearly More ❯
twins, and operational intelligence. Define and maintain asset hierarchies, semantic models, and metadata frameworks for contextualized industrial data. Implement CI/CD pipelines for data workflows and ensure lineage, observability, and compliance across environments. Collaborate with AI/ML teams to support model training, deployment, and monitoring using MLOps frameworks. Establish and enforce data governance policies, stewardship models, and metadata More ❯