team development Unblock delivery challenges with pragmatic technical decision-making Champion engineering best practices, including TDD, CI/CD, code reviews, pair programming, and GitOps Promote sustainable approaches to observability, SRE, and DevSecOps in production systems Act as a trusted point of contact for both internal and client-facing technical discussions Key Skills & Experience Proficiency in Java, Python, Go, JavaScript More ❯
london (city of london), south east england, united kingdom
Damia Group
team development Unblock delivery challenges with pragmatic technical decision-making Champion engineering best practices, including TDD, CI/CD, code reviews, pair programming, and GitOps Promote sustainable approaches to observability, SRE, and DevSecOps in production systems Act as a trusted point of contact for both internal and client-facing technical discussions Key Skills & Experience Proficiency in Java, Python, Go, JavaScript More ❯
with DevOps teams to integrate Elastic into CI/CD, automation, and cloud environments. Manage client expectations and ensure effective stakeholder communication. Stay up to date with Elastic and observability best practices. Tech Skills: Extensive hands-on experience with the Elastic Stack (Elasticsearch, Kibana, Logstash, Beats, etc.) . Familiarity with DevOps practices and tools (CI/CD, automation, infrastructure-as More ❯
london (city of london), south east england, united kingdom
NETbuilder
with DevOps teams to integrate Elastic into CI/CD, automation, and cloud environments. Manage client expectations and ensure effective stakeholder communication. Stay up to date with Elastic and observability best practices. Tech Skills: Extensive hands-on experience with the Elastic Stack (Elasticsearch, Kibana, Logstash, Beats, etc.) . Familiarity with DevOps practices and tools (CI/CD, automation, infrastructure-as More ❯
london (city of london), south east england, united kingdom
Bayforest Technologies
trade reporting workflows for internal and external stakeholders (brokers, investors, and counterparties). Develop and automate data pipelines for research, trading signals, risk metrics, and performance analytics. Enhance system observability using monitoring tools such as Grafana and Prometheus. Work closely with developers and researchers to enhance post-trade analytics and reporting. Design, build, and maintain CI/CD pipelines for More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Experis
Stack Highlights AWS (ECS, S3, DynamoDB, Aurora, OpenSearch) Pulumi (TypeScript) for infrastructure as code Kafka (Confluent Cloud) for event-driven architecture GitHub Actions for CI/CD DataDog for observability Containerised microservices architecture What We’re Looking For Strong programming background (Java or TypeScript preferred) Experience designing scalable, resilient cloud infrastructure Familiarity with event-driven systems and Kafka Hands-on More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Experis
Stack Highlights AWS (ECS, S3, DynamoDB, Aurora, OpenSearch) Pulumi (TypeScript) for infrastructure as code Kafka (Confluent Cloud) for event-driven architecture GitHub Actions for CI/CD DataDog for observability Containerised microservices architecture What We’re Looking For Strong programming background (Java or TypeScript preferred) Experience designing scalable, resilient cloud infrastructure Familiarity with event-driven systems and Kafka Hands-on More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Tenth Revolution Group
version control , automated testing , and modular design Supporting the development of a lakehouse architecture using Apache Iceberg Collaborating with product and business teams to deliver data-driven solutions Embedding observability and quality checks into data workflows Participating in code reviews, pair programming, and architectural discussions Gaining domain knowledge in financial data and sharing insights with the team What They're More ❯
Knowledge of Data Fabric and Data Mesh concepts, with practical experience in their implementation. Strong understanding of Software Development Lifecycle (SDLC) principles in a data platform context. Expertise in observability, monitoring, and security for data platforms. Strong experience in DevOps, automation, CI/CD, and Infrastructure-as-Code (IaC). Proven track record of leading large technical teams and driving More ❯
with Product, Data Science, and Operations teams Mentor developers, promote best practices, and improve engineering workflows Shape technical strategy and contribute to long-term system improvements Drive code quality, observability, and resiliency across services Tech Stack Frontend : React, JavaScript/TypeScript Backend : Python (FastAPI, Flask, or Django), ideally with geospatial data processing Cloud : AWS (Lambda, ECS, RDS, S3, API Gateway More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Huxley Associates
availability, secure deployments, and efficient agent orchestration using AKS. You will create and maintain CI/CD pipelines for Azure services, Semantic Kernel agents, manage Kubernetes clusters, and integrate observability tools to monitor system health and performance. You'll also ensure alignment with enterprise-grade security practices, including zero trust principles, identity-aware routing, and integration with Azure API Management More ❯
City of London, London, United Kingdom Hybrid / WFH Options
La Fosse
modern development practices, IaC, DevOps, and cloud infrastructure Experience working with or managing teams using graph databases, search technologies, and data pipelines Familiarity with IaC, GitHub Copilot, and modern observability tooling (e.g., Grafana) A strong ability to run skills gaps analysis, utilising delivery data, and an understanding how to use them to improve team effectiveness Experience guiding teams through change More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
La Fosse
modern development practices, IaC, DevOps, and cloud infrastructure Experience working with or managing teams using graph databases, search technologies, and data pipelines Familiarity with IaC, GitHub Copilot, and modern observability tooling (e.g., Grafana) A strong ability to run skills gaps analysis, utilising delivery data, and an understanding how to use them to improve team effectiveness Experience guiding teams through change More ❯
designed infrastructure that scales without slowing anyone down. Tame complex LLM infrastructure (real-time usage, flaky providers, token routing - the lot). Raise the quality bar across the board: observability, auth, reliability, and more. This isn't a role for passengers. It's for engineers who love ambiguity, thrive under pressure, and see infrastructure as a multiplier. What We're More ❯
core software products. Expect a collaborative engineering culture, modern cloud-native stack, and plenty of freedom to influence tooling, architecture, and reliability practices. If you’re passionate about automation, observability, and designing systems that just don’t fail , this is the perfect environment for you. Tech Stack Cloud: AWS (EC2, RDS, S3, IAM, Lambda, CloudWatch) Containerisation & Orchestration: Docker, Kubernetes (EKS … Infrastructure as Code: Terraform Configuration Management: Ansible Monitoring & Observability: Prometheus, Grafana, ELK Stack CI/CD: GitHub Actions Scripting & Automation: Python, Bash or Go What You’ll Be Doing Designing and maintaining reliable, scalable, and secure infrastructure for production systems. Automating operational tasks and improving system efficiency. Implementing observability tooling to monitor system health, performance, and capacity. Working closely with … how reliability and performance are engineered at scale. Work with talented developers and DevOps engineers in a collaborative environment. AWS | Site Reliability | SRE | Cloud | Kubernetes | Terraform | CI/CD | Observability | Python | Go | Automation Click “APPLY NOW” to be considered for this position! Follow ReVybe IT Recruitment to stay up to date with the latest Cloud, Platform & SRE opportunities. More ❯
DevOps, infrastructure, and platform engineering. Tech Stack Cloud: AWS (EC2, RDS, S3, IAM, CloudWatch, Lambda) Infrastructure as Code: Terraform Containerisation & Orchestration: Docker, Kubernetes (EKS), Helm Configuration Management: Ansible Monitoring & Observability: Grafana, Prometheus CI/CD: GitHub Actions Automation & Scripting: Python, Bash, Go or Java What We’re Looking For Proven experience running AWS cloud infrastructure in a production or regulated … financial) environment. Hands-on experience managing Kubernetes clusters (preferably EKS). Strong understanding of Infrastructure as Code using Terraform. Familiarity with monitoring and observability stacks such as Prometheus and Grafana. Experience building and maintaining CI/CD pipelines (GitHub Actions or similar). Strong scripting or automation skills using Python, Bash, Go or Java . A collaborative mindset — comfortable working More ❯
AWS (Core Services – EC2, RDS, S3, IAM, Lambda, CloudWatch) Infrastructure as Code: Terraform Containerisation & Orchestration: Docker, Kubernetes (EKS), Helm Configuration Management: Ansible CI/CD Pipelines: GitHub Actions Monitoring & Observability: Grafana, Prometheus Scripting/Automation: Python or Java What We’re Looking For Proven experience managing and scaling AWS cloud environments , ideally supporting live software products or high-traffic platforms. … Strong background in Terraform and Infrastructure as Code best practices. Practical experience with Kubernetes (EKS) in production. Familiarity with monitoring and observability tools such as Grafana and Prometheus. Hands-on experience building CI/CD pipelines (GitHub Actions, Jenkins, CircleCI, etc.). Solid scripting and automation experience using Python or Java . A collaborative engineer who enjoys working closely with More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Devonshire Hayes
and documentation skills. Desirable: Relevant certifications (CCNA, CCNP, CompTIA Network+/Security+, Azure Administrator, etc.). Experience with infrastructure automation or scripting (PowerShell, Python). Knowledge of monitoring and observability tools (SolarWinds, PRTG, Grafana, etc.). Experience with ITIL practices and ServiceNow or equivalent ticketing systems. Personal Attributes Technically curious, proactive, and solutions oriented. Confident engaging with technical and business More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Devonshire Hayes
and documentation skills. Desirable: Relevant certifications (CCNA, CCNP, CompTIA Network+/Security+, Azure Administrator, etc.). Experience with infrastructure automation or scripting (PowerShell, Python). Knowledge of monitoring and observability tools (SolarWinds, PRTG, Grafana, etc.). Experience with ITIL practices and ServiceNow or equivalent ticketing systems. Personal Attributes Technically curious, proactive, and solutions oriented. Confident engaging with technical and business More ❯
make technical decisions and help shape how platform engineering is done as the team continues to scale. Tech stack AWS (Core services - EC2, RDS, S3, IAM, etc.) Monitoring and Observability Grafana, Prometheus Kubernetes (building and managing production clusters) Terraform (IaC provisioning) Python, Bash or Go (scripting, automation) GitHub Actions (CI/CD pipelines) What They’re Looking For Experience in … AWS cloud infrastructure (ideally in a regulated or high-traffic environment) Previous experience working with Monitoring and Observability Tools Hands-on Kubernetes know-how, specifically with EKS. Solid IaC experience with Terraform. Experience with containerisation (Docker, Helm) and CI/CD (GitHub Actions or similar) Solid scripting/Automation experience with Python, Bash or Go A good communicator who enjoys More ❯
and help shape how platform engineering is done as the team continues to scale. Tech stack AWS (Core services - EC2, RDS, S3, IAM, etc.) Configuration Management Ansible Monitoring and Observability Grafana, Prometheus Kubernetes (building and managing production clusters) Terraform (IaC provisioning) Python or Java (scripting, automation) GitHub Actions (CI/CD pipelines) What They’re Looking For Experience in AWS … cloud infrastructure (ideally in a regulated or high-traffic environment) Previous experience working with Monitoring and Observability Tools Hands-on Kubernetes know-how, specifically with EKS. Solid IaC experience with Terraform. Experience with containerisation (Docker, Helm) and CI/CD (GitHub Actions or similar) Solid scripting/Automation experience with Python or Java A good communicator who enjoys working collaboratively More ❯
Modeling and Performance tuning. Should have experience in designing and developing dashboards Strong Knowledge in Hadoop, Kafka, SQL/NoSQL Should have experience in creating roadmap to improve platform Observability Experience in leading mid-scale teams with strong communication skills Experience in Machine Learning and GCP would be added advantage Must have experience in Banking or Insurance domain Must have More ❯
london (city of london), south east england, united kingdom
HCLTech
Modeling and Performance tuning. Should have experience in designing and developing dashboards Strong Knowledge in Hadoop, Kafka, SQL/NoSQL Should have experience in creating roadmap to improve platform Observability Experience in leading mid-scale teams with strong communication skills Experience in Machine Learning and GCP would be added advantage Must have experience in Banking or Insurance domain Must have More ❯
Establish best practices for prompt engineering, model safety, bias mitigation, and responsible AI. Ensure compliance with data privacy regulations (GDPR, HIPAA, etc.) and internal governance policies. Define monitoring and observability strategies for GenAI systems in production. Stakeholder Engagement Translate business requirements into technical specifications and solution blueprints. Present architectural decisions and trade-offs to technical and non-technical stakeholders. Support More ❯