london (city of london), south east england, united kingdom
Damia Group
team development Unblock delivery challenges with pragmatic technical decision-making Champion engineering best practices, including TDD, CI/CD, code reviews, pair programming, and GitOps Promote sustainable approaches to observability, SRE, and DevSecOps in production systems Act as a trusted point of contact for both internal and client-facing technical discussions Key Skills & Experience Proficiency in Java, Python, Go, JavaScript More ❯
with both SQL and NoSQL databases. Strong containerization background (Kubernetes, Docker) and associated tooling (Helm, Kustomize). Practical understanding of cloud security, networking protocols, and secure deployments. Experienced in observability stacks—logging (e.g., ELK), monitoring (e.g., Prometheus), and tracing (e.g., Jaeger, OpenTelemetry). Cloud certifications (AWS, GCP, or similar) are advantageous. Excellent problem-solving skills, especially in high-availability environments More ❯
with both SQL and NoSQL databases. Strong containerization background (Kubernetes, Docker) and associated tooling (Helm, Kustomize). Practical understanding of cloud security, networking protocols, and secure deployments. Experienced in observability stacks—logging (e.g., ELK), monitoring (e.g., Prometheus), and tracing (e.g., Jaeger, OpenTelemetry). Cloud certifications (AWS, GCP, or similar) are advantageous. Excellent problem-solving skills, especially in high-availability environments More ❯
with both SQL and NoSQL databases. Strong containerization background (Kubernetes, Docker) and associated tooling (Helm, Kustomize). Practical understanding of cloud security, networking protocols, and secure deployments. Experienced in observability stacks—logging (e.g., ELK), monitoring (e.g., Prometheus), and tracing (e.g., Jaeger, OpenTelemetry). Cloud certifications (AWS, GCP, or similar) are advantageous. Excellent problem-solving skills, especially in high-availability environments More ❯
london (city of london), south east england, united kingdom
SoTalent
with both SQL and NoSQL databases. Strong containerization background (Kubernetes, Docker) and associated tooling (Helm, Kustomize). Practical understanding of cloud security, networking protocols, and secure deployments. Experienced in observability stacks—logging (e.g., ELK), monitoring (e.g., Prometheus), and tracing (e.g., Jaeger, OpenTelemetry). Cloud certifications (AWS, GCP, or similar) are advantageous. Excellent problem-solving skills, especially in high-availability environments More ❯
development on AWS (e.g., Lambda, ECS/EKS, API Gateway, SQS/SNS). Experience integrating application services with CI/CD pipelines and contributing to application monitoring/observability using tools like Prometheus, Grafana, or Datadog. Experience with containerization (Docker) and a solid understanding of how application code runs within a Kubernetes or serverless environment. Leadership & Collaboration Strong mentoring More ❯
development on AWS (e.g., Lambda, ECS/EKS, API Gateway, SQS/SNS). Experience integrating application services with CI/CD pipelines and contributing to application monitoring/observability using tools like Prometheus, Grafana, or Datadog. Experience with containerization (Docker) and a solid understanding of how application code runs within a Kubernetes or serverless environment. Leadership & Collaboration Strong mentoring More ❯
development on AWS (e.g., Lambda, ECS/EKS, API Gateway, SQS/SNS). Experience integrating application services with CI/CD pipelines and contributing to application monitoring/observability using tools like Prometheus, Grafana, or Datadog. Experience with containerization (Docker) and a solid understanding of how application code runs within a Kubernetes or serverless environment. Leadership & Collaboration Strong mentoring More ❯
london (city of london), south east england, united kingdom
Reelables
development on AWS (e.g., Lambda, ECS/EKS, API Gateway, SQS/SNS). Experience integrating application services with CI/CD pipelines and contributing to application monitoring/observability using tools like Prometheus, Grafana, or Datadog. Experience with containerization (Docker) and a solid understanding of how application code runs within a Kubernetes or serverless environment. Leadership & Collaboration Strong mentoring More ❯
or Ansible) Experience building and maintaining CI/CD pipelines (Jenkins, GitLab CI, CircleCI) Knowledge of containerisation with Docker and Kubernetes Scripting in Python, Bash, or Go Monitoring and observability using Prometheus, Splunk, or similar Familiarity with Kafka, Akamai or Fastly, and databases like MySQL or MongoDB Excellent problem-solving and communication skills Comfortable participating in an on-call rotation More ❯
or Ansible) Experience building and maintaining CI/CD pipelines (Jenkins, GitLab CI, CircleCI) Knowledge of containerisation with Docker and Kubernetes Scripting in Python, Bash, or Go Monitoring and observability using Prometheus, Splunk, or similar Familiarity with Kafka, Akamai or Fastly, and databases like MySQL or MongoDB Excellent problem-solving and communication skills Comfortable participating in an on-call rotation More ❯
ARM templates . Experience with AKS , Docker , and container orchestration. Understanding of networking , security , and governance in Azure. CI/CD best practices and Git branching strategies. Monitoring and observability experience using Prometheus , Grafana , or Azure-native tools . Excellent communication and collaboration skills in a cross-functional Agile environment. Nice to Have Experience with Azure DevSecOps and security scanning More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Experis UK
ARM templates . Experience with AKS , Docker , and container orchestration. Understanding of networking , security , and governance in Azure. CI/CD best practices and Git branching strategies. Monitoring and observability experience using Prometheus , Grafana , or Azure-native tools . Excellent communication and collaboration skills in a cross-functional Agile environment. Nice to Have Experience with Azure DevSecOps and security scanning More ❯
Streamline build and deployment processes using Maven, Gradle, Jenkins, and/or TeamCity to achieve continuous integration and continuous delivery (CI/CD). Set up robust tracing and observability tools, such as Stackdriver, Prometheus, Grafana, and Jaeger, to monitor system health, performance, and reliability. Collaborate closely with cross-functional teams including development, QA, and operations to troubleshoot issues, optimize More ❯
Streamline build and deployment processes using Maven, Gradle, Jenkins, and/or TeamCity to achieve continuous integration and continuous delivery (CI/CD). Set up robust tracing and observability tools, such as Stackdriver, Prometheus, Grafana, and Jaeger, to monitor system health, performance, and reliability. Collaborate closely with cross-functional teams including development, QA, and operations to troubleshoot issues, optimize More ❯
Streamline build and deployment processes using Maven, Gradle, Jenkins, and/or TeamCity to achieve continuous integration and continuous delivery (CI/CD). Set up robust tracing and observability tools, such as Stackdriver, Prometheus, Grafana, and Jaeger, to monitor system health, performance, and reliability. Collaborate closely with cross-functional teams including development, QA, and operations to troubleshoot issues, optimize More ❯
london (city of london), south east england, united kingdom
Solytics Partners
Streamline build and deployment processes using Maven, Gradle, Jenkins, and/or TeamCity to achieve continuous integration and continuous delivery (CI/CD). Set up robust tracing and observability tools, such as Stackdriver, Prometheus, Grafana, and Jaeger, to monitor system health, performance, and reliability. Collaborate closely with cross-functional teams including development, QA, and operations to troubleshoot issues, optimize More ❯
Implement and maintain MLOps best practices, including CI/CD pipelines, infrastructure as code (Terraform, Kubernetes, Docker), and ML lifecycle automation using tools like MLflow, Kubeflow, and Airflow. Drive observability, monitoring, and performance optimization for deployed AI systems. Provide technical leadership and mentorship to AI engineering teams, ensuring adherence to design patterns, coding standards, and best practices in AI and More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Hargreaves Lansdown
Figma and Vercel is a big plus. Capable of writing clean, maintainable and well-tested code. Comfortable working in on-prem and cloud-native environments with an interest in observability, using tools like Prometheus and Grafana to keep services healthy and maintainable. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and More ❯
with cloud migrations or large-scale infrastructure modernisation projects Proficiency in at least one major cloud platform ( AWS , Azure , or GCP ) Experience with automation, CI/CD, and infrastructure observability Scripting experience in Python Excellent communication skills and a collaborative, delivery-focused mindset Contract Details 📅 Start Date: ASAP 💰 Day Rate: £500+ per day (depending on experience) ⏳ Duration: Initial 6 months More ❯
City of London, London, United Kingdom Hybrid / WFH Options
rmg digital
with cloud migrations or large-scale infrastructure modernisation projects Proficiency in at least one major cloud platform ( AWS , Azure , or GCP ) Experience with automation, CI/CD, and infrastructure observability Scripting experience in Python Excellent communication skills and a collaborative, delivery-focused mindset Contract Details 📅 Start Date: ASAP 💰 Day Rate: £500+ per day (depending on experience) ⏳ Duration: Initial 6 months More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Client Server
supporting gameplay, user management, platform and content management systems, collaborating with product and game teams to ensure alignment of features with backend architecture and with DevOps to ensure uptime, observability and deployment reliability. This is a senior role where you'll take ownership of complex systems and proactively address potential performance and scalability bottlenecks. Location/WFH: You can work More ❯
Actions, MLFlow, ZenML, or similar). Deep understanding of containerisation and orchestration tools (Docker, Kubernetes). Desirable Experience deploying AI inference engines (vLLM, Ray Serve, Triton). Familiarity with observability tools for LLMs (TruLens, Helicone, LangSmith). Understanding of AI safety and reliability frameworks (Guardrails AI). This is an exciting opportunity to help define the infrastructure powering the next More ❯