built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging...
london (city of london), south east england, united kingdom
BGC Group
and core infrastructure - from development and deployment to monitoring and continuous improvement. Build and maintain robust CI/CD pipelines for both software and ML workflows. Ensure reliability, scalability, observability, and security of production systems and ML infrastructure. Automate deployment, orchestration, and environment management using modern DevOps tooling. Collaborate closely with software engineers, data scientists, and product teams to bring...
london (city of london), south east england, united kingdom
Humanoid
london, south east england, united kingdom Hybrid / WFH Options
Intellect Group
real-time services. Automate workflows and deployments using Terraform and CI/CD tools (GitHub Actions, CircleCI, or Jenkins). Support and optimize containerized environments (Docker, Kubernetes). Build observability and monitoring solutions with Prometheus, Grafana, and ELK. Manage and tune messaging systems (Kafka, RabbitMQ) for low-latency event handling. Enhance reliability, scalability, and infrastructure security through DevSecOps practices.
london (city of london), south east england, united kingdom Hybrid / WFH Options
Intellect Group
slough, south east england, united kingdom Hybrid / WFH Options
Intellect Group
SR2 | Socially Responsible Recruitment | Certified B Corporation
new products. Building and maintaining CI/CD pipelines, automated testing suites, and infrastructure-as-code setups (Docker, Kubernetes, Terraform). Monitoring system performance and continuously improving scalability, reliability, and observability. Collaborating cross-functionally with product and research teams to bring new AI features to life. You'll bring: Strong commercial experience developing production-grade systems in Python with proven experience...
understanding of containerisation (Docker, ECS, or Kubernetes). Strong scripting skills in Python, Bash, or similar. Familiarity with Linux administration, networking, and system security. Experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK stack, Datadog). Desirable Skills: Exposure to infrastructure security best practices (e.g., CIS Benchmarks, AWS Well-Architected Framework). Knowledge of configuration management (Ansible, Chef...
london, south east england, united kingdom Hybrid / WFH Options
Signify Technology
london (city of london), south east england, united kingdom Hybrid / WFH Options
Signify Technology
scale using one of GKE, AKS, EKS or RKE. Experience with kubectl and Helm; worked on EKS with kubectl. Containers: Experience deploying Java (Spring Boot) microservices in Dockerized environments. Observability: Experience in setting up tools like Prometheus/Grafana, Datadog, AppDynamics, and Splunk to give actionable intel on a microservice environment, including but not limited to synthetics, application performance monitoring, logging...
london (city of london), south east england, united kingdom
Infoplus Technologies UK Limited
strong problem-solving skills and attention to detail. Great communication and a collaborative mindset. Bonus Points: Experience with Node.js or frontend technologies like React. Familiarity with Grafana, TeamCity, or observability tooling. Interest or experience in financial services, compliance, or digital banking. If you're excited to work in a fast-paced environment where engineers lead innovation, we’d love to...
london (city of london), south east england, united kingdom
Arrows
team development. Unblock delivery challenges with pragmatic technical decision-making. Champion engineering best practices, including TDD, CI/CD, code reviews, pair programming, and GitOps. Promote sustainable approaches to observability, SRE, and DevSecOps in production systems. Act as a trusted point of contact for both internal and client-facing technical discussions. Key Skills & Experience: Proficiency in Java, Python, Go, JavaScript...
london (city of london), south east england, united kingdom
Damia Group
london (city of london), south east england, united kingdom
SoTalent
with both SQL and NoSQL databases. Strong containerization background (Kubernetes, Docker) and associated tooling (Helm, Kustomize). Practical understanding of cloud security, networking protocols, and secure deployments. Experienced in observability stacks: logging (e.g., ELK), monitoring (e.g., Prometheus), and tracing (e.g., Jaeger, OpenTelemetry). Cloud certifications (AWS, GCP, or similar) are advantageous. Excellent problem-solving skills, especially in high-availability environments (broadcasting...