built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging More ❯
and core infrastructure - from development and deployment to monitoring and continuous improvement. Build and maintain robust CI/CD pipelines for both software and ML workflows. Ensure reliability, scalability, observability, and security of production systems and ML infrastructure. Automate deployment, orchestration, and environment management using modern DevOps tooling. Collaborate closely with software engineers, data scientists, and product teams to bring More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Intellect Group
real-time services. Automate workflows and deployments using Terraform and CI/CD tools (GitHub Actions, CircleCI, or Jenkins). Support and optimize containerized environments (Docker, Kubernetes). Build observability and monitoring solutions with Prometheus, Grafana, and ELK . Manage and tune messaging systems (Kafka, RabbitMQ) for low-latency event handling. Enhance reliability, scalability, and infrastructure security through DevSecOps practices. More ❯
strong problem-solving skills and attention to detail Great communication and a collaborative mindset Bonus Points: Experience with Node.js or frontend technologies like React Familiarity with Grafana , TeamCity , or observability tooling Interest or experience in financial services , compliance, or digital banking If you're excited to work in a fast-paced environment where engineers lead innovation - we’d love to More ❯
team development Unblock delivery challenges with pragmatic technical decision-making Champion engineering best practices, including TDD, CI/CD, code reviews, pair programming, and GitOps Promote sustainable approaches to observability, SRE, and DevSecOps in production systems Act as a trusted point of contact for both internal and client-facing technical discussions Key Skills & Experience Proficiency in Java, Python, Go, JavaScript More ❯
with both SQL and NoSQL databases. Strong containerization background (Kubernetes, Docker) and associated tooling (Helm, Kustomize). Practical understanding of cloud security, networking protocols, and secure deployments. Experienced in observability stacks—logging (e.g., ELK), monitoring (e.g., Prometheus), and tracing (e.g., Jaeger, OpenTelemetry). Cloud certifications (AWS, GCP, or similar) are advantageous. Excellent problem-solving skills, especially in high-availability environments More ❯
development on AWS (e.g., Lambda, ECS/EKS, API Gateway, SQS/SNS). Experience integrating application services with CI/CD pipelines and contributing to application monitoring/observability using tools like Prometheus, Grafana, or Datadog. Experience with containerization (Docker) and a solid understanding of how application code runs within a Kubernetes or serverless environment. Leadership & Collaboration Strong mentoring More ❯
or Ansible) Experience building and maintaining CI/CD pipelines (Jenkins, GitLab CI, CircleCI) Knowledge of containerisation with Docker and Kubernetes Scripting in Python, Bash, or Go Monitoring and observability using Prometheus, Splunk, or similar Familiarity with Kafka, Akamai or Fastly, and databases like MySQL or MongoDB Excellent problem-solving and communication skills Comfortable participating in an on-call rotation More ❯
Streamline build and deployment processes using Maven, Gradle, Jenkins, and/or TeamCity to achieve continuous integration and continuous delivery (CI/CD). Set up robust tracing and observability tools, such as Stackdriver, Prometheus, Grafana, and Jaeger, to monitor system health, performance, and reliability. Collaborate closely with cross-functional teams including development, QA, and operations to troubleshoot issues, optimize More ❯
slough, south east england, united kingdom Hybrid / WFH Options
rmg digital
with cloud migrations or large-scale infrastructure modernisation projects Proficiency in at least one major cloud platform ( AWS , Azure , or GCP ) Experience with automation, CI/CD, and infrastructure observability Scripting experience in Python Excellent communication skills and a collaborative, delivery-focused mindset Contract Details 📅 Start Date: ASAP 💰 Day Rate: £500+ per day (depending on experience) ⏳ Duration: Initial 6 months More ❯
Actions, MLFlow, ZenML, or similar). Deep understanding of containerisation and orchestration tools (Docker, Kubernetes). Desirable Experience deploying AI inference engines (vLLM, Ray Serve, Triton). Familiarity with observability tools for LLMs (TruLens, Helicone, LangSmith). Understanding of AI safety and reliability frameworks (Guardrails AI). This is an exciting opportunity to help define the infrastructure powering the next More ❯
slough, south east england, united kingdom Hybrid / WFH Options
TreasurySpring
queuing technologies, i.e. RabbitMQ Experience of REST and/or GraphQL APIs Knowledge of the core AWS services: i.e. EC2/ECS, RDS, S3 Experience using DataDog or similar observability tools Knowledge of containerisation: Docker, Kubernetes, AWS Fargate etc Any experience of front-end or fullstack development using TypeScript & React Experience building software for financial services and/or investment More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Client Server
running production workloads on Kubernetes (Amazon EKS preferred) You have a good knowledge of DevOps practices including CI/CD, IaC (Terraform) and container orchestration You have experience with observability tooling You have a solid understanding of secure coding and deployment practices You're collaborative and pragmatic with great communication skills What's in it for you: Salary to £100k More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Hunter Bond
Are excited to learn more about financial markets and trading systems Bonus experience: Ruby, Spark, Trino, Kafka Financial markets exposure SQL (Postgres, Oracle) Cloud-native deployments (AWS, Docker, Kubernetes) Observability tools (Splunk, Prometheus, Grafana) Why Apply? This is a fantastic opportunity to join a high-performance engineering team in a business that invests heavily in technology and talent. You’ll More ❯
with DevOps teams to integrate Elastic into CI/CD, automation, and cloud environments. Manage client expectations and ensure effective stakeholder communication. Stay up to date with Elastic and observability best practices. Tech Skills: Extensive hands-on experience with the Elastic Stack (Elasticsearch, Kibana, Logstash, Beats, etc.) . Familiarity with DevOps practices and tools (CI/CD, automation, infrastructure-as More ❯
platform services that store and provide access to data. This is a hands-on role (70-80%) with leadership responsibilities focused on engineering excellence, raising standards in security, reliability, observability, and quality. Responsibilities Designing, developing, and maintaining high-performance, scalable, and secure backend services and APIs using technologies such as Python and NodeJS. Collaborating with science teams, full stack engineers More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Arrows
architecture and development of backend services using C#, ASP.NET, .NET Core Automate infrastructure, CI/CD pipelines, and cloud operations (AWS/Azure) Promote engineering best practices, security, and observability Mentor engineers and foster a culture of continuous improvement Contribute to technology direction, including adoption of tools like Go and Python What We’re Looking For Deep expertise in C# More ❯
including a junior engineer and a DevSecOps specialist, mentoring through example and helping to guide architectural decisions. Projects range from building new digital products to improving CI/CD, observability, and automation across a growing ecosystem. Two days each week are typically spent in the West London office, with flexibility for the rest of the week. The team is modern More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Bourne Search Ltd
the new infrastructure Required Skills & Experience 10+ years’ experience in DevOps/DevSecOps , ideally within finance or trading environments Deep technical understanding of Kubernetes (5–10 years), containers, and observability tooling Strong Python scripting or Go programming skills Proficient in Infrastructure as Code (Terraform, Helm, Ansible) Solid knowledge of Linux , networking, and storage subsystems Experience automating security controls and integrating More ❯
Provide architectural governance and enforce best practices for cloud-native development, security, and compliance. o Lead the evaluation and adoption of emerging technologies including AI-driven code analysis and observability platforms. •Leadership & Collaboration o Mentor development teams and ensure alignment with enterprise architecture standards. o Collaborate with cross-functional teams to define rollout strategies, integration plans, and transition roadmaps. o More ❯
Oxfordshire, England, United Kingdom Hybrid / WFH Options
Humand Talent
cloud and edge environments using Terraform, Helm, and Kubernetes (EKS/OpenShift) Automating deployments via CI/CD tools such as GitHub Actions and Argo CD Managing infrastructure security, observability, and cost-efficiency Collaborating with product, research, and engineering teams — and mentoring others in agentic AI About You You’ll bring experience in both AI development and systems engineering, including More ❯
oxford district, south east england, united kingdom Hybrid / WFH Options
Humand Talent
cloud and edge environments using Terraform, Helm, and Kubernetes (EKS/OpenShift) Automating deployments via CI/CD tools such as GitHub Actions and Argo CD Managing infrastructure security, observability, and cost-efficiency Collaborating with product, research, and engineering teams — and mentoring others in agentic AI About You You’ll bring experience in both AI development and systems engineering, including More ❯
including EKS/ECS, serverless, RDS/Aurora, DynamoDB, S3, CloudFront, ALB/NLB, SNS/SQS/Kinesis, KMS/Secrets), infrastructure as code (Terraform, CloudFormation, CDK), and observability practices, ensuring best-in-class standards are set and maintained across engineering teams. Experience stewarding the implementation of scalable architectures (multi-tenant, API-first, streaming/events, caching) and resiliency More ❯
pipelines and automation tools. Ensure system reliability, availability, and performance through proactive monitoring. Collaborate with developers, platform engineers, and security teams to optimise service delivery. Drive operational excellence through observability and robust incident management practices. Required Skills & Experience: Active eDV Clearance (essential). Strong background in Linux or Cloud-native environments. Hands-on experience with tools such as Terraform , Ansible More ❯