London, England, United Kingdom Hybrid / WFH Options
Deutsche Bank
services environment Strong technical skills in Linux/Unix systems, SQL, and scripting Strong experience with a programming language such as Python, Java, etc Strong experience with monitoring and observability tools (Prometheus, Grafana, Splunk, Geneos, OpenTelemetry, Corvil) Familiarity with cloud platforms, containerization (e.g., Kubernetes, Docker), and CI (Continuous Integration)/CD (continuous Delivery) pipelines Strong understanding of the trade lifecycle More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
CME Group Inc
both independently and collaboratively. Key Responsibilities: Collaborate with senior SREs and Product engineering teams to monitor, maintain, and troubleshoot our Markets systems. Collaborate with Product teams to continuously improve observability and alerting of our applications to enable data-driven business decision, faster issue detection and incident resolution. Take accountability for delivery of moderately-complex features. Lead technical discussions for own More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
CME Group
both independently and collaboratively. Key Responsibilities Collaborate with senior SREs and Product engineering teams to monitor, maintain, and troubleshoot our Markets systems. Collaborate with Product teams to continuously improve observability and alerting of our applications to enable data-driven business decision, faster issue detection and incident resolution. Take accountability for delivery of moderately-complex features. Lead technical discussions for own More ❯
London, England, United Kingdom Hybrid / WFH Options
BBC
as-Code with AWS CDK, CloudFormation to provision and manage cloud environments. Build and maintain CI/CD pipelines using GitHub Actions, AWS CodePipeline, CodeBuild, Jenkins. Integrate monitoring and observability tools such as AWS CloudWatch, Prometheus, Grafana for infrastructure and model health tracking. Ensure software quality through Test-Driven Development (TDD), unit testing frameworks (e.g., pytest, unittest), and automated integration More ❯
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Hargreaves Lansdown
Experience with unit, integration, and end to end testing tools and practices (e.g. Jest, Cypress, Backstop, Playwright). Experience with CI/CD and Trunk Based Development. Experience with observability tools and practices, including monitoring, logging, and tracing to ensure system reliability and performance. Understanding of Microservices & principles of RESTful API development, including structuring, documenting, versioning, testing and stubbing/ More ❯
mentoring engineers and collaborating with stakeholders. Proven ability to resolve technical incidents in unfamiliar production systems. Technical and process documentation champion. Experience of operationally managing production software components, including observability, logging, metrics, error reporting, debugging, and live incident management. Your time will be spent roughly as follows: 60% - Proactive technical work (e.g. migrating DB hosting provider, new message bus system More ❯
or DevOps Expertise in microservices and API design Docker, and container runtime platforms such as Kubernetes, EKS, ECS etc Strong understand of operational concepts on AWS, particularly monitoring and observability, FinOps Utilising CI/CD tools, such as Bamboo, Jenkins, TeamCity, Bitbucket, in order to streamline delivery of new features and fixes Continual testing of code using Automated Testing Frameworks More ❯
London, England, United Kingdom Hybrid / WFH Options
Smartcat Platform Inc
familiar with DevOps tools and processes. Confidently navigate through Platform Infrastructure. Day 60 Join the process of being on duty in a team, be able to analyze problems, use observability/monitoring tools and handle investigations. Support Production releases and address blockers of CI/CD process. Day 90 Complete two quarter deliverable in alignment with Outcomes. WHAT YOU’VE More ❯
London, England, United Kingdom Hybrid / WFH Options
Kadence Limited
operations. Manage and enhance our container orchestration stack using Kubernetes (EKS) and Docker. Develop and maintain robust, scalable CI/CD pipelines with Jenkins, GitHub Actions, and ArgoCD. Strengthen observability across the platform through effective monitoring, logging, and alerting (AWS services, Grafana, etc). Contribute to platform security through infrastructure hardening, role-based access controls, and infrastructure as code (Terraform … CI/CD pipelines using Jenkins, GitHub Actions, and/or ArgoCD. Familiarity with infrastructure as code practices using Terraform, CloudFormation, or similar tools. A solid grasp of system observability, monitoring, and alerting practices (CloudWatch, Grafana, or equivalent). Exposure to platform security principles including identity/access management, secrets handling, and environment isolation. Strong scripting and automation skills (e.g. … Desktop: Cross platform desktop app built with Electron (TypeScript). Cloud & DevOps: AWS (20+ services), Kubernetes (EKS), Docker, Infrastructure as Code (CloudFormation, Terraform), CI/CD (Jenkins, GitHub Actions), Observability (AWS, Grafana). Development tools: GitHub, Jira, Notion, ChatGPT, Gemini, LangChain, AI-native IDE's (Cursor, JetBrains), LLM-powered internal tools. Test automation: Cypress (E2E), Postman (API), Jest (frontend unit More ❯
London, England, United Kingdom Hybrid / WFH Options
Durlston Partners
automation and internal tools for deployment, monitoring, and incident response Tune performance across OS, network, and cloud layers — this role is hands-on and detail-oriented Improve system resilience, observability, and security in a high-stakes production environment Requirements: Fluent in Linux — not just using it, but understanding how it works under the hood Advanced terminal skills — manipulating systems efficiently … time environments Hands-on with Docker (Kubernetes is a plus), infrastructure-as-code, and CI/CD tooling Strong scripting and automation experience in Python and Bash Familiarity with observability stacks (Prometheus, OpenTelemetry, eBPF) Cloud infrastructure experience (AWS/GCP/Azure), with attention to IAM and software supply chain security Curious, persistent, and comfortable experimenting at the lowest levels More ❯
London, England, United Kingdom Hybrid / WFH Options
CFP Energy (UK) Ltd
and enhance CI/CD pipelines, infrastructure/app templates, and automation workflows. Explore and integrate emerging technologies to evolve our platform offerings and support developer needs. Fine-tune observability tools to resolve issues quickly and deliver actionable alerts to the right people. Ideal candidate: Infrastructure as Code (IaC): Proven experience with cloud infrastructure automation (Terraform and Azure preferred). … GitOps workflows and Helm charts. Security: Hands-on experience with token/secret management tools (e.g., HashiCorp Vault, Azure Key Vault) and SSO/authentication systems (e.g., Okta). Observability: Hands-on experience with platforms like DataDog, Grafana, or Azure Monitor. Networking: Strong understanding of networking principles, DNS, and related technologies. CI/CD: Skilled in creating and maintaining CI More ❯
London, England, United Kingdom Hybrid / WFH Options
Ikerian
scalable AWS cloud environments and services. Manage and prioritise tasks in the cloud infrastructure backlog to address immediate needs and plan long-term improvements. Set up infrastructure monitoring and observability solutions, proactively addressing availability, performance or security issues. Assess new technologies, systems, and services for production readiness, ensuring seamless and stable integration. Prepare and maintain documentation on cloud processes, procedures … CI/CD pipelines and tools, including GitLab (preferred), GitHub Actions, Jenkins, etc. Basic understanding of cloud networking concepts, including VPC, Subnets, and Load Balancing. Familiarity with monitoring and observability tools for cloud environments, such as Grafana, Prometheus, OpenSearch, and the ELK stack. Strong analytical and problem-solving skills, with a proactive approach to challenges. A genuine interest in staying More ❯
Architect for Scale & Resilience: Make critical decisions on system design and performance to support a growing platform with increasing complexity and scale. Elevate Operational Maturity: Lead improvements to monitoring, observability, and developer workflows - ensuring backend systems are resilient and teams can ship confidently. Embed Security by Design: Take responsibility for backend security posture, ensuring systems meet best practices and compliance … and SQS. Infrastructure as Code: Experience with Terraform or similar tools for infrastructure automation. High-Throughput Systems: Strong experience in real production projects handling large-scale data flows. Monitoring & Observability: Proficiency in tools like Datadog, Prometheus, and Grafana. Security & Networking: Solid understanding of networking principles, security best practices, and cloud security. Agile & Fast-Paced Environments: Experience in agile teams, working More ❯
London, England, United Kingdom Hybrid / WFH Options
CF Pathways Limited
and enhance CI/CD pipelines, infrastructure/app templates, and automation workflows. Explore and integrate emerging technologies to evolve our platform offerings and support developer needs. Fine-tune observability tools to resolve issues quickly and deliver actionable alerts to the right people. Ideal Candidate Infrastructure as Code (IaC): Proven experience with cloud infrastructure automation (Terraform and Azure preferred). … GitOps workflows and Helm charts. Security: Hands-on experience with token/secret management tools (e.g., HashiCorp Vault, Azure Key Vault) and SSO/authentication systems (e.g., Okta). Observability: Hands-on experience with platforms like DataDog, Grafana, or Azure Monitor. Networking: Strong understanding of networking principles, DNS, and related technologies. CI/CD: Skilled in creating and maintaining CI More ❯
London, England, United Kingdom Hybrid / WFH Options
Elwood Technologies Services Limited
environment. Automate manual processes and workflows, reducing operational overhead. Work closely with engineering teams to design and deploy scalable, fault-tolerant infrastructure solutions on AWS or GCP . Improve observability by utilizing monitoring, logging, and alerting systems (e.g., CloudWatch , Datadog ). Lead post-incident reviews , contribute to the continuous improvement of system reliability and follow up on strategic fixes. Develop … you have experience of some or all of the following: Experience with client-impact triage , working cross-functionally with account managers or product teams. Proficiency with Datadog or similar observability platforms. Knowledge of serverless architectures (e.g., AWS Lambda, GCP Cloud Functions). Familiarity with RDBMS and NoSQL databases , such as RDS, CloudSQL, DynamoDB. Prior experience in fintech , trading platforms, or More ❯
London, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
and will help clients adopt modern DevOps practices with a strong emphasis on automation, self-service, and operational excellence. Tech You'll Use: Terraform & GitHub Actions CI/CD, observability tooling (Grafana, Prometheus), containerisation (Docker) What You'll Be Doing: Designing and implementing secure, resilient AWS infrastructure Building CI/CD pipelines and reusable deployment patterns Advising on cloud-native More ❯
Edinburgh, Scotland, United Kingdom Hybrid / WFH Options
JR United Kingdom
ideally with Terraform or CloudFormation. Hands-on experience with CI/CD pipelines and automation tooling. Background in containerisation and orchestration – e.g., Docker, Kubernetes. Familiarity with monitoring, alerting, and observability tools (e.g., Prometheus, Grafana, CloudWatch). Proven ability to troubleshoot and resolve complex infrastructure issues. Experience working in cross-functional engineering teams, ideally in a DevOps or SRE capacity. Strong More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
strong track record of building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with cloud platforms such as AWS, Azure, or GCP, including infrastructure as code tools like Terraform or CloudFormation. Strong scripting More ❯
London, England, United Kingdom Hybrid / WFH Options
Arcus Search
and will help clients adopt modern DevOps practices with a strong emphasis on automation, self-service, and operational excellence. Tech You'll Use: Terraform & GitHub Actions CI/CD, observability tooling (Grafana, Prometheus), containerisation (Docker) What You'll Be Doing: Designing and implementing secure, resilient AWS infrastructure Building CI/CD pipelines and reusable deployment patterns Advising on cloud-native More ❯
London, England, United Kingdom Hybrid / WFH Options
Anson McCade Pty
to automate provisioning. • Deploy and manage Kubernetes solutions, including AKS, EKS, and OpenShift. • Implement DevSecOps practices, integrating CI/CD pipelines and security controls. • Optimize cloud environments using FinOps, observability tooling, and SRE methodologies. • Work closely with Cloud Architects, Engineers, and Business Leaders to build scalable, high-performance platforms. • Enhance networking and security capabilities across hybrid cloud environments. The ideal More ❯
London, England, United Kingdom Hybrid / WFH Options
0840 Deutsche Bank Aktiengesellschaft, Filiale London
SRE, or DevOps within trading or financial services Strong Linux/Unix, SQL, and scripting skills Experience with programming languages such as Python or Java Experience with monitoring and observability tools (e.g., Prometheus, Grafana, Splunk, Geneos, OpenTelemetry, Corvil) Familiarity with cloud platforms, containerization (Kubernetes, Docker), and CI/CD pipelines Knowledge of trade lifecycle, trading systems, FX products, market structure More ❯
London, England, United Kingdom Hybrid / WFH Options
One World GTM
in Site Reliability Engineering. Mentor engineers, advocate for DevOps culture, and drive improvements across development, security, and operations. Stay ahead of the curve by adopting the latest AWS services, observability tools (Grafana), and Kubernetes based architectures. Implement security best practices and governance controls to protect critical systems and data. Work closely with cross-functional teams to support fast, efficient, and More ❯