City Of Westminster, London, United Kingdom Hybrid/Remote Options
Additional Resources
high-volume processing. Deploying and managing containerised workloads through Kubernetes, Helm, and Docker. Automating infrastructure using Infrastructure-as-Code tools such as Terraform and Ansible. Ensuring system reliability through observability, monitoring, and proactive issue resolution. Collaborating with cross-functional teams to align data solutions with wider business needs. Supporting the continuous improvement of processes, deployment, and data quality standards. What More ❯
Westminster, City of Westminster, Greater London, United Kingdom Hybrid/Remote Options
Additional Resources
high-volume processing. Deploying and managing containerised workloads through Kubernetes, Helm, and Docker. Automating infrastructure using Infrastructure-as-Code tools such as Terraform and Ansible. Ensuring system reliability through observability, monitoring, and proactive issue resolution. Collaborating with cross-functional teams to align data solutions with wider business needs. Supporting the continuous improvement of processes, deployment, and data quality standards. What More ❯
DevOps & SRE Practices Experience implementing CI/CD pipelines and DevOps methodologies Knowledge of infrastructure monitoring (Datadog), log aggregation, and incident management Understanding of SLO/SLA definition and observability best practices Strategic & Business Acumen Ability to align technical initiatives with business objectives and articulate ROI Experience creating technical roadmaps and conducting cost-benefit analyses Track record presenting to C More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Client Server
supporting gameplay, user management, platform and content management systems, collaborating with product and game teams to ensure alignment of features with backend architecture and with DevOps to ensure uptime, observability and deployment reliability. This is a senior role where you'll take ownership of complex systems and proactively address potential performance and scalability bottlenecks. Location/WFH: You can work More ❯
City of London, London, United Kingdom Hybrid/Remote Options
83zero Limited
of experimentation, curiosity, and bold thinking. The Role As Dev/Ops Lead, you'll: Architect and optimise a high-scale, cloud-native PaaS. Champion CI/CD, automation, observability, and reliability. Design and maintain secure, performant public APIs. Build real-time, distributed systems on AWS (Lambda, DynamoDB, Kinesis, ECS/EKS). Mentor a cross-functional team and elevate More ❯
london (city of london), south east england, united kingdom Hybrid/Remote Options
Bupa Occupational Health
scripting skills in SQL, Python, or Bash for automation and tooling. Solid understanding of Snowflake security features such as data masking, encryption, identity federation, and network policies. Familiarity with observability practices, including query profiling, usage tracking, and integration with monitoring tools. Demonstrated ability to optimise performance and manage costs in large-scale Snowflake environments. Excellent collaboration and communication skills, with More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Hargreaves Lansdown
our services. About You Experience building and deploying services with Java and Spring Boot. Comfort working in a cloud-native environment - Kubernetes (EKS), containers, scaling etc. An understanding of observability, using tools like Prometheus and Grafana to keep services healthy and understand usage patterns. Familiarity with some AWS services and how to integrate them into modern applications. A keen focus More ❯
london (city of london), south east england, united kingdom Hybrid/Remote Options
Gravitee
Helm Charts Cloud experience (AWS and/or Azure) Even better if you also have skills across: Certificate management (ZeroSSL, Let's Encrypt) Argo Workflows & ArgoCD Continuous Delivery tooling Observability tools (Grafana, Prometheus) ESSENTIAL SKILLS The right candidate will possess at least the following skills, if not more: 3+ years of professional experience in infrastructure management Fluent with creating and More ❯
City of London, London, England, United Kingdom Hybrid/Remote Options
Lorien
and implement robust testing (unit/integration/contract). Collaborate closely with trading, risk, and operations to refine requirements and ship increments in Agile sprints. Harden production with observability (logging/metrics/tracing), CI/CD, and secure-by-design patterns. Own features end-to-end-from design and documentation to deployment and support. What you'll bring More ❯
with Product, Data Science, and Operations teams Mentor developers, promote best practices, and improve engineering workflows Shape technical strategy and contribute to long-term system improvements Drive code quality, observability, and resiliency across services Tech Stack Frontend : React, JavaScript/TypeScript Backend : Python (FastAPI, Flask, or Django), ideally with geospatial data processing Cloud : AWS (Lambda, ECS, RDS, S3, API Gateway More ❯
other internal teams to fully understand client requirements and deliver tailored technical solutions. Design and implement scalable, future-proof architectures for new third-party connectors and integrations. Enhance system observability by improving diagnostics, logging, and tracing to aid technical support teams in resolving issues swiftly. Oversee the ongoing development and management of the public API, covering REST and event streaming More ❯
DevOps, infrastructure, and platform engineering. Tech Stack Cloud: AWS (EC2, RDS, S3, IAM, CloudWatch, Lambda) Infrastructure as Code: Terraform Containerisation & Orchestration: Docker, Kubernetes (EKS), Helm Configuration Management: Ansible Monitoring & Observability: Grafana, Prometheus CI/CD: GitHub Actions Automation & Scripting: Python, Bash, Go or Java What We’re Looking For Proven experience running AWS cloud infrastructure in a production or regulated … financial) environment. Hands-on experience managing Kubernetes clusters (preferably EKS). Strong understanding of Infrastructure as Code using Terraform. Familiarity with monitoring and observability stacks such as Prometheus and Grafana. Experience building and maintaining CI/CD pipelines (GitHub Actions or similar). Strong scripting or automation skills using Python, Bash, Go or Java . A collaborative mindset — comfortable working More ❯
AWS (Core Services – EC2, RDS, S3, IAM, Lambda, CloudWatch) Infrastructure as Code: Terraform Containerisation & Orchestration: Docker, Kubernetes (EKS), Helm Configuration Management: Ansible CI/CD Pipelines: GitHub Actions Monitoring & Observability: Grafana, Prometheus Scripting/Automation: Python or Java What We’re Looking For Proven experience managing and scaling AWS cloud environments , ideally supporting live software products or high-traffic platforms. … Strong background in Terraform and Infrastructure as Code best practices. Practical experience with Kubernetes (EKS) in production. Familiarity with monitoring and observability tools such as Grafana and Prometheus. Hands-on experience building CI/CD pipelines (GitHub Actions, Jenkins, CircleCI, etc.). Solid scripting and automation experience using Python or Java . A collaborative engineer who enjoys working closely with More ❯
Monitor and optimise network performance across cloud and on-premise environments Troubleshoot and resolve connectivity issues quickly and effectively Automate network configuration using Terraform, PowerShell and Azure CLI Maintain observability using Azure Monitor, Log Analytics and Network Watcher Ensure deployments align with security and compliance standards Produce technical documentation and support knowledge sharing Required Experience: Strong hands-on experience with More ❯
City of London, London, United Kingdom Hybrid/Remote Options
ARC IT Recruitment Ltd
/MTTR via automation, clear SLAs, and robust RCAs/post-mortems. Safer, faster releases (blue/green, canary, feature flags) in partnership with Trading, Quant, and Engineering. Mature observability (logs/metrics/traces), capacity planning, and performance tuning for low-latency flows. Strong production hygiene and controls aligned to MiFID II/MAR/best-ex. Leadership of More ❯
their core software products. Expect a collaborative engineering culture, modern cloud-native stack, and plenty of freedom to influence tooling, architecture, and reliability practices. If youre passionate about automation, observability, and designing systems that just dont fail , this is the perfect environment for you. Tech Stack Cloud: AWS (EC2, RDS, S3, IAM, Lambda, CloudWatch) Containerisation & Orchestration: Docker, Kubernetes (EKS) Infrastructure … as Code: Terraform Configuration Management: Ansible Monitoring & Observability: Prometheus, Grafana, ELK Stack CI/CD: GitHub Actions Scripting & Automation: Python, Bash, or Go What Youll Be Doing Designing and maintaining reliable, scalable, and secure infrastructure for production systems. Automating operational tasks and improving system efficiency. Implementing observability tooling to monitor system health, performance, and capacity. Working closely with development teams … how reliability and performance are engineered at scale. Work with talented developers and DevOps engineers in a collaborative environment. AWS | Site Reliability | SRE | Cloud | Kubernetes | Terraform | CI/CD | Observability | Python | Go | Automation Click APPLY NOW to be considered for this position! Follow ReVybe IT Recruitment to stay up to date with the latest Cloud, Platform & SRE opportunities. More ❯
Central London, London, England, United Kingdom Hybrid/Remote Options
Addition
complex sales cycles and build C-level relationships Confident presenter with a consultative sales approach Experience working with nearshore/offshore delivery models is a plus Knowledge of AIOps, observability, or platform engineering is advantageous What’s in It for You: Be part of a global team of 6,000+ technologists, with autonomy to shape a key growth sector Hybrid More ❯
is a UK-based financial institution. Your new role You will be helping the client move to an AIOps environment. What you'll need to succeed Extensive experience in observability/SRE/platform engineering roles Strong experience with OpenTelemetry, Prometheus, Grafana, Splunk, Elastic etc Python, Go or Java programming Experience with Terraform, Helm or other IAC tools What you More ❯
london (city of london), south east england, united kingdom
Log my Care
the team. Solve challenging problems: Proactively identify root causes, implement durable solutions, and share learnings that help others solve problems more effectively. Be data-driven: Use product metrics and observability tools to guide decisions during discovery and development, and deliver work that demonstrably improves product or business metrics. Deliver at pace: Improve team velocity by unblocking others, streamlining workflows, and More ❯
with Azure. Ability to troubleshoot build failures, manage YAML pipeline configurations, support deployment processes across Azure environments, manage service connections, and collaborate with development teams on release automation. Monitoring & Observability - Proficient in implementing and managing Azure Monitor, Log Analytics workspaces, Application Insights, and Azure dashboards. Experience creating alert rules, action groups, workbooks, and analysing metrics and logs using KQL (Kusto … Query Language). Skilled in performance troubleshooting, implementing Azure Service Health monitoring, and setting up distributed tracing. Ideally, knowledge and experience of Datadog Observability tooling. Security & Compliance - Strong understanding of Azure security best practises including Azure Security Center/Microsoft Defender for Cloud, encryption using Azure Key Vault, network security with NSGs and Azure Firewall, Azure Policy for governance, and More ❯
Central London, London, United Kingdom Hybrid/Remote Options
Halian Technology Limited
A leading fintech company is seeking a Lead AppSec Engineer to join their established team. Youll be instrumental in embedding security into every stage of the software development lifecycleguiding engineers, shaping best practices, and driving secure, scalable solutions across our More ❯
to translate complex business requirements into data-driven solutions. Write production-grade SQL and ensure data quality through testing, documentation, and version control. Promote best practices around data reliability, observability, and maintainability. (Optional but valued) Contribute to Infrastructure as Code and CI/CD pipelines (e.g., Terraform, GitHub Actions). Skills & Experience 5+ years of experience in data-focused roles … other data visualisation tools. Familiarity with orchestration tools such as Airflow, Prefect, or Dagster. Understanding of CI/CD practices in data and analytics engineering. Knowledge of data governance, observability, and security best practices in cloud environments. More ❯
Monitoring and Observability Architect 3 months initially Location: Remote with travel to London as required Upto 750.00 Per Day - Inside IR35 via Umbrella A highly accomplished, UK based Monitoring and Observability Architect is required to work on the following deliverables - Key deliverables: Gain an understanding of the current state architecture Design the target state architecture Work with the product owner … to design an implementation roadmap and transitionary architectures Assist with technical inputs for development of M&O business case Experience: Deep experience in building enterprise monitoring and observability platforms Active knowledge on enterprise deployments of dynatrace, ServiceNow and Grafana Experience of highly regulated environments within Tier 1 insurance providers Understanding of how monitoring is used in ITIL processes If your More ❯
Monitoring and Observability Architect 3 months initially Location: Remote with travel to London as required Upto £750.00 Per Day - Inside IR35 via Umbrella A highly accomplished, UK based Monitoring and Observability Architect is required to work on the following deliverables - Key deliverables: Gain an understanding of the current state architecture Design the target state architecture Work with the product owner … to design an implementation roadmap and transitionary architectures Assist with technical inputs for development of M&O business case Experience: Deep experience in building enterprise monitoring and observability platforms Active knowledge on enterprise deployments of dynatrace, ServiceNow and Grafana Experience of highly regulated environments within Tier 1 insurance providers Understanding of how monitoring is used in ITIL processes If your More ❯