our enterprise messaging infrastructure, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, network optimization, and system observability using industry-standard monitoring tools. Required Skills & Qualifications: 3+ years of experience administering enterprise-grade messaging systems. Strong background in production support, preferably in a 24x7 enterprise environment. Experience working More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Become
and ability to collaborate across multidisciplinary teams Desirable Attributes Exposure to event-driven architectures and messaging systems (e.g., Kafka) Experience with Infrastructure as Code (e.g., Terraform, Ansible) Familiarity with observability tools and performance tuning Ability to mentor junior engineers and contribute to backend design leadership Prior consulting experience or experience in client-facing roles Engagement Model Outside IR35 12-month More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Become
a strong consulting mindset is highly desirable Desirable Attributes Experience with event-driven architectures and messaging systems (e.g., Kafka) Exposure to Infrastructure as Code (e.g., Terraform, Ansible) Familiarity with observability tools and performance tuning Ability to mentor junior engineers and lead backend design initiatives Engagement Model Outside IR35 12-month initial contract with potential for extension or permanent employment Hybrid More ❯
Milton Keynes, Buckinghamshire, England, United Kingdom
Noir
financial institution with soaring profits - my client is modernising platforms, embracing AI, and driving automation at scale. We're hiring a Lead Site Reliability Engineer (SRE) to drive reliability, observability, and performance across our Azure cloud infrastructure. You'll work in a modern engineering environment where we live by "you build it, you run it", focused on automation, scale, and More ❯
Bracknell, Berkshire, South East, United Kingdom Hybrid / WFH Options
Halian Technology Limited
in the team Contribute to solution architecture and strategic technical direction Build, integrate, and maintain REST APIs and backend services Champion best practices in software quality, CI/CD, observability, and DevOps Collaborate with cross-functional teams including Product, QA, and DevOps Optionally take on people management responsibilities for engineers Stay updated with emerging backend and cloud technologies Key Skills More ❯
South East London, London, United Kingdom Hybrid / WFH Options
TEN10 SOLUTIONS LIMITED
stakeholder management skills. Nice-to-Have: Hands-on experience with Databricks , Apache Spark , and Azure Deequ . Familiarity with Big Data tools and distributed data processing. Experience with data observability and data quality monitoring. Proficiency with CI/CD tools like Jenkins, Azure DevOps, or GitLab CI. Previous consultancy or client-facing experience. Additional languages like SQL, TypeScript, or Bash More ❯
years in platform/SRE/DevOps roles * Strong Kubernetes experience (config and deployment) * Deep CI/CD experience - Jenkins, GitLab CI/CD or similar * Skilled with infra observability tooling (Prometheus, Grafana, etc.) * Confident with Git and repo management workflows * Strong automation mindset - reducing manual intervention wherever possible * Cloud experience (AWS, Azure or GCP) * Must be a sole UK More ❯
architectures across Azure, AWS, and Google Cloud Leading platform engineering squads using DevSecOps, Kubernetes, and automation tooling Enabling edge and private cloud capabilities (e.g., Azure Stack, AWS Outposts) Implementing observability and governance tooling to support modern operations Supporting Agile and product-based delivery using SRE, CI/CD, and Infrastructure as Code Advising clients on architecture optimisation, security, cost control More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Become
collaboration skills across multidisciplinary teams Desirable Attributes Exposure to microservices architecture and event-driven systems (e.g., Kafka) Experience with design systems and component libraries (e.g., Material, Storybook) Familiarity with observability tools and performance tuning Prior consulting experience or experience in client-facing roles Engagement Model Outside IR35 12-month initial contract with potential for extension or permanent employment Hybrid working More ❯
Maidenhead, Berkshire, United Kingdom Hybrid / WFH Options
dynaTrace software GmbH
Docker, Kubernetes etc. Ideal candidates will have 2+ years of Dynatrace Technology experience Dynatrace Product Certification. Why you will love being a Dynatracer Dynatrace is a leader in unified observability and security. We provide a culture of excellence with competitive compensation packages designed to recognize and reward performance. Our employees work with the largest cloud providers, including AWS, Microsoft, and More ❯
GPS). Our teams operate across the UK, Germany, France, and India, delivering complex, enterprise-grade IT solutions and consultancy across infrastructure, cloud, and modern operations. As a Monitoring & Observability Engineer, you'll work in high-impact delivery teams that support some of the worlds most well-known organisations. Youll play a key role in helping our customers achieve greater … visibility, performance, and reliability across their IT estatescontributing to their operational success through proactive insight and incident prevention. What you'll do Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues Integrate monitoring platforms … with ITSM tools (e.g. ServiceNow) and CI/CD pipelines to enable proactive alerting and resolution workflows Act as a Monitoring & Observability SME within customer delivery teams Support incident response activities and postmortems by identifying patterns, root causes, and optimisation opportunities Work collaboratively with cross-functional teams to define and implement best practices in observability and monitoring Attend customer and More ❯
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
of IaC principles and tools such as Terraform and Pulumi You have experience of building and improving CI/CD pipelines for product teams You have experience with cloud observability (logging, tracing, metrics, monitoring and alerting) You have experience with Containerisation - Azure Container Apps preferred You have strong scripting skills with PowerShell and/or C# .Net coding You enjoy More ❯
teams to execute effectively. DataOps Enablement and Optimization: Drive the adoption of modern DataOps principles to streamline engineering workflows. Partner with platform teams to establish CI/CD pipelines, observability standards that improve operational efficiency, reliability, and speed across data pipelines. Data Governance and Quality Assurance: Embed governance, security, and data quality practices into engineering workflows. Define guardrails and reference More ❯
south west london, south east england, united kingdom
Mars
teams to execute effectively. DataOps Enablement and Optimization: Drive the adoption of modern DataOps principles to streamline engineering workflows. Partner with platform teams to establish CI/CD pipelines, observability standards that improve operational efficiency, reliability, and speed across data pipelines. Data Governance and Quality Assurance: Embed governance, security, and data quality practices into engineering workflows. Define guardrails and reference More ❯
in Uxbridge, in leafy West London. But if commuting isn't for you, most of our roles can be hybrid or remote, or anywhere in between. The must haves Observability, "you build it you run it" attitude Mentoring, good communication, giving and receiving feedback A commitment to make ethical and responsible decisions that support giffgaff's impact on people, planet More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Become
native infrastructure (AWS, Azure, or GCP) using Infrastructure as Code (IaC) tools Collaborate with engineering teams to automate deployment, monitoring, and scaling of applications Ensure platform security, compliance, and observability through DevSecOps best practices Participate in architecture reviews, sprint planning, and incident response processes Contribute to the development of reusable DevOps assets and CoE engineering standards Support the integration of … DevOps) Proficiency in cloud platforms (AWS, Azure, or GCP) and containerisation (e.g., Docker, Kubernetes) Experience with Infrastructure as Code tools (e.g., Terraform, Ansible, CloudFormation) Familiarity with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK, Datadog) Experience working in regulated environments such as banking, fintech, or insurance Prior experience working in or contributing to a Centre of Excellence team Strong More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Pontoon
support) What You Bring: Strong Java (streams, lambdas, concurrency) and front-end skills with React.js Deep knowledge of multithreaded, distributed systems and asynchronous architecture Experience with JVM tuning and observability tools (Prometheus, Elastic, etc.) TDD, CI/CD, and agile delivery experience Ability to deliver from design to deployment Bonus Points: Experience in Front Office, Risk, or Pricing within investment More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Method Resourcing
teams to operationalize models and ship ML-powered features into production. Continuously assess and iterate on production models, balancing long-term ML strategy with tactical improvements. Champion code quality, observability, and resilience within their ML systems through reviews and hands-on contributions. Help shape their internal ML standards and practices, ensuring they stay ahead of industry advancements. Offer technical mentorship More ❯
Guildford, Surrey, United Kingdom Hybrid / WFH Options
Electronic Arts
Source control management tools (e.g. Perforce, Git) Configuration management tools (e.g. Chef, Ansible, Terraform, Packer) Secrets management tools (e.g Vault) Virtualization environments and tools (e.g. VMs, vSphere) Data and Observability tools (e.g. Splunk, Grafana, New Relic, Open Telemetry) Growth-oriented mindset About Electronic Arts We're proud to have an extensive portfolio of games and experiences, locations around the world More ❯
Manual Tester (DV Security Clearance) Position Description Are you an experienced Test Analyst with a background in secure or classified programmes, ready to contribute to projects of national importance? Step into a role where you'll challenge the complex to More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
WüNDER TALENT
with third-party APIs to support real-time marketing insights. Collaborate closely with cross-functional teams including Data Science, Software Engineering and Product. Champion best practices in data governance, observability and compliance. Contribute to CI/CD pipeline development and infrastructure automation (Terraform, AWS DevOps). Provide input into technical decisions, peer reviews and solution design. Requirements Proven experience as More ❯
will involve designing robust software solutions that enhance system performance while ensuring high availability for critical applications. You will work hand-in-hand with product engineering teams to improve observability tools and telemetry systems, driving forward automation initiatives that reduce manual intervention. By participating in incident management processes-facilitating transparent communication with stakeholders and leading blameless post-mortems-you will … a focus on automating these activities wherever possible.* Provide on-call support during production incidents outside standard working hours as required by the business needs.* Contribute to enhancing product observability and telemetry by supporting ongoing modernisation efforts within the infrastructure.* Collaborate closely with engineering teams to brainstorm ideas that simplify infrastructure management and streamline SRE practices. What you bring: * Proficiency More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Morgan Hunt Recruitment
Postgres) Implement OCR, NLP, and ML for document analysis and automation risk assessment Lead R&D spikes and validate system improvements through robust data analysis Ensure code quality, testing, observability, and non-functional compliance (security, UX, performance) Coach team members and contribute to Agile delivery practices Essential Skills Strong commercial experience with Python, TypeScript, SpaCy, and AWS (serverless) Background in More ❯
while exhibiting strong interpersonal skills. Ability to support multiple time zones to work with regional stakeholders Why you will love being a Dynatracer Dynatrace is a leader in unified observability and security. We provide a culture of excellence with competitive compensation packages designed to recognize and reward performance. Our employees work with the largest cloud providers, including AWS, Microsoft, and More ❯