Observability Jobs in London

26 to 50 of 361 Observability Jobs in London

Senior Software Engineer

London, United Kingdom
Archa
FX or crypto trading; front-end experience with React or similar frameworks is a plus. Collaborate with the team to implement, configure, and manage comprehensive monitoring, logging, alerting, and observability solutions - advocating for security best practices. Deploy, manage, operate, and scale applications and services on AWS - whilst troubleshooting performance issues across the stack. Collaborative, agile approach, passionate about clean architecture …
Employment Type: Permanent
Salary: GBP Annual

AWS Engineer

London
Hybrid / WFH Options
BAE Systems
or DevOps. Expertise in microservices and API design. Docker and container runtime platforms such as Kubernetes, EKS, ECS, etc. Strong understanding of operational concepts on AWS, particularly monitoring and observability, FinOps. Utilising CI/CD tools, such as Bamboo, Jenkins, TeamCity, Bitbucket, in order to streamline delivery of new features and fixes. Continual testing of code using automated testing frameworks …
Employment Type: Permanent

Senior Software Engineer (London)

London, UK
Archa
FX or crypto trading; front-end experience with React or similar frameworks is a plus. Collaborate with the team to implement, configure, and manage comprehensive monitoring, logging, alerting, and observability solutions - advocating for security best practices. Deploy, manage, operate, and scale applications and services on AWS - whilst troubleshooting performance issues across the stack. Collaborative, agile approach, passionate about clean architecture …
Employment Type: Full-time

Cloud Observability Engineer

Tower Hamlets, London, United Kingdom
Barclays Bank PLC
Join us as a Cloud Observability Engineer at Barclays, where you will lead our enterprise observability strategy across multi-cloud environments. This senior role combines technical leadership with team management, driving operational excellence while architecting resilient solutions and mentoring high-performing teams. To be successful as a Cloud Observability Engineer, you should have experience with: the ability to lead and … scale technical teams in multi-faceted governance environments; AWS/Azure cloud platforms and enterprise observability tools (Elastic, Grafana, Splunk, Datadog, or similar); SRE/DevOps methodologies with Python proficiency for automation and infrastructure-as-code practices. Some other highly valued skills may include: AWS or Azure cloud certifications; experience implementing AI-driven observability and AIOps solutions; background in large …
Employment Type: Permanent
Salary: GBP Annual

Principal DevOps Engineer (London)

London, UK
TP ICAP
testers and operations to automate builds, deployment and release of applications running in the cloud and on-premise. Provide guidance on industry best practices for software deployment, development, and observability. Engineer tooling to implement those practices. Assist and architect where appropriate solutions using containerisation and serverless technologies. Drive automation for environment management, logging and monitoring. Engage with vendors and service … stack CI/CD, GitLab, Jenkins, Sonatype Nexus. Knowledge and working experience of containerising application components, including writing Dockerfiles and deploying to Kubernetes. Deep understanding of pipelines as code. Observability concepts and tooling: OpenSearch, Cribl, Grafana, Prometheus, CloudWatch.
Employment Type: Full-time

Principal Platform Engineer

London, United Kingdom
Institutional Shareholder Services Inc
Stoxx's GCP platform infrastructure. Ensure the platform's scalability, reliability, and efficiency meet business and client requirements. Develop, build and support a robust CI/CD pipeline and observability stack. Be the go-to person for the most critical Platform issues, leading cross-functional teams where necessary, to deliver best-in-class engineering solutions. Drive continuous improvement initiatives to … Experience working in a global or multinational team setting. Strong documentation, communication and collaboration skills. Proven ability to drive innovation and continuous improvement initiatives. Focus on simplicity, automation and observability. Expertise in Python, GitHub Actions, Apigee, Airflow. Expertise in observability tooling such as Prometheus/Grafana, ELK, Splunk or similar. Bachelor's or Master's degree in Computer Science or …
Employment Type: Permanent
Salary: GBP Annual

Messaging Administrator - Solace

East London, London, United Kingdom
Marlin Selection
For: 3+ years hands-on experience with Solace PubSub+ in a production environment. Strong knowledge of WAN-based distributed systems and networking fundamentals. Experience with Prometheus and Grafana for observability and alerting. Confident in Linux/Unix systems and scripting (Bash, Python, etc.). Excellent problem-solving instincts and attention to detail. Strong communicator who works well across technical teams. Bonus …
Employment Type: Permanent

Senior Software Engineer (London)

London, UK
Hybrid / WFH Options
Humanitec
documents, to implementing clean solutions that address them. Hands-on experience with infrastructure: whether you’ve been part of an on-call rotation or just working day-to-day with observability tools, you are comfortable rolling up your sleeves to understand the factors at play in an incident or service degradation. Pragmatism: You are comfortable balancing “perfect” with “good enough,” as …
Employment Type: Full-time

Cloud Platform Engineer (DV Security Clearance)

London
CGI
a bias for Infrastructure (Python, Go, C#) • IAM Policy and Authentication/Authorization schemes • Web Services and REST API • Databases and Storage Systems • Development Build, Test, and Deployment Pipelines • Observability and Monitoring (OpenTelemetry, TIG and ELK stacks). Together, as owners, let's turn meaningful insights into action. Life at CGI is rooted in ownership, teamwork, respect and …
Employment Type: Permanent

Senior Director - Operations and Reliability Engineering

London, United Kingdom
The Boston Consulting Group GmbH
Locations: Canary Wharf, Boston. Who We Are: Boston Consulting Group partners with leaders in business and society to tackle their most important challenges and capture their greatest opportunities. BCG was the pioneer in business strategy when it was founded in …
Employment Type: Permanent
Salary: GBP Annual

AWS Cloud Engineer

London, United Kingdom
The Portfolio Group
our production systems. Key Responsibilities: Design, implement, and manage AWS cloud infrastructure. Develop and maintain automation scripts and tooling. Support production systems and ensure high availability and performance. Implement observability and monitoring solutions. Collaborate closely with the PBS (Platform/Backend Services) team. Contribute to infrastructure as code (IaC) and DevOps best practices. Requirements: Hands-on experience with AWS. Automation … experience (e.g., Terraform, Ansible, CI/CD tools). Strong understanding of infrastructure and cloud architecture. Experience supporting production environments. Familiarity with observability tools (e.g., Prometheus, Grafana, CloudWatch). Excellent problem-solving and communication skills. Desirable: Experience working in a fast-paced or agile development environment. Familiarity with container technologies (e.g., Docker, Kubernetes). Previous experience in a similar role …
Employment Type: Permanent
Salary: £70,000 per annum

AWS Cloud Engineer

London, South East, England, United Kingdom
The Portfolio Group
our production systems. Key Responsibilities: Design, implement, and manage AWS cloud infrastructure. Develop and maintain automation scripts and tooling. Support production systems and ensure high availability and performance. Implement observability and monitoring solutions. Collaborate closely with the PBS (Platform/Backend Services) team. Contribute to infrastructure as code (IaC) and DevOps best practices. Requirements: Hands-on experience with AWS. Automation … experience (e.g., Terraform, Ansible, CI/CD tools). Strong understanding of infrastructure and cloud architecture. Experience supporting production environments. Familiarity with observability tools (e.g., Prometheus, Grafana, CloudWatch). Excellent problem-solving and communication skills. Desirable: Experience working in a fast-paced or agile development environment. Familiarity with container technologies (e.g., Docker, Kubernetes). Previous experience in a similar role …
Employment Type: Full-time
Salary: £70,000 per annum

Head of Platform Engineering (Relocate To Bangkok) (London)

London, UK
Manatal
both strategic vision and the ability to dive deep into technical challenges. Responsibilities. Lead and Manage the Platform Engineering Initiatives: Define and execute the technical roadmap for platform infrastructure, observability, and developer experience. Drive DevOps, SRE, and Infrastructure initiatives to ensure platform reliability and performance. Foster a culture of automation, observability, and continuous improvement. Architect and Implement Scalable Solutions: Design … optimal performance and scalability across all regions. Own Platform Reliability and Operations: Define and maintain SLOs/SLIs/SLAs for critical platform services. Implement comprehensive monitoring, alerting, and observability solutions. Design and maintain disaster recovery and business continuity plans. Lead incident response and post-mortem processes. Optimize Platform Performance and Costs: Implement strategies to optimize infrastructure costs without compromising … in solving complex technical issues. Contribute to codebases as needed to drive projects forward. Requirements. Technical Expertise: Proven experience managing Kubernetes clusters and expertise in container orchestration. Experience with observability tools (e.g., Datadog, Prometheus, Grafana). Experience with Infrastructure as Code (IaC) tools like Terraform or CloudFormation. Experience in database optimization and management (especially for multi-tenant architectures). Extensive knowledge of …
Employment Type: Full-time

Lead Backend Engineer

London, United Kingdom
Hybrid / WFH Options
Fruition Group
the evolution of our platform's microservices ecosystem. What You'll Do: Architect, build, and maintain scalable Python microservices deployed in cloud environments. Lead architectural decisions focusing on performance, observability, fault tolerance, and scalability. Own complex backend features end-to-end: design, implement, test, deploy, and monitor. Mentor and guide engineers through code reviews, design discussions, and best practices. Collaborate …
Employment Type: Permanent, Work From Home

Solutions Architect [UAE Based] (London)

Surbiton, Greater London, UK
ZipRecruiter
multi-tenant SaaS or large enterprise application. Certifications: AWS Certified Solutions Architect, Google Professional Cloud Architect, Azure Solutions Architect Expert. Experience in data architecture, AI/ML integration, and observability frameworks.
Employment Type: Full-time

Site Reliability Engineer

City of London, London, England, United Kingdom
Certain Advantage
execution of disaster recovery tests and seek to automate these activities where possible. Covering on-call schedule when Production support is required outside of working hours. Participate in enhancing product observability and telemetry, support modernization. Brainstorm ideas to simplify and streamline infrastructure by closely working with infrastructure and SRE teams. Required qualifications, capabilities and skills: Knowledge of Python/Unix Shell …
Employment Type: Temporary
Salary: Salary negotiable

Principal Solutions Architect

London, United Kingdom
Hybrid / WFH Options
Parser Limited
Architectures (Kafka). Collaborate with DevOps teams to implement CI/CD pipelines and infrastructure as code using tools like Terraform, CloudFormation, and Ansible. Implement and manage monitoring and observability tools such as Datadog. Ensure real-time logging, alerting, and troubleshooting capabilities. Collaboration & Stakeholder Management: Work closely with business units, developers, and IT teams to understand requirements and translate them …
Employment Type: Permanent
Salary: GBP Annual

Senior Technical Delivery Manager (London)

London, UK
ZipRecruiter
optimize data flow, connectivity, and interoperability. Help to implement best practices and process improvements to enhance delivery efficiency and team performance. Work with various internal teams to continuously improve observability and supportability capabilities of the company platform. Key Requirements: A highly motivated, technical and detail-oriented support engineer, able to work autonomously with minimal direction, passionate about learning new things …
Employment Type: Full-time

Senior Technical Delivery Manager (London)

London, UK
Ownera
optimize data flow, connectivity, and interoperability. Help to implement best practices and process improvements to enhance delivery efficiency and team performance. Work with various internal teams to continuously improve observability and supportability capabilities of the company platform. Key Requirements: A highly motivated, technical and detail-oriented support engineer, able to work autonomously with minimal direction, passionate about learning new things …
Employment Type: Full-time

Senior Network Architect

London, United Kingdom
London Stock Exchange Group
focus on goals and the ability to balance multiple priorities in a fast-paced environment. DESIRED SKILLS AND EXPERIENCE: Real-time and low latency market data experience. Service orchestration, observability and monitoring platform experience. Solid understanding of a programming language (preferably Python). Agile tools (Jira, Git among other DevOps principles). LSEG is a leading global financial markets infrastructure and data …
Employment Type: Permanent
Salary: GBP Annual

Senior Software Engineer (London)

London, UK
Visa
on AWS are key to our next phase of growth, are written to 12-factor principles and fit into our microservices architecture. Cloud-related tools, services, and distributed system observability to support these applications, such as Docker, Kubernetes, Elasticsearch, log management systems, and Datadog APM, to name but a few. API specifications, conforming to the OpenAPI (Swagger) standard, provide a …
Employment Type: Full-time

Senior Network Architect (London)

London, UK
London Stock Exchange Group
focus on goals and the ability to balance multiple priorities in a fast-paced environment. DESIRED SKILLS AND EXPERIENCE: Real-time and low latency market data experience. Service orchestration, observability and monitoring platform experience. Solid understanding of a programming language (preferably Python). Agile tools (Jira, Git among other DevOps principles). LSEG is a leading global financial markets infrastructure and data …
Employment Type: Full-time

Director of Software Engineering (London)

London, UK
TRG Screen
US, and India. Advanced experience with AWS, Azure, or GCP and large-scale legacy-to-cloud migration programs. Proven record implementing DevOps/CloudOps practices, including IaC, automation, and observability. Hands-on experience with AI code-generation tools (e.g. GitHub Copilot, Cursor.io, Windsurf, Devin). Exceptional communication and stakeholder management skills, translating technical strategy into measurable business impact. Able to work …
Employment Type: Full-time

DV Cleared Site Reliability / DevOps Engineer

South West London, London, United Kingdom
JAM Recruitment Ltd
Site Reliability/DevOps Engineer. London - 5 Days Onsite. Up to £550 per day (Umbrella, Inside IR35). 12-Month Contract. Must hold live and transferable DV Clearance. Are you passionate about reliability, automation, and supporting mission-critical systems? Join this …
Employment Type: Contract
Rate: £500 - £550 per day + Umbrella, inside IR35

DV Cleared Site Reliability / DevOps Engineer

London, United Kingdom
JAM Recruitment
Site Reliability/DevOps Engineer. London - 5 Days Onsite. Up to £550 per day (Umbrella, Inside IR35). 12-Month Contract. Must hold live and transferable DV Clearance. Are you passionate about reliability, automation, and supporting mission-critical systems? Join this …
Employment Type: Permanent
Salary: GBP Annual

Observability (London)
10th Percentile: £65,000
25th Percentile: £73,125
Median: £82,500
75th Percentile: £108,125
90th Percentile: £120,000