201 to 225 of 266 Observability Jobs in London

Enterprise AI Architect (Industry-Focused)

Hiring Organisation
Stackstudio Digital Ltd
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
From £700 to £800 per day
Retail, CPG, Travel-Transportation-Hospitality (TTH), Life Sciences Health Care, Manufacturing and Education sectors. Key Outcomes - Stand up secure AI foundation stack with observability and guardrails. - Deliver rapid MVPs in priority industry domains. - Establish AI operating model & governance. - Upskill client & internal teams with AI fluency programs. Responsibilities - Lead C-suite ...

Platform Product Manager

Hiring Organisation
Heart Mind Talent
Location
London Area, United Kingdom
product planning and prioritisation for a small engineering squad (3–5 engineers) Translate operational and product needs into scalable platform capabilities Balance platform investments (observability, modularity, tech debt) with product delivery Prototype ideas and analyse data using AI-assisted tools and workflows What they’re looking for This role ...

Technical Architect Principal (UK)

Hiring Organisation
Stackstudio Digital Ltd
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
From £500 to £550 per day
days/week Contract Duration - 6 months The Role The Technical Architect Principal will lead the architecture, design, and technical governance of an enterprise observability and telemetry platform. This role is responsible for designing major solution components, defining reference architectures, and guiding development teams through POCs to production-scale implementations. ...

Principal Data Scientist

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
optimisation, elasticity modelling and experimentation Designing and evaluating A/B tests, bandits and quasi-experiments Building and deploying production‐grade models with strong observability Setting practical standards for modelling, experimentation and DS workflows Mentoring senior and mid‐level data scientists through review and example Advising stakeholders on trade‐offs ...

Regional Sales Manager

Hiring Organisation
Virtana Corp
Location
London, England, United Kingdom
/Y meeting and exceeding their sales plans. A strong background in storage, ITOM, APM SaaS sales and On-premises or SaaS Observability platforms. Further, strong financial acumen and skill developing new sales opportunities within a "green" field territory as well as expanding sales within existing accounts. Additionally, candidates will ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
South London, UK
Employment Type
Full-time
performant infrastructure that underpins critical public-sector services. You’ll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. You’ll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Engineering Manager

Hiring Organisation
La Fosse
Location
London Area, United Kingdom
understanding of CI/CD pipelines, including build automation, testing, and deployment Familiarity with modern engineering practices: automated testing, infrastructure as code, monitoring, and observability Technology Stack Backend development across modern JVM frameworks including Spring, Spring Boot, and Micronaut, primarily using Java Cloud-native services deployed on Azure, with orchestration … Kubernetes and system monitoring/observability using tools such as Dynatrace Data persistence and storage using a mix of relational and NoSQL technologies, including SQL Server and MongoDB Frontend applications built with contemporary JavaScript frameworks and languages such as React, Next.js, Angular, and TypeScript In-memory data grids and caching ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£55,000 - £65,000 per annum
performant infrastructure that underpins critical public-sector services. You’ll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. You’ll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Azure Engineering Manager - Fully Remote

Hiring Organisation
GBV Ltd
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
engineers. What you’ll be doing: Leading a distributed engineering team focused on platform reliability and scalability Driving SRE best practices (SLOs, automation, observability, incident management) Partnering with product, security, and engineering teams to shape infrastructure strategy Improving CI/CD, developer experience, and system performance Championing a culture of continuous … teams Deep Azure expertise (Terraform/IaC preferred) Background in software engineering (C#, Java, Python, or Ruby) Experience with Kubernetes, CI/CD, and observability tooling Passion for automation, reliability, and scalable systems Package highlights: Salary around £110-130k Private healthcare, pension, and strong benefits Clear progression and development ...

Platform Engineering Manager

Hiring Organisation
Prism Digital
Location
London Area, United Kingdom
cloud environments Architecture governance and design authority Security-by-design and Zero Trust Terraform or Bicep (production IaC) CI/CD and infrastructure automation Observability (SLOs, monitoring, incident management) Disaster recovery and resilience planning Vendor and third-party management Strong stakeholder communication What You’ll Work With Azure (landing zones … shared services) Terraform/Bicep CI/CD pipelines Kubernetes (AKS) Observability tooling (logs, metrics, tracing) Networking (VNets, ExpressRoute, private endpoints) Security controls and compliance frameworks Event Hubs, Service Bus, API Management Hybrid Windows/Linux infrastructure Nice to Haves FinOps (cost control, budgeting, optimisation) Financial services or regulated environments ...

Typescript Developer

Hiring Organisation
Get2Talent
Location
South East London, London, United Kingdom
Employment Type
Permanent
Salary
£85,000
performance trading platform. Work primarily with: TypeScript (Node.js & React) Monorepo tooling, GitHub, GitHub Actions Jest, Playwright Redis, MS SQL, WebSockets Docker, Kubernetes Observability tools (Grafana, Prometheus, SonarQube) Take end-to-end ownership of features from design to production. Collaborate closely with platform and DevOps engineers on build pipelines … observability, and operational concerns. Communicate directly with clients to clarify requirements and propose solutions. Contribute to and improve automated testing practices. Participate in peer code reviews and maintain high engineering standards. Leverage LLM/AI-enabled development tools as part of day-to-day development. Requirements 8+ years of professional ...

Lead React Developer

Hiring Organisation
Get2Talent
Location
South East London, London, United Kingdom
Employment Type
Permanent
Salary
£85,000
performance trading platform. Work primarily with: TypeScript (Node.js & React) Monorepo tooling, GitHub, GitHub Actions Jest, Playwright Redis, MS SQL, WebSockets Docker, Kubernetes Observability tools (Grafana, Prometheus, SonarQube) Take end-to-end ownership of features from design to production. Collaborate closely with platform and DevOps engineers on build pipelines … observability, and operational concerns. Communicate directly with clients to clarify requirements and propose solutions. Contribute to and improve automated testing practices. Participate in peer code reviews and maintain high engineering standards. Leverage LLM/AI-enabled development tools as part of day-to-day development. Requirements 8+ years of professional ...

Senior Lead Engineer

Hiring Organisation
Investigo
Location
City of London, London, United Kingdom
change management tools like Liquibase into automated pipelines Apply DevSecOps best practices across the lifecycle: static analysis, dependency scanning, and secure credential management Ensure observability, monitoring, and performance using GCP Operations Suite or New Relic Mentor engineers and collaborate across global, distributed teams What We’re Looking For Proven experience … expertise : BigQuery, Dataproc, Cloud Composer Deep data architecture and engineering knowledge : Spark, DBT, Oracle, BigQuery Experience designing scalable architectures (Microservices, Monoliths, Batch) Skilled in observability, monitoring, and DevSecOps integration Excellent communication with a record of collaborating globally Why You’ll Love It Combine architecture, coding, and leadership in one role ...

AWS Senior Platform Engineer

Hiring Organisation
SR2 | Socially Responsible Recruitment | Certified B Corporation™
Location
London, UK
platforms Writing and managing infrastructure as code (Terraform) Building and improving CI/CD pipelines Working with containerised workloads (Kubernetes/EKS) Implementing observability (monitoring, logging, alerting) Collaborating closely with product teams and stakeholders Contributing to platform best practices, reliability, and scalability What We’re Looking For Strong AWS platform …/DevOps engineering experience Solid experience with: Terraform (essential) CI/CD pipelines Kubernetes/container platforms Observability tooling Experience working in consulting or client-facing environments Strong communicator – able to influence and explain decisions clearly Comfortable pushing back constructively with stakeholders Experience working on real production platforms/products ...

Senior / Lead Data Engineer (AI-Focused)

Hiring Organisation
PaymentGenes
Location
City of London, London, United Kingdom
inference (batch and real-time) Evaluate and integrate emerging AI tooling where strategically valuable 🔧 Technical Leadership Set best practices for testing, documentation, lineage, and observability Lead code reviews and mentor data & analytics engineers Drive CI/CD and infrastructure-as-code adoption Own platform reliability, performance optimisation, and cost efficiency … Infrastructure Feature engineering architecture ML pipeline and deployment workflows Experience supporting production ML systems Familiarity with embeddings, vector databases, LLM orchestration (desirable) Data observability and model monitoring Platform & DevOps CI/CD for data workflows Git-based engineering standards Docker/containerisation Infrastructure-as-code (e.g., Terraform) Monitoring and alerting ...

AWS Devops Engineer

Location
London, United Kingdom
pipelines, and cloud infrastructure delivery, ensuring compliance with DDaT standards and best practices. You will provide technical leadership, mentor junior engineers, and embed reliability, observability, and security across platforms. This is a hybrid role with travel to the client required on a frequent basis. SC clearance will be required … residency to be eligible). In this role, you will: Implement advanced monitoring, logging, and alerting solutions to guarantee system reliability and performance. Enhance observability for proactive issue detection. Work closely with developers, testers, and product teams to deliver robust solutions. Mentor junior engineers and promote best practices in DevOps ...

Technical Architect Principal (UK)

Hiring Organisation
Stackstudio Digital Ltd
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
From £500 to £550 per day
days/week Contract Duration - 6 months The Role The Technical Architect Principal will lead the architecture, design, and technical governance of an enterprise observability and telemetry platform. This role is responsible for designing major solution components, defining reference architectures, and guiding development teams through ...

Hybrid Domain Consolidation Analyst | IT Infrastructure

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
leading observability solutions provider in Greater London is seeking a Domain Consolidation Analyst for a 6-month full-time contract with hybrid work. The role involves managing a project to consolidate IT domains, coordinating with third parties, and ensuring compliance with ISO 27001 standards. Candidates must have at least ...

Partner Manager

Hiring Organisation
Timebeat
Location
London Area, United Kingdom
written communication and CRM discipline Nice to have Experience with channel models (reseller, referral, MSP, SI), co-sell motions, or marketplace partnerships Familiarity with observability/monitoring, networking, infrastructure tooling, or developer-facing products Experience building partner programs from scratch (tiering, enablement, certification, MDF) Success metrics (examples) Number ...

Lead Platform Engineer

Hiring Organisation
Revybe IT Recruitment Ltd
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£100,000 - £110,000 per annum
deeply hands-on with modern infrastructure tooling. The company builds all its software in-house and has been investing heavily in its platform, observability, and cloud capabilities as they continue to scale. The Opportunity: You’ll join as the Lead Platform Engineer, working closely with engineering leadership to drive … currently operates in a hybrid environment: ~60% on-premise infrastructure ~40% Microsoft Azure The long-term strategy is focused on modernising the platform, improving observability, and evolving cloud capabilities, making this an excellent opportunity for someone who enjoys building and shaping systems. Tech Stack: You’ll be working across ...

Site Reliability Engineering Lead – Financial Services

Hiring Organisation
Alexander Ash Consulting
Location
London Area, United Kingdom
operations, and improvement of the SRE platforms, teams, and organisation. You will be responsible for leading and scaling the SRE function, driving intelligent automation, observability, and resilience, across the organisation, and leading on production incidents, from frameworks to resolution. You will work in a hybrid on-premise/AWS-based … related fields (platform engineering, DevOps etc.) Deep technical experience in cloud-native AWS and on-premise systems architecture Strong incident management and observability experience for large scale systems Intelligent automation/Agentic AI experience preferred Excellent AWS services, data platforms, software engineering, CI/CD, IaC, experience Degree educated ...

AWS Site Reliability Engineer (Data Platform)

Hiring Organisation
FBI &TMT
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£450 - £455 per day
cloud-native data platform built on AWS, Snowflake, and Databricks. This role focuses on enhancing reliability through automation, disaster recovery testing, resiliency engineering, observability, and proactive SLO/SLI/SLA management. Key Responsibilities: Design, build, and maintain automation for infrastructure provisioning, platform operations, and incident response using … manage SLIs, SLOs, and SLAs for critical data pipelines and platform services; utilise error budgets to guide reliability improvements. Build and operate robust observability solutions (metrics, logs, traces, alerts) for AWS services, Snowflake, and Databricks workloads. Partner with data engineering and platform teams to embed reliability-by-design into architecture ...

AI Engineer – Production LLM Systems

Hiring Organisation
Redimeer
Location
London Area, United Kingdom
orchestration. You will work on: Multi‐agent architectures Intelligent tool and API integrations RAG pipelines and vector‐based retrieval Evaluation frameworks and AI observability Production workflows that ensure reliability, consistency, and scale You’ll play a critical role in crafting the orchestration layer that makes LLM systems trustworthy—handling … improving robustness across diverse use cases. Key Responsibilities Build production AI systems using LLMs, RAG pipelines, vector databases, and agentic frameworks Design evaluation and observability frameworks to measure performance, accuracy, and reliability Develop clean, scalable applications with proper error handling, APIs, and data pipelines Implement and maintain retrieval systems (vector ...

Observability Specialist

Hiring Organisation
Pontoon
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£700 - £750/day
Observability Specialist (Contract) Duration: 6 Months (Possibility for extension) Location: London, Birmingham, Edinburgh or Leeds/Hybrid (2 days per week on site) Rate: A highly competitive Umbrella Day Rate is available for suitable candidates Role Overview As an Observability Specialist, you will work closely with our Enterprise Monitoring & Alerting … coverage of these assets. Identify and recommend enhancements to monitoring configurations and capabilities across critical applications. Review and refine roles and responsibilities related to observability, emphasizing operational resilience. Develop automatically maintained end-to-end business flows for key processes within the Dynatrace toolset. Ensure optimal and purpose-fit alerting configurations ...

Head of Infrastructure

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
platform and infrastructure strategy Design and evolve cloud architecture to support scale, resilience, and performance Set standards for infrastructure, CI/CD, environments, and observability Make architectural decisions and trade‐offs Developer Experience (DevEx) Provide infrastructure for the development team to code, test and deploy efficiently Advise during design sessions … growing company Ability to operate production systems under pressure Deep hands‐on experience with the AWS cloud platform Strong background in reliability, observability, and incident management Experience leading or mentoring engineers What we offer in return 💰 Competitive salary depending on experience 🏝️ 27 days of annual leave (including 3 days Christmas ...