276 to 296 of 296 Observability Jobs in London

Staff Backend Engineer (Python | AI Lab | £170,000)

Hiring Organisation
Paradigm Talent
Location
City of London, London, United Kingdom
Role: Staff Software Engineer (Python | Backend | Infrastructure) Location: Hybrid - 2-3 days in London Office Compensation: Up to £170,000 + equity We’re working with a frontier AI lab pushing the boundaries of computational ...

Back End Integration Engineer

Hiring Organisation
VIQU IT Recruitment
Location
City of London, London, United Kingdom
Employment Type
Contract
Contract Rate
£400 - 500 per day + Inside IR35
Back End Integration Engineer – Hybrid – Inside IR35 We are seeking a Back End Integration Engineer to support a major modernisation programme, delivering secure, scalable and reliable integrations across the organisation. This role focuses on improving ...

Back End Integration Engineer

Hiring Organisation
VIQU IT Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£400 - £500 per day
Back End Integration Engineer – Hybrid – Inside IR35 We are seeking a Back End Integration Engineer to support a major modernisation programme, delivering secure, scalable and reliable integrations across the organisation. This role focuses on improving ...

Back End Integration Engineer

Hiring Organisation
VIQU IT
Location
London, Broad Street, United Kingdom
Employment Type
Contract
Contract Rate
£400 - £500/day Inside IR35
Back End Integration Engineer – Hybrid – Inside IR35 We are seeking a Back End Integration Engineer to support a major modernisation programme, delivering secure, scalable and reliable integrations across the organisation. This role focuses on improving ...

Site Reliability Program Manager

Hiring Organisation
HCLTech
Location
London Area, United Kingdom
experience, ideally managing complex cross-functional or globally distributed teams. Must have hands-on experience with packet captures analysis through tools like Observer and Observability development using Splunk, ELF & Grafana. Must have domain experience in Payment Card real time transaction processing, clearing & settlement, dispute and fraud management. Must work from … office minimum 4 days a week and be flexible for 5 days if necessary. Experience with PaaS/SaaS, cloud environments, distributed systems, observability tooling, on-call/incident management tools. Data-driven mindset: comfortable analysing metrics, generating reports, and driving improvements based on data. Familiarity with SRE principles — high ...

Site Reliability Program Manager

Hiring Organisation
HCLTech
Location
City of London, London, United Kingdom
experience, ideally managing complex cross-functional or globally distributed teams. Must have hands-on experience with packet captures analysis through tools like Observer and Observability development using Splunk, ELF & Grafana. Must have domain experience in Payment Card real time transaction processing, clearing & settlement, dispute and fraud management. Must work from … office minimum 4 days a week and be flexible for 5 days if necessary. Experience with PaaS/SaaS, cloud environments, distributed systems, observability tooling, on-call/incident management tools. Data-driven mindset: comfortable analysing metrics, generating reports, and driving improvements based on data. Familiarity with SRE principles — high ...

Senior Platform & Backend Engineer (NodeJs/Typescript)

Hiring Organisation
Healthera
Location
London, England, United Kingdom
/CD pipelines that enable fast, safe, and repeatable deployments Embed security and compliance by default through DevSecOps practices and platform guardrails Implement observability as a standard: metrics, logs, traces, alerts, SLIs/SLOs Define and own cost and observability standards, including budgets, spend alerts, unit economics, SLIs/SLOs ...

Guidewire Consultant(Technical)

Hiring Organisation
IBU
Location
Greater London, England, United Kingdom
systemic issues, defect trends, and improvement opportunities. Define, socialize, and execute corrective and preventive action plans to reduce defect recurrence. Enhance application logging and observability standards across Guidewire services to improve MTTR , proactive alerting, and anomaly detection. Design, manage, and maintain centralized monitoring and observability dashboards for real-time application ...

Engineering Manager (Python) - AI/ML SaaS Platform

Hiring Organisation
Creo Recruitment
Location
London, UK
Employment Type
Full-time
performance. Run technical design reviews, guide architecture decisions, and support engineers in navigating trade-offs around performance, cost, and reliability. Champion operational excellence — strong observability, testing discipline, incident response, and SLO ownership. Collaborate with Product & Design to define technical requirements, prioritise roadmaps, and drive measurable outcomes. Tech Environment … quality software and scalable data pipelines with predictable velocity. Clear improvements in system reliability, throughput, and cost efficiency. Strong engineering discipline across design, testing, observability, and incident management. Improved technical foundations and reduced operational toil. Clear, thoughtful communication and alignment across engineering, product, and design. ...

Development Lead

Hiring Organisation
Michael Page Technology
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £95,000 per annum
supporting the environment. Perform initial code-level triage when issues arise before involving the development team. Build custom tooling and automation to improve observability, operational efficiency, and reliability. DevOps & Infrastructure Define, build, and configure monitoring, alerting, and instrumentation - choosing when to build custom solutions vs. when … supporting the environment. Perform initial code-level triage when issues arise before involving the development team. Build custom tooling and automation to improve observability, operational efficiency, and reliability. DevOps & Infrastructure Define, build, and configure monitoring, alerting, and instrumentation - choosing when to build custom solutions vs. when ...

Director of Artificial Intelligence

Hiring Organisation
Omnis Partners
Location
City of London, London, United Kingdom
multi-agent systems from scratch using frameworks such as ReAct, CoT loops, LangGraph, and MCP. Build and productionise agentic AI solutions with strong evaluation, observability, and orchestration. Scale deployments across diverse environments (SQL-based workflows, pandas, RAG pipelines, distributed compute). Define evaluation standards : Pass@N, multi-run testing, retriever … engineering systems in production. Strong track record of leading complex technical delivery while remaining hands-on. Solid software engineering foundations : deployment, observability, monitoring, memory orchestration, optimisation. Familiarity with agentic architectures (ReAct, CoT loops, LangGraph, tool orchestration). Excellent communication skills with the ability to run workshops and shape technical direction. ...

Director of Artificial Intelligence

Hiring Organisation
Omnis Partners
Location
London Area, United Kingdom
multi-agent systems from scratch using frameworks such as ReAct, CoT loops, LangGraph, and MCP. Build and productionise agentic AI solutions with strong evaluation, observability, and orchestration. Scale deployments across diverse environments (SQL-based workflows, pandas, RAG pipelines, distributed compute). Define evaluation standards : Pass@N, multi-run testing, retriever … engineering systems in production. Strong track record of leading complex technical delivery while remaining hands-on. Solid software engineering foundations : deployment, observability, monitoring, memory orchestration, optimisation. Familiarity with agentic architectures (ReAct, CoT loops, LangGraph, tool orchestration). Excellent communication skills with the ability to run workshops and shape technical direction. ...

Product Manager, Managed Services

Hiring Organisation
GTT
Location
London, UK
Employment Type
Full-time
SASE, and IoT to ensure seamless integration across the broader network portfolio. This includes aligning capabilities with GTT's Envision Strategy, observability, and digital experience initiatives. This role is well suited for a proactive, growth-oriented leader who drives results autonomously while ensuring alignment with organizational priorities. The ideal candidate … Ensure alignment with GTT's Envision Strategy, enabling automation, visibility, and a unified digital experience. Identify opportunities where AI-driven insights, automation, or enhanced observability can elevate customer outcomes. Monitor market trends and enterprise needs to guide investment priorities across LAN and WLAN services. Product & Services Leadership Build and maintain ...

Product Manager, Managed Services

Hiring Organisation
GTT
Location
South London, UK
Employment Type
Full-time
SASE, and IoT to ensure seamless integration across the broader network portfolio. This includes aligning capabilities with GTT's Envision Strategy, observability, and digital experience initiatives. This role is well suited for a proactive, growth-oriented leader who drives results autonomously while ensuring alignment with organizational priorities. The ideal candidate … Ensure alignment with GTT's Envision Strategy, enabling automation, visibility, and a unified digital experience. Identify opportunities where AI-driven insights, automation, or enhanced observability can elevate customer outcomes. Monitor market trends and enterprise needs to guide investment priorities across LAN and WLAN services. Product & Services Leadership Build and maintain ...

Python Developer

Hiring Organisation
Ncounter
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£160,000 - £180,000 per annum
maintaining the performance, stability, and availability of our software systems. You'll be working closely with mission-critical applications, developing reliability features, improving observability, and building automation tools to streamline operations. About the Role - 5+ years in Python, with familiarity with version control (e.g., Git), and experience working … technologies like Slurm, Airflow, Kafka, or AMPS. - Background in enhancing system stability, scalability, and performance while conducting root cause analyses to resolve incidents efficiently. - Observability skillset, monitoring and analysis of system performance. - Ability to identify and address bottlenecks to improve response times and resource usage for our production systems ...

Robotics Engineer Manager

Hiring Organisation
Wave Recruitment
Location
Greater London, England, United Kingdom
Engineering Manager – Robotics (Software, Hardware, Systems, AI & Functional Safety) Role Overview You’ll lead a new multi-disciplinary engineering group building embedded and full-stack systems for advanced robotics. This includes direct responsibility for functional ...

Presales Consultant

Hiring Organisation
HCLTech
Location
Greater London, England, United Kingdom
Sales Team. Prepare and present compelling “Proof of Value” product demonstrations as needed for the solution defense. You will work closely with multiple Observability (AIOPS) portfolio of products, acting as a subject matter expert and Trusted Advisor across multiple Observability Suite of solutions Your role will involve direct interaction with … Sales team and customers, helping them to rearchitect their enterprise Observability landscape To conceptualize and create the technical architecture framework designed as per project specifications. To effectively design and review high level technical product designs as per client requirements. To guide the teams and ensure timely delivery. To implement ...

Lead Technical Consultant - Service Operations - Dynatrace, AppDynamic

Hiring Organisation
VIQU IT
Location
London, United Kingdom
Employment Type
Permanent
Salary
£80000 - £100000/annum
Consultant who thrives in complex enterprise environments and loves working with cutting-edge technology. You will design, implement, and optimise IT Operations solutions across observability, AIOps, and ITSM platforms, help clients adopt best practices in Event Management and OpenTelemetry, and act as a trusted technical advisor bridging technology and business … that define how the company delivers its services. Skills & Experience: 5+ years in IT Operations, consulting, or related technical roles Hands-on experience with observability platforms : Dynatrace, AppDynamics, Datadog Experience with AIOps/ITSM tools : BigPanda, Splunk ITSM, ServiceNow, or equivalent Expertise in Event Management and OpenTelemetry Strong knowledge ...

Lead Technical Consultant - Service Operations - Dynatrace, AppDynamics, Datadog

Hiring Organisation
Morela
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £100,000 per annum
Consultant who thrives in complex enterprise environments and loves working with cutting-edge technology. You will design, implement, and optimise IT Operations solutions across observability, AIOps, and ITSM platforms, help clients adopt best practices in Event Management and OpenTelemetry, and act as a trusted technical advisor bridging technology and business … that define how the company delivers its services. Skills & Experience: 5+ years in IT Operations, consulting, or related technical roles Hands-on experience with observability platforms : Dynatrace, AppDynamics, Datadog Experience with AIOps/ITSM tools : BigPanda, Splunk ITSM, ServiceNow, or equivalent Expertise in Event Management and OpenTelemetry Strong knowledge ...

Data Engineer - CRM Platforms

Hiring Organisation
Supporting Education Group
Location
London, England, United Kingdom
identity resolution for single customer views, and creating the data models that power segmentation, attribution, and reporting. You'll build enrichment pipelines, set up observability and SLAs, and design the automations that handle lead routing and lifecycle management. You'll work directly with Sales Ops, Marketing, and Analytics, translating requirements … attribution models, and designed automations that actually solve business problems. You think in terms of data products: reusable, trusted datasets with clear SLAs, proper observability, and documentation that others can actually use. You're comfortable working with both technical and commercial stakeholders, translating requirements into robust architectures while keeping solutions ...

Interim Head of Infrastructure and Service Management

Hiring Organisation
Tria
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£800 - £1000/day Outside IR35
ITSM processes, tooling, assets and licensing Defining the target state for infrastructure, service management and operational resilience, covering service architecture, operating model, processes, tooling, observability and recovery capabilities Establishing clear principles for standardisation, automation, cloud and hybrid adoption, and determining which services should be insourced or outsourced Reviewing third-party … operating at executive and Board level, while retaining deep technical credibility across modern infrastructure architectures, cloud and hybrid platforms, enterprise connectivity, ITSM tooling, automation, observability, and operational resilience. This interim role offers a high-profile opportunity to shape the infrastructure and service foundations of a growing FTSE 250 organisation. Interim ...