126 to 150 of 285 Observability Jobs in London

Artificial Intelligence Engineer

Hiring Organisation
Retelligence
Location
London, UK
Employment Type
Full-time
pipelines, and automation – Apply the latest ML/LLM research to practical product needs within a regulated environment – Contribute to design reviews, code quality, observability, and incident response You Will Need – Strong Python engineering background with experience building scalable back-end services – Solid ML foundations with exposure to LLM integration ...

Technical Development Lead - Enfield, London

Hiring Organisation
Crimson
Location
Enfield, Middlesex, England, United Kingdom
Employment Type
Full-Time
Salary
£65,000 - £80,000 per annum
CIAM flows, and adhering to ISO 27001 standards. Develop resilient architectures for retail and e-commerce systems, considering networking and SD-WAN performance. Configure observability tools for monitoring, logging, and performance metrics. Mentor and guide a small technical team, enforce coding standards, and apply Agile principles. Translate business objectives into ...

Senior Data Engineer, Azure

Hiring Organisation
Arc IT Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
Server and Power BI Solid understanding of data warehousing, lakehouse and datalake architectures Familiarity with modern data engineering patterns (ETL/ELT, medallion architecture, observability) Excellent Python and SQL skills Experience working with Apache Spark and large-scale data processing For a full consultation on this exciting new role, please ...

Azure CloudOps Engineer

Hiring Organisation
Adecco
Location
Croydon, London, United Kingdom
Employment Type
Contract, Temporary
Salary
£516/day
Reliability Engineering (SRE) principles. Knowledge of GDS standards , compliance frameworks, and ITSM integration. Expertise in IaC (Bicep/Terraform) , scripting ( PowerShell/Python ), and observability tools. Commercial awareness and ability to manage cloud costs effectively . Solid understanding of networking, Active Directory, DNS , and hybrid cloud scenarios. Why Join ...

Azure CloudOps Engineer

Hiring Organisation
Adecco
Location
South Croydon, Surrey, England, United Kingdom
Employment Type
Contractor
Contract Rate
£516 per day
Reliability Engineering (SRE) principles. Knowledge of GDS standards , compliance frameworks, and ITSM integration. Expertise in IaC (Bicep/Terraform) , scripting ( PowerShell/Python ), and observability tools. Commercial awareness and ability to manage cloud costs effectively . Solid understanding of networking, Active Directory, DNS , and hybrid cloud scenarios. Why Join ...

Palantir Consultant

Hiring Organisation
Staffworx
Location
London, UK
Employment Type
Full-time
Scalability, Reliability & Operations Help investigate performance issues (e.g. parallelisation, partitioning, caching, compute configuration) with mentorship from more senior colleagues. Contribute to monitoring, alerting and observability setup for pipelines, applications and integrations. Participate in incident response and root cause analysis for platform and application issues. Assist in applying non-functional requirements ...

Palantir Consultant

Hiring Organisation
Staffworx Limited
Location
Central London, London, United Kingdom
Employment Type
Permanent
Scalability, Reliability & Operations Help investigate performance issues (eg parallelisation, partitioning, caching, compute configuration) with mentorship from more senior colleagues. Contribute to monitoring, alerting and observability setup for pipelines, applications and integrations. Participate in incident response and root cause analysis for platform and application issues. Assist in applying non-functional requirements ...

Head of Product Operations and Support

Hiring Organisation
Gray Global Placements LTD
Location
Central London, London, United Kingdom
Employment Type
Permanent
Requirements: - 15+ years of experience in leading support/operations roles in enterprise SaaS or technology environments. - Familiarity with cloud-based environments (AWS) and observability platforms. - Background in managing support across hybrid or multi-tenant platforms. - Proven experience in building and scaling global support teams and operational processes. - Expertise ...

Head of Product Operations and Support

Hiring Organisation
Gray Global Placements
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
Requirements: - 15+ years of experience in leading support/operations roles in enterprise SaaS or technology environments. - Familiarity with cloud-based environments (AWS) and observability platforms. - Background in managing support across hybrid or multi-tenant platforms. - Proven experience in building and scaling global support teams and operational processes. - Expertise ...

Lead Decision Intelligence Consultant (Palantir)

Hiring Organisation
Staffworx
Location
London, UK
Employment Type
Full-time
Foundry artefacts. Scalability, Reliability & Operations Lead performance tuning for large-scale production deployments (eg parallelisation, partitioning, caching, compute configuration). Design monitoring, alerting and observability for pipelines, applications and integrations. Handle incident response and root cause analysis for platform and application issues. Define and enforce non-functional requirements (SLA/ ...

Lead Decision Intelligence Consultant (Palantir)

Hiring Organisation
Staffworx
Location
South London, UK
Employment Type
Full-time
Foundry artefacts. Scalability, Reliability & Operations Lead performance tuning for large-scale production deployments (eg parallelisation, partitioning, caching, compute configuration). Design monitoring, alerting and observability for pipelines, applications and integrations. Handle incident response and root cause analysis for platform and application issues. Define and enforce non-functional requirements (SLA/ ...

Head of Integrations SaaS / Software

Hiring Organisation
RedTech Recruitment
Location
East London, London, United Kingdom
Employment Type
Professional qualifications
understand client requirements and deliver tailored technical solutions. Design and implement scalable, future-proof architectures for new third-party connectors and integrations. Enhance system observability by improving diagnostics, logging, and tracing to aid technical support teams in resolving issues swiftly. Oversee the ongoing development and management of the public ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
London, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Software Engineer - AI & ML (Based in Dubai)

Hiring Organisation
Property Finder
Location
London Area, United Kingdom
/semantic search infrastructure Evaluation dashboards, prompt/version management, and feedback loops Own services end-to-end: from design and implementation to monitoring, observability, and on-call, ensuring high availability, performance, and reliability. Collaborate with cross-functional teams (Product, Data Science, Data Engineering, Design, DevOps/SRE) to translate … NodeJS, or Python Solid understanding of cloud architecture and cloud-native technologies, preferably AWS. Experience designing and operating highly distributed, scalable services with strong observability (metrics, logs, traces, dashboards, alerts). Familiarity with MLOps practices and tools: CI/CD for ML, model deployment patterns, monitoring model performance and data ...

Senior Software Engineer - AI & ML (Based in Dubai)

Hiring Organisation
Property Finder
Location
City of London, London, United Kingdom
/semantic search infrastructure Evaluation dashboards, prompt/version management, and feedback loops Own services end-to-end: from design and implementation to monitoring, observability, and on-call, ensuring high availability, performance, and reliability. Collaborate with cross-functional teams (Product, Data Science, Data Engineering, Design, DevOps/SRE) to translate … NodeJS, or Python Solid understanding of cloud architecture and cloud-native technologies, preferably AWS. Experience designing and operating highly distributed, scalable services with strong observability (metrics, logs, traces, dashboards, alerts). Familiarity with MLOps practices and tools: CI/CD for ML, model deployment patterns, monitoring model performance and data ...

Site Reliability Engineer - Global Hedge Fund

Hiring Organisation
Paragon Alpha - Hedge Fund Talent Business
Location
London Area, United Kingdom
performance trading platform, with a strong focus on automation, reliability, and system resilience. You will be responsible for building operational tooling and automation, improving observability and incident response, and applying core SRE principles to ensure the stability, performance, and scalability of mission-critical trading systems. Stack: Python, Linux, Kubernetes, Terraform ...

Site Reliability Engineer - Global Hedge Fund

Hiring Organisation
Paragon Alpha - Hedge Fund Talent Business
Location
City of London, London, United Kingdom
performance trading platform, with a strong focus on automation, reliability, and system resilience. You will be responsible for building operational tooling and automation, improving observability and incident response, and applying core SRE principles to ensure the stability, performance, and scalability of mission-critical trading systems. Stack: Python, Linux, Kubernetes, Terraform ...

OpenAI Architect (FDE)

Hiring Organisation
HCLTech
Location
City of London, London, United Kingdom
/function calling, Responses/Chat Completions, Embeddings, Files/Batch, Moderations), fine‐tuning pipelines, and agentic RAG then drive PoC → Production with governance, observability, and cost control. Keep solutions portable with pragmatic use of cloud services, LangChain/LangGraph/Semantic Kernel, and standard vector stores. What …/perf SLOs. • Fine‐tuning lifecycle: Own dataset curation, training/eval, bias checks, rollback/versioning, and telemetry for tuned models. • Operability: Add observability (OpenAI Observability/OpenTelemetry), token/cost telemetry, retries/backoff, idempotency, and feature flags/canaries; document runbooks and SOPs. Cross‐platform & enterprise integration ...

OpenAI Architect (FDE)

Hiring Organisation
HCLTech
Location
London Area, United Kingdom
/function calling, Responses/Chat Completions, Embeddings, Files/Batch, Moderations), fine‐tuning pipelines, and agentic RAG then drive PoC → Production with governance, observability, and cost control. Keep solutions portable with pragmatic use of cloud services, LangChain/LangGraph/Semantic Kernel, and standard vector stores. What …/perf SLOs. • Fine‐tuning lifecycle: Own dataset curation, training/eval, bias checks, rollback/versioning, and telemetry for tuned models. • Operability: Add observability (OpenAI Observability/OpenTelemetry), token/cost telemetry, retries/backoff, idempotency, and feature flags/canaries; document runbooks and SOPs. Cross‐platform & enterprise integration ...

Platform Engineer

Hiring Organisation
La Fosse Associates Limited
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
GBP 500 - 550 Daily
Knowledge of Infrastructure as Code tools and automation Experience in implementing and maintaining CI/CD pipelines to enable rapid deployment Ability to implement observability and troubleshoot complex issues Experience with on-Prem Network Deployment and Automation Experience with Cloud Network Deployment and Automation Windows administration, Linux administration Knowledgeable ...

Cloud Platform Network Engineer

Hiring Organisation
Skillsbay Limited
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£80,000
across cloud and on-premise environments Troubleshoot and resolve connectivity issues quickly and effectively Automate network configuration using Terraform, PowerShell and Azure CLI Maintain observability using Azure Monitor, Log Analytics and Network Watcher Ensure deployments align with security and compliance standards Produce technical documentation and support knowledge sharing Required Experience ...

Solutions Engineer

Hiring Organisation
Harnham
Location
London Area, United Kingdom
with front-end frameworks like React. Skilled in SQL , data warehousing ( BigQuery ), and data visualisation. Knowledgeable about AI integration , agent frameworks, prompt engineering, and observability tools. Familiar with automation, ETL/ELT, and modern software engineering practices. ...

Solutions Engineer

Hiring Organisation
Harnham
Location
City of London, London, United Kingdom
with front-end frameworks like React. Skilled in SQL , data warehousing ( BigQuery ), and data visualisation. Knowledgeable about AI integration , agent frameworks, prompt engineering, and observability tools. Familiar with automation, ETL/ELT, and modern software engineering practices. ...

Scala Developer - £55K - £60K + 5% Pension

Hiring Organisation
Stealth IT Consulting Limited
Location
London, England, United Kingdom
Agile environment (Scrum/Kanban).Participate in code reviews, architecture discussions and pair programming.Troubleshoot and resolve production issues; contribute to reliability and observability (logging, metrics, alerts).Help define CI/CD pipelines and deployment processes (e.g., Jenkins/GitHub Actions/Concourse).Produce concise technical documentation and handover notes.Must-have ...

Scala Developer - Remote Contract - Immediate Start

Hiring Organisation
Stealth iT Consulting
Location
London, UK
Employment Type
Full-time
environment (Scrum/Kanban). Participate in code reviews, architecture discussions, and pair programming sessions. Troubleshoot and resolve production issues; contribute to reliability and observability (logging, metrics, alerts). Assist in defining CI/CD pipelines and deployment processes (e.g., Jenkins, GitHub Actions, Concourse). Produce concise technical documentation ...