426 to 450 of 567 Observability Jobs in the UK

Back End Developer

Hiring Organisation
NearTech Search
Location
London Area, United Kingdom
backend initiatives end-to-end, from architecture to rollout • Strengthen testing strategy across unit and integration layers • Improve data and integration workflows with observability and resilience • Optimise Postgres (RDS) and MongoDB performance, modelling and migrations The role requires... • Strong commercial experience with Node.js and TypeScript • Deep API design expertise, including ...

Product Manager

Hiring Organisation
asobbi
Location
United Kingdom
platforms, HPC, or AI systems Familiarity with GPU-based environments and AI workload requirements Understanding of private AI architectures (e.g. model deployment, data pipelines, observability) Experience designing or working with on-prem or hybrid infrastructure solutions Knowledge of performance, scalability, and capacity planning Experience working with technology partners and joint ...

Lead Data Scientist

Hiring Organisation
Zazu Digital Talent
Location
United Kingdom
LLMs behave across queries, surfaces and contexts. You will work across ranking signals, vector and semantic representations, entity understanding, graph-based relationships, model serving, observability, cost and latency optimisation, and the connection between unstructured signals and automated recommendations. You will also help shape the long-term ML strategy, including platform ...

BDR Language Speaker

Hiring Organisation
Pareto
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£30,000 - £35,000 per annum
must speak Filipino fluently to qualify for this role* Our client is a global data platform that helps turn data into action for Observability, IT, Security and more. Leaders in their field, our client is growing at an exciting rate and as such are now looking for new bi-lingual ...

Cloud Engineer

Hiring Organisation
Infinity Quest
Location
Greater Edinburgh Area, United Kingdom
with ITSI, including service modelling, KPIs, thresholds, NEAP, and Service Analyzer. · Experience with MLTK, anomaly detection, forecasting, and operationalizing ML models. · Strong understanding of observability concepts (logs, metrics, traces, golden signals). · Hands-on experience with data onboarding, HEC, Universal Forwarders, Deployment Server, and CIM alignment. · Knowledge of indexing, RBAC ...

Senior Frontend Developer

Hiring Organisation
SEEKR
Location
City of London, London, United Kingdom
bridges so builders can wire their products into hundreds of third‐party tools without hand‐rolling every integration. It handles managed auth, real‐time observability and connector sprawl so product teams can focus on great agent experiences instead of glue code. Your job is to make the surface they ...

IT Service Assurance Specialist - Identity & Access Management

Hiring Organisation
easyJet
Location
Luton, England, United Kingdom
trustworthy, scalable, and efficient IT infrastructure services to support easyJet’s growth and ambitions. Our Platforms form the backbone of our technology capabilities, including Observability, Hosting, Connectivity, Identity and Access, and IT Common Tools. The Role: As a IT Service Assurance Specialist within the Identity and Access Management team ...

Global DevOps Lead

Hiring Organisation
Stott & May Professional Search Limited
Location
United Kingdom
Employment Type
Permanent
Salary
£90,000
opportunities across deployment and operations Reduce manual intervention in release and configuration processes Design scalable CI/CD pipelines that improve speed and quality Observability & Reliability Lead enterprise adoption of Datadog as the observability platform Define monitoring, alerting, and incident response frameworks Improve system reliability, uptime, and incident response times ...

Modern Workplace Solution Architect (M365)

Hiring Organisation
Pontoon
Location
Leeds, West Yorkshire, United Kingdom
Employment Type
Contract
migrations Driving improvements in automation and infrastructure (Terraform, CI/CD pipelines) Exploring AI and modern workplace innovations (e.g., Copilot) Enhancing data visibility and observability across platforms Nice-to-Have Automation experience (PowerShell, Terraform, CI/CD) Exposure to Power Platform (Power Apps, Power Automate) Experience with data, monitoring … observability tools Background in regulated environments (e.g., financial services) Why Join Us? Work on high-impact, enterprise-scale projects that make a difference! Gain exposure to a wide variety of technologies and initiatives. Have the opportunity to influence modern workplace strategy and design . Thrive in a collaborative, fast-moving ...

Senior Software Engineer

Hiring Organisation
Harrington Starr
Location
City of London, London, United Kingdom
business-critical trading platform. The role combines software engineering with reliability engineering. You’ll be involved in designing and building internal tooling, improving observability, automating operations, supporting development teams, and helping ensure trading systems remain stable, scalable, and high performing. It would suit someone who enjoys solving technical problems … speed, resilience, and continuous improvement matter. What you will do Build tools, automation, and internal services that improve platform reliability Implement monitoring, telemetry, and observability standards across distributed systems Analyse performance across application, OS, and network layers to identify bottlenecks Help define and improve SLOs/SLAs for critical services ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
United Kingdom
Employment Type
Permanent, Work From Home
Salary
£65,000
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
South East London, London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£65,000
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Engineering Manager (.NET) - Contract

Hiring Organisation
La Fosse
Location
City of London, London, United Kingdom
resource/capacity management and delivery ownership. - Experience writing executive updates and technical summaries for senior stakeholders. - Strong knowledge of CI/CD, automation, observability, and DevOps maturity models. - Evidence of driving adoption of new tools, frameworks, or processes across multiple teams. Technical Skills & Tools - Languages & Frameworks: C#/.NET … Framework and Core), React - Platforms & Infrastructure: Azure, AKS, Docker, on-prem Windows Server, SQL Server. - IAM and App Gateways: Okta, APIM, Apigee - Monitoring & Observability: Dynatrace, Application Insights - CI/CD & DevOps: Azure DevOps pipelines, SonarCloud, Github - Architecture & Patterns: Microservices, event-driven architecture, domain-driven design, modern scalable design principles ...

Site Reliability Engineer (SRE)

Hiring Organisation
Reading Industrial Pertemps
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£50,000 per annum
platform automation, CI/CD, and developer tooling.This is a hands-on role split between supporting engineers and building scalable infrastructure, automation, and observability solutions. You’ll work closely with the Head of Technology and engineering teams to improve reliability, developer experience, and platform performance. What You’ll Be Doing … Build reusable Terraform modules and manage infrastructure-as-code standards Develop internal tooling, automation scripts, self-service tooling, and platform improvements Own and improve observability across monitoring, dashboards, alerting, and runbooks Identify opportunities to automate manual processes and improve platform reliability Contribute to scalable, maintainable, and secure infrastructure practices What ...

Forward Deployed Engineers

Hiring Organisation
Randstad Technologies
Location
London, UK
Employment Type
Full-time
implement AI agents, including: Retrieval (RAG) Orchestration workflows Tool/function invocation Policy-based routing Build evaluation frameworks for accuracy, latency, and reliability Implement observability and monitoring for agent lifecycle AI Platform Integration Integrate with AI providers (e.g., OpenAI, Anthropic, Google Vertex, open-source models) Build abstraction layers to support … production (agents, RAG, orchestration) Proficiency in Python, Java, or similar backend languages Experience with: CI/CD pipelines Infrastructure as code Monitoring and observability tools Hands-on experience with AI platforms (OpenAI, Claude, Vertex AI, or similar) Preferred Experience Experience with agent frameworks (e.g., LangGraph, AutoGen, CrewAI) Experience designing multi ...

Forward Deployed Engineers

Hiring Organisation
Randstad Digital
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£450 - £500 per day + Inside IR35
implement AI agents, including: Retrieval (RAG) Orchestration workflows Tool/function invocation Policy-based routing Build evaluation frameworks for accuracy, latency, and reliability Implement observability and monitoring for agent lifecycle AI Platform Integration Integrate with AI providers (e.g., OpenAI, Anthropic, Google Vertex, open-source models) Build abstraction layers to support … production (agents, RAG, orchestration) Proficiency in Python, Java, or similar backend languages Experience with: CI/CD pipelines Infrastructure as code Monitoring and observability tools Hands-on experience with AI platforms (OpenAI, Claude, Vertex AI, or similar) Preferred Experience Experience with agent frameworks (e.g., LangGraph, AutoGen, CrewAI) Experience designing multi ...

Cloud Engineer

Hiring Organisation
Spectrum It Recruitment Limited
Location
Southampton, Hampshire, South East, United Kingdom
Employment Type
Permanent
Salary
£65,000
Terraform to automate and standardise infrastructure delivery. You'll support the migration and modernisation of traditional infrastructure into cloud services. You'll improve monitoring, observability, security and resilience across cloud platforms. You'll work with engineering, infrastructure and business teams to turn requirements into practical cloud solutions. You'll contribute … teams and wider stakeholders Useful: Cloud migration experience Azure DevOps and YAML pipelines PowerShell, Python or Bash scripting Docker or containerised environments Monitoring and observability tooling Experience in regulated or customer-critical environments Why apply? This is a good opportunity for a Cloud Engineer who wants to work on meaningful ...

Platform Engineer

Hiring Organisation
Albert Bow
Location
City of London, London, United Kingdom
preparation, turning compliance into a competitive advantage Build and maintain robust CI/CD pipelines across backend, frontend, and data services Establish company-wide observability — logging, metrics, tracing, alerting, and on-call culture Take ownership of cloud cost management, optimising spend without compromising performance Champion operational excellence across the engineering … What You'll Bring Technical Cloud & IaC: Azure (AWS a bonus), Terraform, AKS/Kubernetes, Docker, GitHub Actions Observability: Hands-on experience with logging, metrics, and distributed tracing frameworks Security: Secrets management, security scanning, and infrastructure hardening best practices Networking: VPCs, DNS, load balancers, VPNs, firewalls — you know your ...

Senior Software Engineer

Hiring Organisation
Cue
Location
United Kingdom
architectures in production Strong Linux, networking, and infrastructure fundamentals (SRE/platform background ideal) Hands-on experience with containers, Kubernetes, CI/CD, and observability Fluency reading and writing code — you don't need to be the fastest coder, but you need to understand code deeply Active … InfluxDB Networking : Traefik ingress with NLBs, ExternalDNS, cert-manager, Route53, CloudFront, VPC peering Security : GuardDuty, Security Hub, Elasticsearch SIEM, CrowdSec WAF, Firezone (Zero Trust) Observability : Grafana, CloudWatch, Fluent Bit, Goldilocks CI/CD : ArgoCD, GitHub Actions (OIDC), Atlantis What we value over credentials We don't care about your degree ...

Observability Engineer/SRE Engineer

Hiring Organisation
IF Recruitment Ltd
Location
Surrey, United Kingdom
Employment Type
Contract
Contract Rate
GBP Annual
Responsibilities: Assess the current state of monitoring and observability across applications. Provide proactive alerting & visualization by creating actionable dashboards for applications and alerting strategies. Establish and promote monitoring best practices. Act as an escalation point for incidents and provide strategic guidance and recommendations to engineering and operations teams. Use Infrastructure … Code (IaC) tools like Terraform and Scripting (Python, Bash, PowerShell) to automate monitoring setups. Key Skills: Solid experience with APM, monitoring, observability and event management tools including Datadog, Dynatrace/AppDynamics, Grafana. Experience with JSON and Scripting languages such as Python, Bash, PowerShell or JavaScript for automation of tasks. Exposure ...

Contract Software Engineer (Observability & Telemetry)

Hiring Organisation
Xpertise
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
GBP Annual
Real Time Kernel Observability Engineer | Contract | Inside IR35 | Hybrid London We're working with a global Real Time data platform operating in ultra-low latency, high-throughput environments across distributed systems. This role sits in a specialist engineering team building Kernel-level observability and telemetry infrastructure used to monitor … understand system behaviour in Real Time. What you'll be doing Build Kernel-level observability and instrumentation systems for distributed Real Time infrastructure Develop telemetry pipelines using eBPF-based tooling (metrics, logs, traces at Kernel level) Design and implement system-wide visibility across latency-critical services Work with hotspot detection ...

Senior Full Stack Engineer

Hiring Organisation
Accelerant
Location
United Kingdom
proud that our insurers have been awarded an AM Best A- (Excellent) rating. For more information, please visit www.accelerant.ai. Full Stack Software Engineer - Data Observability & Operational Tooling Accelerant is building a new platform that will serve as the backbone of how risk is exchanged in the future. The Data Observability … team is looking for a Full Stack Software Engineer with a front-end emphasis to build and serve the internal operational tooling and data observability capabilities that power our data platform. This is a unique role: you'll be a software engineer embedded within a data analytics engineering team, working ...

DevOps Engineer ( Azure )

Hiring Organisation
Experis
Location
Wembley, England, United Kingdom
Responsibilities Observability & Monitoring Platform Design, implement, and own an Azure observability playbook, delivering comprehensive dashboards, alerting rules, and operational runbooks using Application Insights, Log Analytics, and Kusto Query Language (KQL). AIOps & Intelligent Automation Develop AI‐driven alerting and detection mechanisms to surface early‐warning signals, including IP reputation degradation … scale. Infrastructure as Code Expertise Deep proficiency in Terraform, including module design, remote state management, workspace strategies, and multi‐environment deployment patterns. Monitoring & Observability Expertise Advanced experience with KQL for Azure Log Analytics, with the ability to design and build custom Azure Monitor Workbooks for operational insight and reporting. Security ...

AI Deployment & Platform Engineer

Hiring Organisation
LEC AI
Location
London, England, United Kingdom
engineering team to deploy AI systems into live environments, manage runtime infrastructure, scale orchestration systems, optimise inference performance, and build the deployment pipelines and observability that keep everything running. This is a deeply hands-on engineering role for someone who enjoys building production infrastructure, solving operational problems, and making … inference infrastructure and deployment automation • Design scalable runtime environments for multi-agent systems Reliability and Scaling • Monitor system performance, latency, throughput, and uptime • Build observability, logging, and alerting systems • Manage autoscaling and infrastructure optimisation • Debug production failures and runtime bottlenecks Infrastructure Operations • Monitor model drift, data drift, and runtime quality ...

Splunk Developer

Hiring Organisation
Infoplus Technologies UK Ltd
Location
Dunfermline, Fife, UK
application teams to deliver scalable monitoring, service health, and analytics solutions.________________________________________ Key Responsibilities Technical Leadership Act as Technical Lead for Splunk implementations across monitoring, observability, and service intelligence use cases.Own end to end Splunk solution design including data onboarding, data models, dashboards, alerts, and ITSI objects.Review and govern Splunk development … Studio/Classic dashboardsDesign meaningful alerts using:oCorrelation searchesoRisk based alerting principlesTranslate operational and business requirements into actionable insights.Observability & Production SupportIntegrate Splunk with enterprise observability tools (APM, infrastructure monitoring, cloud platforms).Support production incidents using Splunk, driving root cause analysis and post incident reviews.Improve alert quality by reducing noise ...