976 to 1,000 of 1,270 Observability Jobs

Observability Engineer- Dynatrace

Hiring Organisation
eTeam
Location
Telford, England, United Kingdom
Recruitment specialist that provides support to the clients across EMEA, APAC, US and Canada. We have an excellent job opportunity for you. Job Title: Observability Engineer- Dynatrace Duration: 6 months Location: Telford - 2 days min per month Rate:581GBP/Day(Inside IR35) Role Description: As an Observability Engineer … insight, and proactive incident management. Key Responsibilities: Translate high-level monitoring and non-functional requirements (NFRs) into actionable configurations in Dynatrace. Deliver full-stack observability solutions, including application-aware network performance monitoring (NPM), synthetics, log analytics, and infrastructure metrics. Collaborate with architects and project teams to integrate monitoring into solution ...

Data Architect (UK)

Hiring Organisation
Jobleads-UK
Location
Fleet, England, United Kingdom
Quantios is a leading provider of software solutions for the trust administration and corporate services industry. With over 30 years of experience, we empower our clients with innovative technology that enhances governance, operations, and investment ...

Senior Product Engineer

Hiring Organisation
Jobleads-UK
Location
City of Edinburgh, Scotland, United Kingdom
What you’ll be doing End-to-end ownership — from idea to launch You lead the charge, collaborate closely, and ship fast. You don’t just influence the direction, you build and drive towards it. ...

Senior Software Engineer - UltraGrid

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Overview Hypervolt is at the forefront of the electric vehicle charging revolution, dedicated to providing innovative and reliable EV charging solutions. We launched in 2021 with the bold ambition to transform the EV charging space ...

London-based Senior Observability SRE for Trading Systems

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
leading financial services company seeks a Senior Software Engineer to enhance systems that ensure observability and performance metrics are streamlined and accurately reported. This role involves collaboration with engineering teams across multiple locations, offering significant technical growth opportunities and engagement with high-level systems. Candidates should possess experience in high ...

Senior Software Engineer

Hiring Organisation
Apache Associates
Location
Manchester Area, United Kingdom
Design and implement rigorous tenant isolation controls at schema and application level. Develop comprehensive testing strategies that guarantee customer data separation and platform integrity. Observability & Autonomous Systems Implement deep observability using OpenTelemetry to ensure all system and agentic behaviours are traceable, explainable, and auditable across distributed environments. Technical Review & Mentorship … Backend & Runtime Bun Elysia Drizzle ORM Postgres 15+ Frontend Next.js 15 (App Router) React 19 Tailwind CSS Infrastructure Kubernetes RabbitMQ Valkey (Redis-compatible) Identity & Observability Keycloak (OIDC/SAML) OpenTelemetry Why Apply? This is an opportunity to work on technically challenging, enterprise-scale systems within a highly engineering-driven organisation ...

Telemetry and Observability Engineer

Hiring Organisation
Oscar Associates (UK) Limited
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
GBP 400 - 500 Daily
Telemetry and Observability Engineer (Inside IR-35) London/Hybrid (3 days on-site) I'm working with a global organisation building next-generation cloud-native and observability platforms at enterprise scale, and they're looking for a strong Senior Observability Engineer to join the team. This is a high ...

Automation Engineer

Hiring Organisation
Jobleads-UK
Location
Glasgow, Scotland, United Kingdom
Title: Automation and Observability Engineer Location: Glasgow (hybrid) Contract: 12-months (possible extensions) Pay rate: up to £500 p/d PAYE Are you an experienced Automation and Observability Engineer looking to make an impact in a global financial services environment? We are seeking a talented engineer to join … Enterprise Technology Services team, delivering high-quality automation and observability solutions to enhance data protection and system reliability. What you'll do: Work closely with internal teams to automate processes and improve alerting/observability solutions. Focus on enhancing the reliability of the data protection environment through automation or improved ...

Senior Software Engineer/SRE - BQL Reliability Engineering

Hiring Organisation
Jobleads-UK
Location
City Of London, England, United Kingdom
financial markets analysis, research, and modeling. In terms of scale, BQL handles ~100 million requests hourly from ~100K active firms. The BQL Platform Observability team owns the reliability of the BQL platform, using observability as its primary lever. We ensure that the BQL ecosystem—spanning workload management layers, the query … Software Development Lifecycle—driving instrumentation standards, defining SLIs and SLOs, influencing design reviews, and ensuring production learnings continuously improve engineering practices. By institutionalizing observability and reliability across the platform, we help teams build and operate BQL with confidence at scale. What You’ll Do: As part of the BQL Platform ...

AWS Cloud engineer- Remote- OutsideIR35

Hiring Organisation
Opus Recruitment Solutions
Location
London, United Kingdom
Employment Type
Permanent
Salary
£400 - £425/day OutsideIR35
role will focus on supporting and evolving a large-scale AWS cloud environment with a strong emphasis on Kubernetes, platform automation, CI/CD, observability, and cloud-native engineering practices. Key responsibilities: Design, build, and maintain scalable AWS infrastructure and deployment pipelines Support and optimize Kubernetes workloads running on Amazon … Octopus Deploy, and Bitbucket Manage and support containerized applications across EKS and ECS environments IaC: Infrastructure as Code (CloudFormation/CDK) Experience with monitoring, observability, and logging platforms including Splunk, CloudWatch, New Relic, or similar solutions AWS Cloud engineer | Until 30/11/26 | £(Apply online only) OutsideIR35 | Remote ...

Platform Engineer

Hiring Organisation
Gravitas Recruitment Group (Global) Ltd
Location
London Area, United Kingdom
responsibilities include: Scaling serverless cloud infrastructure for growth and multi-region reliability Building and improving CI/CD pipelines and deployment systems Enhancing observability, monitoring, and incident response Developing internal tooling to improve engineering productivity Contributing to production code (TypeScript) across infrastructure and product Tech Environment AWS (serverless-first architecture … Pulumi (or similar infrastructure-as-code tools) GitHub Actions for CI/CD Datadog for observability TypeScript across the stack What They’re Looking For Strong platform engineering experience in cloud-native SaaS environments Hands-on experience with AWS serverless architecture (e.g. Lambda, DynamoDB, event-driven systems) Experience building ...

Senior DevOps Engineer

Hiring Organisation
Understanding Recruitment
Location
Oxford, England, United Kingdom
hire a Senior DevOps Engineer. This is a great opportunity to join a highly technical, cross-functional engineering team working across cloud infrastructure, DevOps, observability, and business-critical platform integrations. The environment is heavily AWS-focused, with strong investment in automation, security, and scalable architecture. This role would suit someone … support scalable AWS infrastructure using Terraform and GitLab CI/CD Drive DevOps and DevSecOps best practices across cloud platforms and deployment processes Improve observability, monitoring, and platform reliability across customer-facing systems Perform troubleshooting, root-cause analysis, and production support within complex cloud environments Bring strong AWS experience across ...

Observability Lead with Opentelemetry skills (m/f)

Hiring Organisation
1st solution consulting gmbh
Location
United Kingdom
Employment Type
Contract
Contract Rate
GBP Annual
Observability Technical Lead (m/f) Start : ASAP Duration : 6 months Location : remote Tasks: Opentelemetry - Otel experience Plan and Execute migration frontrunner with Lines of Businesses (LoB) from one or more of the 3rd party observability tools in scope of the program to One Observability target stack Define clear outcomes … frontrunner technical landscape and process scenario for Ops teams, SREs and others to use as proof point of what feature or gaps on the Observability target stack with the support from GCO and BTP (as One Observability target stack component providers) Document the front runner outcome come with reference ...

Director, AI Platform Owner

Hiring Organisation
Jobleads-UK
Location
City Of London, England, United Kingdom
alignment, and audit readiness in partnership with enterprise risk and compliance teams. Oversee AI gateway and control‐plane capabilities including usage tracking, rate limiting, observability, logging, and chargeback mechanisms. Establish clear RACI models for platform ownership versus domain ownership. Enable safe self‐service AI development across low‐code … gateway patterns, zero‐trust architecture, and identity-centric access controls. Understanding of Responsible AI, data classification, and AI risk management. Experience operating platform observability and usage analytics. Background in multi-vendor or hybrid platform environments. Experience within financial services, asset management, or other regulated industries is strongly preferred. Familiarity with ...

Senior AI Engineer - MCP / AI Tooling

Hiring Organisation
Adepta Partners Ltd
Location
Belfast, Northern Ireland, United Kingdom
agents and internal tooling Implement authentication, session handling, streaming, and stateful interactions Help define standards and reusable patterns for MCP development Contribute to observability, reliability, security, and platform scalability Work closely with senior engineers on architecture and AI-native engineering practices Build systems that support safe, governed AI-assisted workflows … product mindset Comfortable working in fast-moving, evolving technical environments Nice to Have Experience with Cloudflare Workers, serverless platforms, or edge environments Knowledge of observability, CI/CD, and platform engineering Understanding of AI security considerations such as prompt injection and permission scoping Experience building internal AI tooling or developer ...

Software Engineer

Hiring Organisation
Metric
Location
City of London, London, United Kingdom
software for advanced computing platforms. Build and optimise low-latency software interfaces and hardware integrations. Contribute to DevOps, CI/CD pipelines, monitoring, and observability tooling. Lead technical projects from design through deployment. Collaborate with product, engineering, and research teams to deliver new capabilities. Improve system performance, reliability, and scalability. … embedded systems, hardware integration, FPGAs, or scientific instrumentation. Background in quantum computing, HPC, telecoms, robotics, defence, semiconductors, or other deep-tech environments. Experience with observability tools such as Grafana, Prometheus, or InfluxDB. Knowledge of digital signal processing, RF systems, or data acquisition. About You: You are a hands-on engineer ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
moment when the work is genuinely changing shape. Over the last year we've hardened the platform, reduced cost, and built serious observability into our highest-volume systems. The next year is about scaling that work, absorbing infrastructure from a recent acquisition, and being thoughtful about how AI shows … thrive. Here's what that looks like in practice: Month 1 : You're onboarded across our AWS estate, Terraform, and observability stack. You've completed your first on-call shift with support from the team, landed your first PR in the DevOps repo, and started working Claude Enterprise into your ...

Senior Security Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
strategy, conduct technical evaluations, and manage escalation relationships with strategic partners. Ensure compliance with regulatory frameworks and risk management practices in financial services. Drive observability and detection strategies using logging and SIEM solutions, enhancing our security posture. Collaborate with internal stakeholders, architecture teams, and third-party vendors to maintain … Proficiency in cloud networking and security in AWS, GCP, or Azure, including Transit Gateway, VPN/Direct Connect, and Azure Virtual WAN. Experience with observability and detection strategies using logging and SIEM solutions, such as Splunk. Availability for senior technical escalation during critical incidents and after-hours work for high ...

Platform Engineer - DevOps Specialist - 6 months contract £650/d inside IR35

Hiring Organisation
Tenth Revolution Group
Location
United Kingdom
PLEASE NOTE that you must be SC Eligible and open to travel to Telford - 2 days min per month Are you passionate about SRE, observability, and driving operational excellence We’re looking for a talented Observability Engineer to help us build and scale world-class monitoring solutions across complex technology … landscapes. 🔍 About the Role As an Observability Engineer, you’ll play a critical role in ensuring system reliability, performance, and proactive incident management . You’ll work across teams to embed observability into the heart of our solutions, leveraging cutting-edge tooling such as Dynatrace . 💡 Key Responsibilities Translate ...

Senior Software Engineer

Hiring Organisation
Vermillion Analytics
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£65,000 - £80,000 per annum
junior engineers on implementation, design patterns, and engineering practice Identifying and resolving systemic technical issues, not just isolated bugs Improving deployment reliability, monitoring, and observability Communicating trade-offs and risks clearly — to engineers and non-engineers alike Participating in and leading production incident response where needed WHAT THEY'RE LOOKING … experience with Svelte and/or jQuery AWS cloud infrastructure OpenAI APIs or LLM-powered application development Browser or email extension development CRM integrations Observability and operational tooling WHAT GOOD LOOKS LIKE HERE Over time, senior engineers at this company become trusted owners of complex systems, lead technical initiatives involving ...

Principal Engineer - Digital Experience Platform

Hiring Organisation
Jobleads-UK
Location
Skipton, England, United Kingdom
focus of your role is **Value–Flow–Quality (VFQ)**, improve delivery flows, strengthen test automation and embed quality inside CI, and build release‐linked observability that ties every deployment to metrics, logs, traces and golden signals.You will also own the engineering strategy that enables the platform to evolve, to maximise … where lead time drops, deployment frequency rises, and change‐failure rate stays low, using automation‐first pipelines, trunk‐based development, progressive delivery, release‐linked observability and data‐ready environments to make fast, safe flow the norm – with daily/weekly delivery of value the norm.**2. Deep craft in modern ...

Principal ML Platform Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
GPUs and cloud infrastructure. Develop internal tools and abstractions and agentic systems that reduce operational overhead for researchers and engineers. Drive improvements across observability, automation, reliability, and developer experience. Collaborate closely with researchers and product engineers to understand pain points and turn them into robust platform capabilities. Contribute to technical … model serving systems in production. Supporting research or data‐intensive workloads. Working with GPU‐based systems or other performance‐sensitive infrastructure. Experience with observability and debugging in distributed systems. Familiarity with Terraform, Datadog, GitHub Actions, or similar tools. Bonus points for Experience building agentic or LLM‐powered internal tools. Experience ...

Data Architect

Hiring Organisation
Jobleads-UK
Location
Houghton-le-Spring, England, United Kingdom
technologies and tools that enhance Arriva's data and BI capabilitiesStay current with emerging trends including open table formats (Apache Iceberg, Delta Lake), data observability, real-time/streaming architectures, and cloud-native solutionsConduct proofs-of-concept and technical assessments to validate new technologies before adoptionMonitor industry best practices … patterns including ETL/ELT, data lake architectures, and lakehouse approachesUnderstanding of emerging technologies such as open table formats (Apache Iceberg, Delta Lake), data observability tools, and real-time/streaming architecturesExperience with data governance frameworks, data quality tools, metadata management, and ensuring compliance with security and regulatory requirementsPersonal AttributesStrong ...

Principal 5G Network Core Architect

Hiring Organisation
Jobleads-UK
Location
United Kingdom
security, key management, and trust boundaries Leading the cloud-native design and deployment of 5GC, OAM, and supporting control components (Kubernetes, CNFs/VNFs, observability, resilience) Defining and maintaining OAM data models and workflows aligned with standard management frameworks (O-RAN SMO/O1/O2, NETCONF/YANG) Contributing … security (3GPP security, IPsec/TLS, PKI, RBAC, logging/audit)Experience with cloud-native telco platforms (Kubernetes, CNFs/VNFs, Helm/Operators, observability stacks) Hands-on lab experience integrating 5GC + gNB + OAM using COTS components and standard interfaces (NG, F1/E1, O1/O2, NETCONF ...

Staff AI Machine Learning Engineer

Hiring Organisation
Medeloop
Location
Richmond, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
decommissioning agents dynamically for complex healthcare workflows). Develop rigorous evaluation and safety frameworks - automated testing, benchmarking, regression testing, adversarial testing, safety guardrails, observability (tracing, logging, metrics), and human-in-the-loop mechanisms to ensure reliable, compliant performance in production. Drive LLM and ML model development - train, fine-tune … tools: LangChain/LangGraph, Model Context Protocol (MCP), Agent-to-Agent (A2A) protocols, Hugging Face, PyTorch, vector databases/semantic search, prompt engineering, and observability platforms (e.g., LangSmith, Phoenix). Experience designing fully automated evaluation and testing pipelines for autonomous agents and their orchestration, including metrics for reliability, safety, factuality ...