Observability Jobs in the UK

551 to 575 of 708 Observability Jobs in the UK

Consultant/Sr. Consultant - Apigee Engineer/API Platform Specialist (USAPI)

Reading, Berkshire, United Kingdom
Blue Altair
and maintain shared flows and reusable proxy patterns for authentication, logging, error handling, and traffic control. Monitor and troubleshoot platform issues using Kubernetes tools (kubectl, helm) and integrate with observability platforms (e.g., Cloud Ops, Prometheus, ELK). Collaborate with backend teams, security teams, and infrastructure teams to ensure seamless API adoption and runtime stability. Provide technical leadership, documentation, and mentorship … design, high availability, and TLS configuration. Familiarity with Cassandra (for Apigee Hybrid runtime), cluster scaling strategies , and troubleshooting synchronization or runtime issues . Working knowledge of log management and observability solutions (e.g., Fluent Bit, Splunk, ELK, or Google Cloud Logging). Exposure to multi-region or hybrid cloud deployments , and knowledge of best practices around IP address planning and firewalling More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Staff and Team Lead, Onyx Application Engineering

London, United Kingdom
Hybrid / WFH Options
GlaxoSmithKline
Staff and Team Lead, Onyx Application Engineering The Onyx Research Data Tech organization represents a major investment by GSK R&D and Digital & Tech, designed to deliver a step-change in our ability to leverage data, knowledge, and prediction to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Azure Cloud Architect

London, United Kingdom
Hybrid / WFH Options
Experis - ManpowerGroup
Role Title: Azure Cloud Architect Location: London Hybrid - 2-3 days onsite, Canary Wharf Start Date: 08/09/2025 End Date: 27/03/2026 Rate: £650 per day - PAYE via Umbrella Only Role Overview: We are More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Observability Site Reliability Engineer

London, United Kingdom
Apple Inc
capacity planning and experienced in managing and scaling distributed systems in a public, private, or hybrid cloud environment Experience with the Prometheus ecosystem and a good understanding of infrastructure observability principles Education & Experience BS/MS in Computer Science or Equivalent ( + in depth experience of software development or production operations experience in a large-scale environment More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Cloud Systems Test Engineer

Bristol, Gloucestershire, United Kingdom
Cerebras
area of the product component or the system in aggregate and at scale. Specific domains include Workload Management (Kubernetes, Ray, and so on); Cloud Development (Cloud Infrastructure Automation); Management & Observability (open source and commercial monitoring, observability and DCIM solutions) Skills and Experience Essential Strong relevant programming experience Python/Go/C infrastructure-as-code scripting or related to the … of the products under test: Containerisation (e.g. Docker), Virtualisation and Provisioning, Workload and job scheduling (e.g. Kubernetes, Ray) on high core-count machines and rack-scale installations, Management and Observability (e.g. Prometheus, OpenTelemetry, DataDog, Splunk, etc.). 10+ years of relevant experience related to quality assurance/testing teams. Experience with the Atlassian suite and CI/CD platforms such More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Frontend Developer

Manchester, United Kingdom
The Portfolio Group
in implementing good practice with regards to accessibility (Keyboard support, screen readers, form usability) Knowledge of various front-end architectural patterns E2E Testing experience (Cypress/Playwright) Experience with Observability as a practice (logging, GA tagging, TrackJS, App Insights) If you would be interested please apply below! INDMANS More ❯
Employment Type: Permanent
Salary: £45000 - £50000/annum
Posted:

Frontend Developer

Manchester, Lancashire, England, United Kingdom
The Portfolio Group
in implementing good practice with regards to accessibility (Keyboard support, screen readers, form usability) Knowledge of various front-end architectural patterns E2E Testing experience (Cypress/Playwright) Experience with Observability as a practice (logging, GA tagging, TrackJS, App Insights) If you would be interested please apply below! INDMANS More ❯
Employment Type: Full-Time
Salary: £45,000 - £50,000 per annum
Posted:

Dynatrace Engineer - SC Cleared

London, United Kingdom
Hybrid / WFH Options
Stott & May Professional Search Limited
are looking for an experienced Dynatrace Engineer to join an established IT Infrastructure team on a 6-month contract. This is a high-impact role focused on building scalable observability solutions using Dynatrace across multi-cloud environments (AWS & Azure), with a strong emphasis on automation and Infrastructure-as-Code using Terraform. You'll work cross-functionally with engineering, platform, and … Warrington, with some flexibility on hybrid working arrangements. Occasional on-call and out-of-hours work may be required, depending on project needs. Key Responsibilities: - Design, implement, and manage observability solutions using Dynatrace's full stack (RUM, Full-Stack Monitoring, Synthetic Monitoring, Log Management). - Use Terraform to build reusable infrastructure modules, avoiding manual configuration. - Translate business unit needs into … scalable and automated observability solutions. - Integrate Dynatrace with core AWS and Azure services, ensuring seamless monitoring across platforms. - Create self-service observability patterns, documentation, and tools to empower teams across the business. - Work with development teams to instrument apps for metrics, logs, and traces. - Provide technical leadership on observability best practices and drive platform-wide improvements. - Participate in agile workflows More ❯
Employment Type: Contract, Work From Home
Rate: £350 - £425 per day
Posted:

Dynatrace Consultant

Cambridge, Cambridgeshire, United Kingdom
WNTD
Cambridge (onsite travel required) Job Type: 12-Month Contract (Inside IR35) Experience Level: Mid to Senior Level Role Overview We are seeking an experienced Dynatrace Consultant to join our Observability Team on a 12-month engagement. This role is critical in driving the adoption and integration of Dynatrace across a complex enterprise environment. You will work closely with platform teams … application owners, and DevOps engineers to enable full observability, implement best practices, and ensure successful platform rollout as part of our new Center of Excellence initiative. Key Responsibilities Provide technical consulting and enablement to internal engineering teams for effective use of Dynatrace. Build dashboards, alerts, and service flow mappings aligned with application performance needs. Develop and optimize Dynatrace Query Language … DQL) queries for actionable insights. Support observability design and migration from tools such as Prometheus, Grafana, and AWS CloudWatch to Dynatrace. Advise on RBAC models, data access strategies , and security best practices for multi-team environments. Design monitoring strategies for Kubernetes workloads in hybrid cloud/on-prem environments. Promote observability-as-code using tools like Terraform and GitLab for More ❯
Employment Type: Contract
Rate: GBP Annual
Posted:

Global Head of Technical Account Management (TAM)

London, United Kingdom
Coralogix, inc
success across all regions. Partner closely with R&D, Customer Success, Product, Sales, and Support to drive holistic customer outcomes. Hands-On Technical Expertise Maintain hands-on fluency in observability tooling, logging infrastructure, and cloud environments. Act as a senior technical escalation point for complex deployments or architectural challenges. Provide in-depth technical guidance on customer environments, use cases, and … performance analytics. Collaborate on the development of tools and dashboards to ensure visibility and impact tracking. Requirements Technical Experience 10+ years of technical experience in Cloud DevOps, SaaS, or observability, with 5+ years in leadership roles. Strong hands-on experience with AWS, GCP, Azure, K8S, Terraform and observability tools: Prometheus, Grafana, OpenTelemetry, ELK, Splunk, Datadog, and similar. Proficiency with metrics … team members are encouraged to challenge the status quo and contribute to our shared mission. If you thrive in dynamic environments and are eager to shape the future of observability solutions, we'd love to hear from you. Coralogix is an equal opportunity employer and encourages applicants from all backgrounds to apply. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Customer Support Engineer - EMEA

London, United Kingdom
Chronosphere
Chronosphere Chronosphere is the observability platform built for control in the modern, containerized world. Chronosphere empowers customers to focus on the data and insights that matter by reducing data complexity, optimizing costs, and remediating issues faster. The observability platform reduces data volumes and associated costs by 60% on average while saving developers thousands of hours. Chronosphere's Fluent Bit-based … Telemetry Pipeline optimizes and simplifies observability and security log data. The product transforms logs at the source and routes them to any destination without lock-in. Recognized as a leader by major analyst firms, Chronosphere is trusted by the world's most innovative brands, including Snap, Robinhood, DoorDash, and Zillow. Learn more at Chronosphere.io . Follow at LinkedIn and X … years of experience in an engineering or support related role. Experience working directly with customers in a technical setting (preferably SaaS). Customer support experience ideally in the monitoring, observability, or data pipeline space. Experience with Kubernetes, Terraform, and significant consideration if you also have Prometheus experience. Technical understanding and experience with: Coding/SDLC, Linux, Cloud providers (AWS, GCP More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Junior Software Engineer

Basingstoke, Hampshire, United Kingdom
InfoSum
ll dig into logs, traces and code to explain behaviour, patch bugs or raise backlog stories when deeper product work is needed. Often these investigations will result in improving observability or stability of the platform. High-impact feature work. Between investigations we deliver focused enhancements and platform improvements that don't slot neatly into long-term road-maps. Because our … team's workload is unpredictable, delivery dates are flexible and scoped by the team. Platform observability & performance. Your team members continually raise the bar on monitoring, metrics and efficiency. Joining as our newest engineer, you'll pair with seasoned Go/TypeScript/Python devs, owning real tasks from week one. Expect a dynamic mix of bug hunting, green field … and basic cloud/Linux fundamentals. Curiosity and the confidence to ask questions in a fast-moving team. Nice-to-haves Exposure to Kubernetes, Docker or Terraform. Experience with observability stacks (Grafana, Prometheus, OpenTelemetry). Familiarity with Postgres. Interest in data-privacy, AdTech/MarTech or large-scale data processing. Familiarity with Kafka, gRPC or Apache Spark. As well as More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Director of Product Engineering

Oxford, England, United Kingdom
Hlx Life Sciences
of a secure, cloud-native SaaS platform Partner with Product, UX, and scientific teams to translate genomic needs into scalable software features Oversee full engineering lifecycle – infra, DevOps, QA, observability, and application layer Build and mentor a high-performing engineering team, setting standards and best practices Maintain regulatory alignment and readiness for healthcare/genomics SaaS products Drive innovation by … evolution What You Bring Proven software engineering leadership, including strategy, hiring, delivery, and technical oversight Deep experience building and scaling SaaS platforms (cloud-native, Kubernetes, Terraform, CI/CD, observability) Expertise in modern stacks (Python, TypeScript/Node.js, React) and major clouds (AWS, GCP, Azure, Oracle) Knowledge of security and privacy frameworks: RBAC, encryption, secure API design, identity/auth More ❯
Posted:

ITS Application Services (ENET) Engineer

London, United Kingdom
Hybrid / WFH Options
MARGO
and reliability across FX and Fixed Income platforms. Location: Hybrid, Central London Your mission: Deliver and maintain a world-class, low-latency trading environment - managing messaging middleware, network connectivity, observability tooling, and capacity planning to meet front-office demands. What you'll do: Connectivity & Monitoring Deploy and support global market connectivity (Solace/Tibco, multicast, TCP/IP) across 30+ … 29West, Tibco, LBM) in performance-sensitive environmentsHands-on packet analysis using Wireshark , tcpdump , Corvil (custom decoder skills a bonus)Scripting/automation with Python , Bash , or PowerShell Familiarity with observability platforms (ITRS Geneos, Prometheus, Grafana)Experience supporting real-time trading applications , feed handlers, matching enginesExcellent communication-able to translate technical metrics into business insights Nice to have: Exposure to FIX More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

Leeds, Yorkshire, United Kingdom
Hybrid / WFH Options
William Hill PLC
you. The Leeds-based, highly skilled SRE team are primarily managing the Kubernetes clusters within the organisation for multiple departments, and through a DevOps culture enabling those departments with observability and pipelines for their business applications. Their job is to guarantee system reliability, performance, and supportability with a strong engineering emphasis on building autonomous solutions that deliver value to end … to be. Please note the interviews for this role will be face-to-face in our central Leeds office. What you will be doing: Ensuring Reliability - Best in class Observability and Security, applying the Four Golden Signals, with appropriate Testing and Disaster Recovery Plans Improving Productivity - Automate rapid delivery through software delivery pipelines using Infrastructure as Code Maintaining and Developing … people who can support our ethos. To apply to this post, you will have: A base in Leeds with working experience of an incident response model and fluency with observability and monitoring (Prometheus, Grafana) Experience defining alerts and implementing dashboards from existing monitoring and logging data Relentless focus on customer experience with good understanding of security best practice Fluency in More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Director of Product Engineering

banbury, south east england, united kingdom
Hlx Life Sciences
of a secure, cloud-native SaaS platform Partner with Product, UX, and scientific teams to translate genomic needs into scalable software features Oversee full engineering lifecycle – infra, DevOps, QA, observability, and application layer Build and mentor a high-performing engineering team, setting standards and best practices Maintain regulatory alignment and readiness for healthcare/genomics SaaS products Drive innovation by … evolution What You Bring Proven software engineering leadership, including strategy, hiring, delivery, and technical oversight Deep experience building and scaling SaaS platforms (cloud-native, Kubernetes, Terraform, CI/CD, observability) Expertise in modern stacks (Python, TypeScript/Node.js, React) and major clouds (AWS, GCP, Azure, Oracle) Knowledge of security and privacy frameworks: RBAC, encryption, secure API design, identity/auth More ❯
Posted:

Senior Platform & Backend Engineer

Islington, London, United Kingdom
Elder HQ
with event sourcing. All of our systems are on Kubernetes and using the Google Cloud Platform. This role comes with the opportunity to take ownership of our GCP infrastructure, observability, and platform reliability, with a focus on ensuring our systems remain secure, scalable, and well maintained. We encourage collaboration and our engineers are involved in the full development lifecycle, from … engineers in building new APIs and data contracts to support new functionality Maintaining and evolving our cloud infrastructure (GCP, Kubernetes) to ensure high availability, security, and performance Managing service observability and reliability, including logging, metrics and alerting (we use Prometheus and Grafana) Handling database and service upgrades (e.g. MySQL, Kubernetes), secrets management and security best practices Taking ownership of platform … Solid understanding of security best practices across infrastructure and applications, including secrets management and credential rotation. Familiarity with infrastructure-as-code or automation tools is a plus Experience with observability tools (such as Prometheus and Grafana), service monitoring, and debugging in production environments A demonstrated interest in staying up-to-date with new technology, new frameworks, new languages and other More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer

London, United Kingdom
Omnea
with React & Material UI, Postgres, Hasura and AWS Serverless Technologies such as Lambda, DynamoDB and EventBridge - all managed via AWS CDK & SST. We use Sentry, Lumigo and LogRocket for observability and Github Actions for automated testing and deployment. End-to-end Ownership. You will be entrusted with end-to-end ownership of your projects. From product, design and architectural decisions … ideally AWS). You focus on having a high impact . You've spearheaded the engineering of critical systems before, working with best-in-class tooling in AWS, IaaC, observability and quality assessments. You want to discover the best ways to bring this to an early-stage startup. You know what good can look like . You understand what it … takes to build highly reliable & well architected products. You build with quality, observability & redundancy at the forefront. You're ready to get a lot done. You enjoy all aspects of building a product and are comfortable moving across the stack when necessary. You enjoy problem solving and thinking from first principals You're ready to pick up new skills and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior DevOps Engineer

London, United Kingdom
Hybrid / WFH Options
Tilt
you. You're excited about this opportunity because you will: ️ 0-3 months Drastically improve CI/CD pipeline speed to help us ship faster Level up monitoring and observability so we catch issues before users do Help transition from a legacy REST API to GraphQL infrastructure 3+ months Drive AWS cost optimisation and scaling Improve Docker builds Collaborate with … Terraform (or similar tools) in production environments to manage infrastructure as code Confident working with Docker and containerised environments Knowledgeable in how to set up and tune monitoring and observability tools, and care about catching issues before users do Understanding code well (especially TypeScript, JavaScript, or Python) and able to work closely with engineers, even if not writing production code More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Oracle-Site Reliability Engineer

Knutsford, Cheshire, United Kingdom
Barclays
as an Oracle Site Reliability Engineer to help us build and maintain resilient, high-performing systems in a fast-paced financial services environment. If you're passionate about automation, observability, and continuous improvement, we'd love to hear from you. To be successful as a Oracle Site Reliability Engineer, you should have experience with: Significant experience in Site Reliability Engineering … tools that support system setup and automation, such as Ansible, Puppet, or Chef. Experience designing and maintaining CI/CD pipelines to support seamless deployments. Knowledge of monitoring and observability tools such as Prometheus, Grafana, and the ELK stack. You may be assessed on the key critical skills relevant for success in role, such as risk and controls, change and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Oracle-Site Reliability Engineer

Great Houghton, Northamptonshire, United Kingdom
Barclays Bank PLC
as an Oracle Site Reliability Engineer to help us build and maintain resilient, high-performing systems in a fast-paced financial services environment. If you're passionate about automation, observability, and continuous improvement, we'd love to hear from you. To be successful as a Oracle Site Reliability Engineer, you should have experience with: Significant experience in Site Reliability Engineering … tools that support system setup and automation, such as Ansible, Puppet, or Chef. Experience designing and maintaining CI/CD pipelines to support seamless deployments. Knowledge of monitoring and observability tools such as Prometheus, Grafana, and the ELK stack. You may be assessed on the key critical skills relevant for success in role, such as risk and controls, change and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

United Kingdom
Arbor Education
we provide world-class resilience and performance across the platform. The remit and focus of the role is to advise on all aspects of site reliability including availability, scalability, observability and capacity planning. It's a broad and exciting role, so we're looking for someone up for a challenge - if you're an energetic and a collaborative Site Reliability … Core responsibilities Proactively monitor and analyse platform performance. Collaborate with engineering teams to address performance bottlenecks and ensure scalability. Assist engineering teams with implementing and reviewing SLOs Continually improve observability through monitoring and alerting, and dashboards, using tools such as DataDog or Prometheus for example. Work with other teams to ensure it is effective and provides full coverage. Ensure the More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Site Reliability Engineer (UK)

Edinburgh, United Kingdom
Devopshunt
other teams' working lives through automation of manual tasks Lead in making the deployment of Dayshape more scalable Increase our knowledge sharing of SRE across the organisation Improve the observability of Dayshape through reporting and tool creation Drive improvements to our Cost of Giving Service (COGS) Diagnose and triage escalated issues Support, mentor, and coach other members of the team … CI/CD etc. Writing and debugging Python code One or more IaC toolset proficiency e.g. Pulumi or Terraform. Designed and built infrastructure using Azure which takes into consideration: observability, alerting, uptime SLA's and SLO's and Azure DevOps pipelines. Be able to collaborate well with both engineering teams and colleagues in customer-facing teams. Be an excellent communicator More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AWS Lead Data Engineer

City of London, London, United Kingdom
HCLTech
HCLTech is a global technology company, home to more than 220,000 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by a broad portfolio of technology services and products. We work with More ❯
Posted:

AWS Lead Data Engineer

London Area, United Kingdom
HCLTech
HCLTech is a global technology company, home to more than 220,000 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by a broad portfolio of technology services and products. We work with More ❯
Posted:
Observability
10th Percentile
£59,250
25th Percentile
£68,750
Median
£80,000
75th Percentile
£99,750
90th Percentile
£135,000