776 to 800 of 1,265 Observability Jobs

Senior DataOps Engineer – Databricks Platform

Hiring Organisation
Unisys
Location
City of London, London, United Kingdom
promotion strategies across development, testing, and production Reduce manual deployment activities and operational risk Integrate source control and modern software engineering practices Platform Reliability & Observability Develop monitoring, alerting, and operational dashboards Improve platform resilience, stability, and recoverability Design solutions for failure handling, rollback, and operational recovery Support platform performance optimisation … infrastructure automation Proven experience building CI/CD frameworks for complex cloud platforms Strong Python skills for automation and tooling Experience implementing monitoring, observability, and operational support capabilities Solid understanding of cloud security, access control, and governance principles Strong software engineering fundamentals and automation mindset Nice-to-have: Enterprise-scale ...

Site Reliability Engineer

Hiring Organisation
Oliver Bernard
Location
United Kingdom
hire a mid-level Site Reliability Engineer into a newly created role. This is a true SRE position with a strong focus on observability, incident management and production operations, working closely alongside development and platform teams to improve reliability and performance across a high-scale cloud environment. For this opportunity … with Terraform (building modules, not just consuming templates) CI/CD work with GitHub and/or GitLab Strong history of Monitoring and Observability (with Prometheus and Datadog) Solid understanding of incident management and response Experience operating within high-scale production environments The business is heavily investing ...

Principal Site Reliability Engineering Expert Director

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
shaping how reliability, automation, and operational excellence are engineered across the organisation. Operating across domains including traditional infrastructure, cloud engineering, network operations, identity, observability, security, AI-driven operations, and automated data workflows, the role focuses on designing scalable systems, reusable engineering patterns, and standardised controls that reduce operational toil, improve … first, measurable, and repeatable practices. A key part of the role is building and evolving reusable CI/CD and Terraform modules, engineering guardrails, observability patterns, and automation frameworks that can be adopted across multiple teams and domains without requiring each team to solve the same problems independently. The Principal ...

Site Reliability Engineer (SRE)

Hiring Organisation
Pertemps Reading
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£45,000
platform automation, CI/CD, and developer tooling. This is a hands-on role split between supporting engineers and building scalable infrastructure, automation, and observability solutions. Youll work closely with the Head of Technology and engineering teams to improve reliability, developer experience, and platform performance. What Youll Be Doing Developer … Build reusable Terraform modules and manage infrastructure-as-code standards Develop internal tooling, automation scripts, self-service tooling, and platform improvements Own and improve observability across monitoring, dashboards, alerting, and runbooks Identify opportunities to automate manual processes and improve platform reliability Contribute to scalable, maintainable, and secure infrastructure practices What ...

AWS Platform Architect

Hiring Organisation
Oscar Associates (UK) Limited
Location
Birmingham, West Midlands, United Kingdom
Employment Type
Permanent
platform architecture and modernisation roadmap, including migration from a Java monolith to microservices on EKS. Define standards for containers, runtime environments, observability, tenancy, security, and infrastructure automation. Lead SRE practices including SLI/SLOs, incident management, DR/BCP planning, post-mortems, and operational resilience. Own platform security, secure SDLC … networking, KMS, RDS, and multi-account architecture. Hands-on Kubernetes, CI/CD, Terraform, and cloud security experience. Strong understanding of SRE, observability, incident response, and disaster recovery. Experience operating within regulated environments such as ISO 27001, SOC 2, or GxP. Comfortable balancing strategic leadership with hands-on operational delivery. ...

Forward Deployed Engineers

Hiring Organisation
Randstad Digital
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£450 - £500 per day + Inside IR35
implement AI agents, including: Retrieval (RAG) Orchestration workflows Tool/function invocation Policy-based routing Build evaluation frameworks for accuracy, latency, and reliability Implement observability and monitoring for agent lifecycle AI Platform Integration Integrate with AI providers (e.g., OpenAI, Anthropic, Google Vertex, open-source models) Build abstraction layers to support … production (agents, RAG, orchestration) Proficiency in Python, Java, or similar backend languages Experience with: CI/CD pipelines Infrastructure as code Monitoring and observability tools Hands-on experience with AI platforms (OpenAI, Claude, Vertex AI, or similar) Preferred Experience Experience with agent frameworks (e.g., LangGraph, AutoGen, CrewAI) Experience designing multi ...

Senior DevOps Engineer JBLE1 NI

Hiring Organisation
MCS Group
Location
Belfast, UK
Driving the migration of legacy systems into cloud-native architectures Developing and maintaining infrastructure-as-code using Terraform or comparable tooling Improving system reliability, observability, and cost-efficiency across environments Modernising CI/CD practices to enable safe, rapid delivery across multiple environments Supporting incident response and improving operational visibility … Strong scripting skills in Bash, Python, Go, or similar Useful but not essential: Experience with Ansible, Cloudflare, PostgreSQL, or MySQL in production Familiarity with observability platforms such as Datadog, Splunk, or ELK Exposure to multi-tenant SaaS architectures Experience leading or contributing to cloud migration projects The details: Location: Belfast ...

Cloud Engineer

Hiring Organisation
Spectrum It Recruitment Limited
Location
Southampton, Hampshire, South East, United Kingdom
Employment Type
Permanent
Salary
£65,000
Terraform to automate and standardise infrastructure delivery. You'll support the migration and modernisation of traditional infrastructure into cloud services. You'll improve monitoring, observability, security and resilience across cloud platforms. You'll work with engineering, infrastructure and business teams to turn requirements into practical cloud solutions. You'll contribute … teams and wider stakeholders Useful: Cloud migration experience Azure DevOps and YAML pipelines PowerShell, Python or Bash scripting Docker or containerised environments Monitoring and observability tooling Experience in regulated or customer-critical environments Why apply? This is a good opportunity for a Cloud Engineer who wants to work on meaningful ...

Cloud Native DevOps Engineer

Hiring Organisation
Anson McCade
Location
England, United Kingdom
scalability are all core design considerations. The brief will suit someone comfortable operating across infrastructure engineering, platform automation, CI/CD, container platforms, and observability, while working closely with technical and non-technical stakeholders in Agile delivery settings. Employer Overview The employer is a major global technology and transformation organisation … Implement and optimise CI/CD pipelines to support secure, reliable, continuous delivery for critical applications - Monitor system health, performance, and security using modern observability and logging tooling - Work in Agile delivery teams, engaging stakeholders to translate requirements into iterative platform and infrastructure improvements Candidate Profile/Technical Skillset - Proven ...

Lead Engineer Jobs in UK 2026 | Quick Hiring

Hiring Organisation
Jobleads-UK
Location
Swansea, Wales, United Kingdom
with architects, product teams, and government stakeholders. You will also lead technical roadmap planning and prioritize engineering initiatives across teams. Furthermore, you will drive observability practices including logging monitoring and system performance analysis. About Scrumconnect Consulting Scrumconnect Consulting delivers digital transformation services for UK public sector organisations. The company supports … secure design principles. Support modernisation of legacy systems using incremental migration strategies approaches. Participate in code reviews and ensure high quality engineering output. Drive observability practices including logging monitoring and system performance analysis. Job Requirements Bachelor or Master degree in Computer Science or related field required. Strong experience leading engineering ...

DevOps Engineer

Hiring Organisation
Coltech
Location
City of London, London, United Kingdom
Drive automation and DevOps best practices across networking services. Participate in troubleshooting and resolving complex networking and infrastructure issues. Contribute to platform reliability, security, observability, and operational excellence. Support continuous improvement initiatives and cloud platform modernisation programmes. Participate in an on-call support rota where required. Required Skills & Experience Strong … Python and/or Bash scripting for infrastructure automation. Familiarity with IPAM, Infoblox, proxies, identity platforms, and network security controls. Experience with monitoring and observability platforms such as Dynatrace and Google Cloud Monitoring. GCP Professional Cloud Network Engineer or related certifications. Candidate Profile The ideal candidate is a proactive ...

Senior Platform Engineer

Hiring Organisation
AJ Bell
Location
Salford, Lancashire, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
evolving our core engineering platforms, including: Backstage and internal developer portal capabilities Engineering data platforms, including ELT workflows, DBT and SQL-based data pipelines Observability and monitoring Grafana platforms Internal automation and workflow platforms that support software delivery and engineering operations You’ll also contribute to broader platform engineering initiatives … Strong understanding of cloud platforms, containerisation and infrastructure as code Experience building self-service tooling, templates and developer enablement capabilities Experience with monitoring and observability Good understanding of security best practices in software delivery and platform design Strong problem-solving, communication and collaboration skills Ability to provide technical leadership, mentor ...

Senior Software Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Hasura and AWS Serverless Technologies such as Lambda, DynamoDB and EventBridge - all managed via AWS CDK & SST. We use Sentry, Lumigo and LogRocket for observability and Github Actions for automated testing and deployment. End-to-end Ownership. You will be entrusted with end-to-end ownership of your projects. From … having a high impact . You've spearheaded the engineering of critical systems before, working with best-in-class tooling in AWS, IaaC, observability & quality assessments. You want to discover the best ways to bring this to an early-stage startup. You know what good can look like . ...

Senior Software Engineer

Hiring Organisation
Cue
Location
United Kingdom
architectures in production Strong Linux, networking, and infrastructure fundamentals (SRE/platform background ideal) Hands-on experience with containers, Kubernetes, CI/CD, and observability Fluency reading and writing code — you don't need to be the fastest coder, but you need to understand code deeply Active … InfluxDB Networking : Traefik ingress with NLBs, ExternalDNS, cert-manager, Route53, CloudFront, VPC peering Security : GuardDuty, Security Hub, Elasticsearch SIEM, CrowdSec WAF, Firezone (Zero Trust) Observability : Grafana, CloudWatch, Fluent Bit, Goldilocks CI/CD : ArgoCD, GitHub Actions (OIDC), Atlantis What we value over credentials We don't care about your degree ...

Senior Software Engineer II - Data Engineering

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
ensure technical consistency.* Design, develop, and maintain generative AI services and reusable components using Python.* Define and promote best practices in engineering, including scalability, observability, testing, and CI/CD.* Contribute to system designs spanning multiple services and modules, aligning with architectural best practices.* Collaborate with product, platform, and research … work collaboratively across functions in an Agile or Kanban environment.**Nice to have:*** Experience operationalizing LLMs or building an internal AI platform.* Familiarity with observability practices (metrics, logging, alerts).* Exposure to knowledge graphs or semantic search systems.Join our team and contribute to a culture of innovation, collaboration, and excellence. ...

System Monitoring & Observability Engineer (Prometheus / Grafana)

Hiring Organisation
SRT Marine Systems PLC
Location
Cardiff, South Glamorgan, United Kingdom
Employment Type
Permanent
Salary
£40000 - £65000/annum
work, where talented, hard-working individuals have the opportunity to make a real impact across the marine industry. Role overview of our System Monitoring & Observability Engineer (Prometheus/Grafana) You as a System Monitoring & Observability Engineer (Prometheus/Grafana) here at SRT, you will be part of a small team … tasked with implementing an end-user observability visualisation. Currently, we have observability dashboards in place for our engineers, utilising Prometheus for metrics collection and Grafana for visualisation. This initiative aims to deliver a more user-friendly solution tailored for our end-users. Our clients are located across various countries worldwide ...

Platform Engineer

Hiring Organisation
hireful
Location
London, United Kingdom
Employment Type
Permanent
Salary
£80000 - £85000/annum £80,000 - £85,000 + 10% Bonus + Bene
We are recruiting founding Platform Engineers on behalf of a fast-growing enterprise level (global, 500+ staff) software business with a strong engineering culture and a genuine commitment to doing things the right way. They ...

Senior Software Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Job Title Senior Software Engineer Responsibilities Directly contribute to development and continuous improvement of our products and platforms, focusing on adaptable and resilient solutions. Design new features or refine existing systems, ensuring they are robust ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
North London, UK
Employment Type
Full-time
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, ... LFWQ1_UKTJ ...

Performance and Monitoring Engineer

Hiring Organisation
17918
Location
London, United Kingdom
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, ... WKCL1_UKTJ ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
London, United Kingdom
Employment Type
Permanent
Salary
GBP 50,000 Annual
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, click apply for full ...

EMEA Regional Sales Director — AI SaaS Growth Leader

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
leading AI observability platform is seeking a Sales Director for the EMEA region. You will lead and scale a high-performing sales team to drive revenue and new customer acquisition. The ideal candidate has proven sales leadership in high-growth SaaS or AI companies, with a track record of meeting ...

SRE - Contract

Hiring Organisation
PIXELCODE TECHNOLOGIES LIMITED
Location
East End, Hampshire, UK
Employment Type
Full-time
Consultant on a contract basis(INSIDE IR35) with strong expertise in Dynatrace implementation. The ideal candidate should have hands-on experience designing and deploying observability solutions across complex enterprise environments, with deep expertise in Dynatrace xxuwjjq architecture, integrations, alerting, dashboarding, and troubleshooting distributed s... Remember to check your CV before ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
Westminster, Greater London, UK
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity xkybehq to make a real impact across a large, Increase your chances ...

Linux Support Engineer - Nvidia/GPU workload experience essential

Hiring Organisation
Swisstech Recruitment
Location
United Kingdom
Employment Type
Contract
Contract Rate
GBP 350 - 500 Daily
Data Centre, Infrastructure Support, RMAs, Platform upgrades etc.) - Provide technical expertise and contribute to the build out and configuration of our internal observability platform. - Create and improve documentation around key operational activities. - Identify and drive improvements in performance, stability, and security. The successful candidate must have experience running Nvidia/ ...