451 to 475 of 761 Observability Jobs in the UK

Senior Platform Engineer

Hiring Organisation: SF Partners
Location: Nationwide, United Kingdom
Employment Type: Permanent
Salary: £75000 - £110000/annum

Developing Infrastructure as Code using Terraform - Creating CI/CD pipelines and automation - Building Internal Developer Platforms (IDPs) and self-service tooling - Improving observability, reliability and platform performance - Working closely with architects and engineering teams on large-scale transformation programmes To be suitable, you must have experience with most … following: - AWS - Kubernetes - Terraform - Linux - Git - CI/CD - Infrastructure as Code - Platform Engineering or DevOps Desirable but not essential - Experience with GitOps, Observability (Grafana, Prometheus, OpenTelemetry), SRE or DevSecOps. What's on offer? - £75,000 - £110,000 depending on experience and location - Hybrid working (upto 30% onsite, 70% working ...

Senior Site Reliability Engineer

Hiring Organisation: VIQU IT
Location: United Kingdom, Morley, West Yorkshire
Employment Type: Permanent
Salary: £65000 - £75000/annum

improvements across platform reliability, automation and infrastructure as code. Lead the implementation of CI/CD best practices to improve software delivery. Enhance monitoring, observability and incident management across cloud environments. Collaborate with engineering teams to improve performance, resilience and operational efficiency. Mentor and coach engineers, promoting SRE and DevOps … . Experience building and maintaining CI/CD pipelines. Knowledge of containerisation technologies such as Kubernetes, Amazon EKS or ECS. Experience with monitoring and observability tooling such as Grafana, Prometheus, OpenSearch or similar. Strong understanding of cloud security, resilience and infrastructure automation. Previous experience mentoring engineers or providing technical leadership. ...

Principal Support Engineer – Customer Reliability & Escalations Engineering

Hiring Organisation: Jobleads-UK
Location: Ingram, England, United Kingdom

application diagnostics. Partner with Engineering teams to ensure permanent corrective actions are implemented, validated, and communicated. Identify recurring issues and drive improvements in reliability, observability, automation, and operational efficiency. Serve as a trusted technical advisor during executive‐level customer escalations and critical business events. Mentor engineers, influence technical priorities … services, distributed systems, cloud platforms, databases, and microservices architectures. Hands‐on experience with AWS, Azure and/or GCP, Kubernetes, containers, Linux, and modern observability tools such as Datadog, Dynatrace, Splunk, Grafana, New Relic, AppDynamics, or Elastic. Demonstrated ability to conduct complex root cause investigations across multiple technical domains ...

Senior DevOps Engineer

Hiring Organisation: Halian Technology Limited
Location: Reading, Berkshire, South East, United Kingdom
Employment Type: Permanent, Work From Home
Salary: £95,000

reliability, and availability Implement self-service tooling to empower development teams Drive DevOps best practices across the digital product lifecycle Develop and enhance monitoring, observability, and incident response processes Support global engineering teams delivering high-traffic platforms Key Requirements Proven experience supporting digital product delivery in a DevOps or platform … with Infrastructure as Code (Terraform, Ansible, Puppet or similar) Hands-on experience with Kubernetes, Docker, and cloud platforms (AWS preferred) Experience with monitoring/observability tools (Prometheus, Grafana, ELK, APM tools) Solid understanding of system performance, scalability, and resilience Strong collaboration and communication skills within cross-functional product teams Desirable ...

DevOps Engineer

Hiring Organisation: Eligo Recruitment Ltd
Location: Manchester, Stockport, United Kingdom
Employment Type: Permanent
Salary: £70000 - £80000/annum

Building and maintaining Infrastructure as Code using Terraform Automating infrastructure provisioning and deployment pipelines Managing Kubernetes and containerised workloads Implementing monitoring, logging and observability solutions Driving platform reliability, security and best practices Collaborating with engineering teams to improve developer experience Skills & Experience Essential: Strong commercial experience with Google Cloud Platform … container technologies Experience with Linux and scripting (Bash, Python or Go) Understanding of networking, IAM and cloud security principles Experience with monitoring and observability tooling Desirable: Experience with GitOps practices Knowledge of Prometheus, Grafana or similar tools Experience in a platform engineering or SRE environment Certifications in GCP are advantageous ...

Site Reliability Engineer (AWS)

Hiring Organisation: Spectrum It Recruitment Limited
Location: Birmingham, West Midlands, United Kingdom
Employment Type: Permanent, Work From Home

issues and restoring services quickly and effectively Developing automation to reduce manual operational tasks and improve platform resilience Building and improving monitoring, alerting and observability across cloud environments Working alongside Software, Platform, Cloud and Security Engineers to improve reliability and operational excellence Contributing to post-incident reviews and driving continuous … with exposure to: Linux systems administration AWS cloud infrastructure Kubernetes and Docker Production support and incident management Python, Bash or Go scripting Monitoring and observability platforms such as Grafana, Prometheus, Datadog, Splunk or CloudWatch Networking fundamentals including DNS, TCP/IP and load balancing A passion for automation, continuous improvement ...

Engineering Lead

Hiring Organisation: Hays
Location: Cheshire, North West, United Kingdom
Employment Type: Contract, Work From Home
Contract Rate: Up to £500.0 per day + Inside IR35

external systems. Collaborate with architects, product teams, vendors, and business stakeholders to ensure successful solution delivery. Drive best practices across software engineering, DevOps, observability, resiliency, and operational excellence. Conduct architecture reviews, code reviews, and technical design assessments. Provide technical mentoring and hands-on guidance to engineering teams. Contribute directly … while leading multiple engineering teams. Desirable: Experience within Banking, Financial Services, or large-scale enterprise transformation programmes. Experience with Docker and Kubernetes. Knowledge of observability, monitoring, and site reliability practices. Experience supporting geographically distributed engineering teams. Exposure to AI-enabled engineering tools, automation frameworks, or developer productivity tooling. What ...

Principal Software Development Engineer

Hiring Organisation: Jobleads-UK
Location: Manchester, England, United Kingdom

/CD pipelines, infrastructure as code, automation frameworks, and database-as-code practices using Redgate Flyway. Own critical customer systems, ensuring operational resilience, observability, performance optimisation, and rapid incident response. Collaborate with Product, Delivery, Operations, and Commercial teams to shape technical solutions, delivery plans, and strategic outcomes. Promote secure … Connect or Genesys Cloud. Proven ability to design and deliver secure, scalable, and resilient cloud-native solutions within complex enterprise environments. Strong understanding of observability, operational support, reliability engineering, and end-to-end ownership practices. Knowledge of regulated financial services environments, including UK GDPR and FCA Consumer Duty requirements. Excellent ...

Remote Principal Cloud Infrastructure Engineer

Hiring Organisation: Atreides Caseri Inc
Location: Oxford, Oxfordshire, UK
Employment Type: Full-time

load balancing, and network security controls. Build, maintain, and document infrastructure templates and developer enablement tooling to allow teams to deploy independently. Implement observability and monitoring systems using Grafana, Prometheus, and Loki for infrastructure and application metrics. Establish and contribute to CI/CD best practices using GitHub Actions … Docker) and maintaining image registries and artifact repositories. Solid understanding of networking and security fundamentals (VPNs, firewall rules, IAM policies, encryption). Familiarity with observability and alerting stacks (Prometheus, Grafana, Loki). Excellent communicator, able to write and present technical designs and proposals clearly. Experience mentoring others and leading ...

Remote Principal Cloud Infrastructure Engineer

Hiring Organisation: Atreides Caseri Inc
Location: Rochdale, Greater Manchester, UK
Employment Type: Full-time

Remote Principal Cloud Infrastructure Engineer

Hiring Organisation: Atreides Caseri Inc
Location: Hull, East Yorkshire, UK
Employment Type: Full-time

Remote Head of DevOps

Hiring Organisation: 1inch
Location: Inverness, Highland, UK
Employment Type: Full-time

secrets management and infrastructure provisioning. Service Reliability: Define and implement SLIs, SLOs, and SLAs to ensure the stability and performance of critical services. Observability & Incident Management: Oversee the full monitoring stack and establish formal incident response and post-mortem processes. Security & Compliance: Enforce "security by design" and maintain strict adherence … GitOps practices. Cloud Architecture: Proficiency in managing multi-cloud environments (AWS, Hetzner, GCP). Technical Depth: Strong understanding of service mesh and microservices architecture. Observability: Solid experience with observability tools for metrics, logging, and tracing. Security Mindset: Practical experience implementing security best practices within CI/CD. Communication: Ability ...

Remote Head of DevOps

Hiring Organisation: 1inch
Location: Telford, Shropshire, UK
Employment Type: Full-time

Remote Head of DevOps

Hiring Organisation: 1inch
Location: Wrexham, Wales, UK
Employment Type: Full-time

Remote Head of DevOps

Hiring Organisation: 1inch
Location: Craigavon, Co. Armagh, UK
Employment Type: Full-time

Remote Head of DevOps

Hiring Organisation: 1inch
Location: Rotherham, South Yorkshire, UK
Employment Type: Full-time

Remote Head of DevOps

Hiring Organisation: 1inch
Location: Bury, Greater Manchester, UK
Employment Type: Full-time

SRE - Site Reliability Engineer - Observability & Performance

Hiring Organisation: Sanderson Recruitment
Location: Bristol, Avon, South West, United Kingdom
Employment Type: Contract
Contract Rate: £550 - £600 per day

Observability and Performance Up to £600 per day outside IR35 6 month initial contract Bristol - Largely remote I'm currently working with a client who is looking for an SRE to implement and enhance observability across Java applications, middleware and Linux infrastructure using Grafana. The role is focused on monitoring … monitoring, alerting and instrumentation. The environment is currently hosted on traditional infrastructure, with an AWS migration planned, offering the opportunity to develop cloud-ready observability, automation and operational capabilities as the platform evolves. Essential Skills: Strong hands-on experience in DevOps, SRE, Platform Engineering or Systems Engineering environments. Expertise ...

Senior Frontend Engineer (React+TypeScript) - Remote

Hiring Organisation: Jobleads-UK
Location: United Kingdom

Senior Product Engineer, Frontend (React, TypeScript) based in the United Kingdom. You will shape frontend architecture and product experience for an AI-powered observability platform, delivering high-performance interfaces and data visualizations. You will collaborate with product, design, and backend teams, mentor peers, and leverage AI tools to accelerate delivery ...

Senior AI/ML Solutions Architect (GenAI & MLOps)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Field Engineering team to design production‐grade AI solutions on the Databricks platform. You will drive GenAI initiatives, RAG architectures, agentic systems, AI observability, and NLQ of structured data, while mentoring peers and influencing the platform roadmap. Some travel may be required. #J-18808-Ljbffr ...

MLOps Engineering Manager — Lead Scalable ML (Hybrid)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Trainline in London is seeking an experienced MLOps Engineering Manager to build and lead a new team of engineers. You will shape deployment, observability, and scalable machine learning systems across the platform. You will collaborate with ML Engineers, Data Engineers, Software Engineers, Data Scientists, Product Managers and stakeholders to deliver ...

Data Platform Engineering Lead — Snowflake & Azure

Hiring Organisation: Jobleads-UK
Location: Bexhill-on-Sea, England, United Kingdom

delivery across projects and BAU work, while collaborating with CIO teams to drive end‐to‐end outcomes. You will shape data quality, security, and observability, balancing budget and resources with growth goals, and fostering cross‐functional engagement across the business. #J-18808-Ljbffr ...

Lead C++ & Java Engineer for Low-Latency FX

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

translate business requirements into robust technical solutions, while mentoring teams and driving architectural direction. The role emphasizes adherence to SDLC, performance, security, and observability, with opportunities to influence platform standards. #J-18808-Ljbffr ...

Principal Engineer, CoinDesk Data Engineering

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

operational best practices, fostering a culture of technical curiosity and ownership. Champion Cross-Cutting Initiatives: Identify and lead engineering-wide improvements in areas like observability, developer tooling, and testing strategies to increase performance and reliability across all services. What You’ll Bring Principal-Level Experience: 8+ years in backend development … throughput workloads. Full Lifecycle Ownership: A strong "DevSecOps" mindset with expertise in building and maintaining CI/CD pipelines, infrastructure-as-code, and robust observability (monitoring, logging, tracing) for production systems. Quality as a Feature: A deep commitment to quality, demonstrated by implementing comprehensive testing strategies (unit, integration ...

Public Sector AI-SaaS Account Executive

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

education sector. You will own the full sales cycle—from cold outreach to close—driving adoption of Elastic's AI-powered search, observability, and security solutions across new mid-market and health customers. You will craft tailored value narratives that connect Elastic's capabilities to measurable outcomes like revenue growth ...