Observability Jobs in England

376 to 400 of 615 Observability Jobs in England

Site Reliability Engineer

Bristol, Gloucestershire, United Kingdom
Hybrid / WFH Options
TwinStream
services. You will be working with multiple feature development teams and the BAU/Support team to define and evolve our cloud & on-prem infrastructure & delivery pipelines, improving system observability, demonstrating performance and capacity improvements and proactively identifying and mitigating reliability risks. Key Responsibilities of the Site Reliability Engineer: Collaborate with Software Engineers to improve reliability and performance in their … subsystems Partner with System Administrators in automating toil and eliminating alerts Evolve observability and monitoring capabilities to identify and solve problems before they impact the business Support development environments to help us achieve our delivery and quality goals Research and evaluate technologies, tools and services to influence buy-vs-build decisions Develop expertise in diverse technical and business domains Expand … in one of our platform languages (Java, Go, Python or similar) Knowledge of cross domain principles & technologies Experience of working in a service management environment Practical applications of using observability patterns in previous systems Creating and monitoring system availability metrics and using those to drive work that reduces downtime There are many great reasons to join our team! Pension Plan More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AWS DevOps Engineer

City of London, London, United Kingdom
Scrumconnect Consulting
Cloud Infrastructure: AWS (EKS, RDS, Aurora, ElastiCache, Kafka, IAM) Secure Hosting: Experience working with air-gapped or government-secure environments Container & Cluster Management: Docker, Kubernetes, Rancher, Jenkins, Helm Monitoring & Observability: Prometheus, Grafana, ELK Stack, Dynatrace Secrets & Identity Management: HashiCorp Vault, Keycloak CI/CD & DevOps Tooling: Jenkins, Git, ServiceNow, Trivy, Terraform Streaming & Messaging: Apache Kafka (including Kafka Replication) Data Layers … tooling and self-service developer pipelines for tenant teams. Proactively manage and resolve tech debt by working with central governance bodies and ensure visibility to the board. Increase automation, observability, and testing coverage across the platform components while enabling data-driven decision-making. Align delivery with the product roadmap, collaborating with internal/external platform and infrastructure teams to support More ❯
Posted:

AWS DevOps Engineer

London Area, United Kingdom
Scrumconnect Consulting
Cloud Infrastructure: AWS (EKS, RDS, Aurora, ElastiCache, Kafka, IAM) Secure Hosting: Experience working with air-gapped or government-secure environments Container & Cluster Management: Docker, Kubernetes, Rancher, Jenkins, Helm Monitoring & Observability: Prometheus, Grafana, ELK Stack, Dynatrace Secrets & Identity Management: HashiCorp Vault, Keycloak CI/CD & DevOps Tooling: Jenkins, Git, ServiceNow, Trivy, Terraform Streaming & Messaging: Apache Kafka (including Kafka Replication) Data Layers … tooling and self-service developer pipelines for tenant teams. Proactively manage and resolve tech debt by working with central governance bodies and ensure visibility to the board. Increase automation, observability, and testing coverage across the platform components while enabling data-driven decision-making. Align delivery with the product roadmap, collaborating with internal/external platform and infrastructure teams to support More ❯
Posted:

Lead DevOps Engineer

London, United Kingdom
Hybrid / WFH Options
Sprout.ai
Salary banding: £90,000 - £110,000 dependent on experience Working pattern: 1-2 days per week in office Location: London About our Engineering Team As a business which has AI at its core, we need to have a reliable, scalable More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Technical Programme Manager

Cambridge, Cambridgeshire, East Anglia, United Kingdom
Hybrid / WFH Options
La Fosse
infrastructure platform with AI-operable capabilities Oversee key infrastructure components such as data centre expansion, programmable compute, and software-defined network/storage Enable automation-first delivery models with observability, self-healing, and policy-driven control Implement and mature GitOps workflows, IaC pipelines, and CI/CD processes across engineering teams Lead programme governance, risk management, and stakeholder engagement Partner More ❯
Employment Type: Contract
Rate: £750 - 950 per day
Posted:

Senior Front End Developer

Manchester, Lancashire, England, United Kingdom
Searchability
skills, testing experience (React Testing Library), and familiarity with Tailwind CSS are essential. Nice-to-haves include Storybook component library work, Cloudflare Workers, E2E testing (Cypress or Playwright), and observability practices. Benefits Salary up to £65,000 25 days holiday + bank holidays + birthday off Friendly, forward-thinking team culture Agile workflows with real influence over technical direction Regular More ❯
Employment Type: Full-Time
Salary: £60,000 - £65,000 per annum
Posted:

Head of Delivery Enablement

Watford, Hertfordshire, England, United Kingdom
Method Resourcing
function integrated throughout the software development lifecycle. Partnering closely with product and engineering teams, you will help scope and estimate strategic work, align on tooling, and drive improvements in observability, automation, and testing. Ideal Experience & Skills Demonstrated technical leadership across diverse skillsets, including Site Reliability Engineering (SRE), DevOps, and Quality Assurance (QA) Proven track record of aligning and integrating cross More ❯
Employment Type: Full-Time
Salary: £90,000 - £95,000 per annum
Posted:

Head of Technical Services UK&I

England, United Kingdom
NCR Corporation
and market demands. • Vendor Management & Cloud Governance: Engage with external vendors, drive cloud governance initiatives, and make critical build vs. buy decisions to support platform scalability and operational efficiency. • Observability & Automation: Develop and execute a comprehensive observability and automation strategy that aligns with business objectives and enhances platform reliability. • Financial Management: Implement best practices for financial operations and cost governance … build and image deployments. • Hands-on experience with classic hosting technologies (e.g. Kubernetes, AWS) • Familiarity with telephony technologies such as SIP, session border controllers, and related components. • Familiarity with observability tools such as Prometheus, Grafana, and Loki. • Strong Experience in Microsoft technology stack • Proficiency in tools such as GitLab, Docker, Terraform, CI/CD, and various deployment architectures. • Strong understanding More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Engineering Product Engineer (Backend), Darwin London £70K - £130K

London, United Kingdom
V7 Labs
strong interest in ML and applying AI to solve problems Infrastructure management experience at scale; you should be comfortable in using AWS and terraform Proficiency in queue management and observability, utilizing RabbitMQ as our primary tool. Experience with orchestration tools like Docker Fluent in English More ❯
Employment Type: Permanent
Salary: GBP 70,000 - 130,000 Annual
Posted:

Head of Engineering (AI)

London, United Kingdom
Fuse Energy, LLC
science, product, and platform teams. What You'll Do Own the AI engineering roadmap and lead the development of AI-first features Productionize ML models, ensuring scalability, performance, and observability Design the infrastructure for deploying and maintaining ML systems in production (e.g., MLOps, CI/CD for ML, model versioning) Build systems that integrate AI into key parts of our More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Head of Engineering (AI) (London)

London, UK
Fuse Energy
data science, product, and platform teams. What Youll Do Own the AI engineering roadmap and lead the development of AI-first features Productionize ML models, ensuring scalability, performance, and observability Design the infrastructure for deploying and maintaining ML systems in production (e.g., MLOps, CI/CD for ML, model versioning) Build systems that integrate AI into key parts of our More ❯
Employment Type: Full-time
Posted:

Product Engineer (Backend) London, UK

London, United Kingdom
Hybrid / WFH Options
Granola inc
of our backend infrastructure Design and implement performant APIs and services Build infrastructure to support cutting-edge AI capabilities Optimise database performance and query efficiency Continuously improve reliability and observability through enhanced monitoring and alerting Collaborate cross-functionally to ensure our infrastructure supports continuous product innovation Your background looks something like: Engineering experience in tech and product-driven environments Strong More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Lead Data Architect (Governance)

London, United Kingdom
Hybrid / WFH Options
Booksy Inc
best practices It will also help you to have Experience establishing and enforcing data governance standards through technical architecture (not just documentation) Familiarity with data cataloging, metadata management, and observability tools A systems-thinking mindset-you understand the full data lifecycle and how to maintain integrity from source to dashboard At Booksy, we believe in the power of well-structured More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Full Stack Engineer (Editor Core) Engineering London

London, United Kingdom
Veed Limited
and know your way around Node.js backend frameworks You have solid experience designing and maintaining APIs , background workers, or async processing systems You have experience with performance optimization and observability You're comfortable working with infra basics (Docker, GCP, CI/CD) You care about code quality and testing What we offer Monthly subsidy programme: Different people have different needs More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AI Engineer

London, United Kingdom
Hybrid / WFH Options
nPlan limited
prototype new applications of AI for the construction domain, pushing the boundaries of what's possible Build core infrastructure that allows us to build LLM apps quickly - this includes observability, how we work with several LLM providers + our own fine tuned models Work with ML engineers and data scientists in our research team to bring new models and applications More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineer (London)

London, UK
Nory AI
managers and engineers to define KPIs, design data flows, and embed algorithms in product features Apply best-in-class ML practices with clear problem framing, model evaluation, experimentation, and observability- to deliver real-world impact Independently lead the development of models across the platform acting as the primary ML owner and act as the primary ML owner; from data access More ❯
Employment Type: Full-time
Posted:

Revenue Manager

London, United Kingdom
Elasticsearch B.V
results that matter. By taking advantage of all structured and unstructured data - securing and protecting private information more effectively - Elastic's complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI. What is The Role: We are looking for a dynamic Revenue Manager who is eager to collaborate and support significant growth at More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Staff Software Engineer Voice

London, United Kingdom
Hybrid / WFH Options
DeepL GmbH
on, contributing production code while guiding architectural decisions and mentoring the team Mentor and elevate: Grow engineering maturity through technical coaching, thoughtful code reviews, and driving best practices in observability, reliability, and scale Shape product direction: Work cross-functionally with product managers, researchers, and designers to translate customer problems into impactful technical solutions Scale voice infrastructure: Build systems that meet More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Engineering Manager, Martech

London, United Kingdom
Hybrid / WFH Options
Etsy
in mind through thinking thrift. Bonus points if: You are familiar with any one of the following technologies : Scala, Python Swift ( iOS), React or Typescript. You are familiar with observability, tracking and data pipeline tools and methodologies Additional Information Health + Mental Wellbeing PMI and cash plan healthcare access with Bupa Subsidised counselling and coaching with Self Space Cycle to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Solutions Architect

London, United Kingdom
hyperexponential
skills and experience (ideally Python, and/or Rust, Go, Kotlin, Java, etc) Sound technical knowledge, ideally across multiple technical competencies and levels (e.g APIs, networking, databases, security, compliance, observability, architecture) Excellent communication skills (written, graphical, remote, in-person, presentation, one:one, one:many) with the ability to engage, influence, and inspire stakeholders and colleagues to drive collaboration and alignment More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
NinjaOne, LLC
SRE team in the Platform Engineering organization and help us scale our products to millions of end-users. We are looking for individuals with a passion for automation and observability, ensuring the quality and availability of our services. Location - We are flexible on remote working from home, if you are based in the UK or Germany. On Call Requirements - Participate … our 24x7 on-call rotation, SCRUM, and deployment planning Perform Root Cause Analysis (RCA) and provide recommendations for application teams Improve availability and reduce customer impact using Industry best observability tools Ensure best-practice and security-minded architecture by influencing design decisions Create and maintain technical documentation and SOP's Develop software, scripts, or tooling to improve efficiency and reduce … experience in Site Reliability Engineer roles 3+ years' experience with an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Lead/Staff Data Platform Engineer

London, United Kingdom
Gorilla
efficiently and securely, using modern tooling such as Terraform, Docker, and cloud-native patterns. Drive cross-team initiatives , helping teams adopt and scale platform capabilities (e.g., CI/CD, observability, workflow orchestration) with a focus on reliability, security, and performance. Collaborate with stakeholders (data engineering, backend, DevOps, product) to identify platform gaps and design scalable, long-term solutions that unlock … with Terraform and infrastructure-as-code practices. Familiarity with tools like Airflow or DBT , and data platforms such as Snowflake or Databricks . Solid experience with CI/CD, observability, and platform reliability practices in cloud-native environments. Understanding of distributed computing concepts , and experience designing systems for scale, security, and availability . A proactive, collaborative mindset and demonstrated ability More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer, Fleet

Fleet, Hampshire, United Kingdom
Hayden AI Technologies, Inc
global transportation agencies. As a senior engineer, you will play a critical role in designing, building, and scaling cloud services that enable remote device management, over-the-air updates, observability, and high-availability operations for our mobile perception platform. We tackle complex challenges related to scalability, performance, and security to enable smarter and safer cities through cutting-edge innovation. As … future of intelligent transportation systems. Responsibilities: Participate in incident prevention, response, and remediation efforts, learning and applying best practices. Design, build, and maintain scalable cloud services that support device observability, OTA updates, and fleet operations. Lead efforts to improve the reliability, security, and performance of multi-region AWS infrastructure using Infrastructure as Code (IaC) tools. Own CI/CD pipelines More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Software Engineer in Test London

London, United Kingdom
Hybrid / WFH Options
Red Badger Consulting Limited
Red Badger's high-performing engineering teams. In this role, you'll define and drive quality strategy for platform and infrastructure-level products - from container orchestration and microservices to observability tooling and CI/CD pipelines. This is a hands-on engineering position within cross-functional teams where quality is everyone's responsibility but you'll lead the way in … leads to ensure the platform behaves reliably under real-world conditions Be a Technical Leader in Quality Engineering Establish standards and practices for testing distributed, event-driven systems Enable observability-driven debugging by working closely with platform and service teams Automate validation of operational characteristics like availability, latency, throughput and recoverability Contribute to security posture through continuous validation of access … and fault-tolerant design in distributed systems Hands-on experience with infrastructure-as-code (Terraform), Kubernetes, cloud-native platforms (AWS), service meshes, CI/CD (e.g. GitHub Actions) and observability tooling (e.g. OpenTelemetry, Grafana) Strong programming skills in a modern backend language (e.g. Kotlin, Java, Go, or Python) with a test automation mindset Familiarity with resilience patterns, chaos testing, synthetic More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Development Engineer - Site Reliability Engineer

Nottingham, Nottinghamshire, United Kingdom
Hybrid / WFH Options
Capital One (Europe) plc
engineering solutions to make them more efficient, stable, and scalable. You'll lead on planning and implementing key SRE initiatives, optimise and automate how our systems operate, and improve observability through better monitoring and logging. You'll also work closely with your peers to drive consistency and high standards across SRE and the wider engineering community, so a real enthusiasm … vision set out by your Site Reliability Engineering Manager (SREM). Contribute to the major optimisation and improvement themes within the team. Identifying opportunities to reduce operational overheads through observability and service automation. Drive engineering best practice (e.g., Operational Excellence, Security, Quality, Resilience etc.) and set standards across the team and wider SRE community. Innovate within your team and contribute More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
Observability
England
10th Percentile
£57,500
25th Percentile
£70,000
Median
£80,000
75th Percentile
£99,500
90th Percentile
£120,000