Observability Jobs in England

201 to 225 of 241 Observability Jobs in England

Operational Resilience Lead

Yorkshire, United Kingdom
Whitehall Resources Ltd
backlog prioritisation, and stakeholder alignment. - Product Ownership: Define tool requirements, maintain product backlog, prioritise features. - Lead integration of the tool with other enterprise practices and tooling (eg, SDLC, OMAR, Observability, Architecture, Service Management). - Coordinate with business and technical stakeholders to align functionality with adoption objectives. All of our opportunities require that applicants are eligible to work in the specified More ❯
Employment Type: Contract
Rate: GBP Annual
Posted:

Operational Resilience Lead

Sheffield, Yorkshire, United Kingdom
Hybrid / WFH Options
eTeam Workforce Limited
alignment. Responsibilities include: Product Ownership: Define tool requirements, maintain product backlog, prioritise features. Integration Orchestration: Lead integration of the tool with other enterprise practices and tooling (eg, SDLC, OMAR, Observability, Architecture, Service Management). Stakeholder Engagement: Coordinate with business and technical stakeholders to align functionality with adoption objectives. Governance Alignment: Ensure tooling supports reporting requirements and If you are interested More ❯
Employment Type: Contract
Rate: GBP Daily
Posted:

Global Head of Technical Account Management (TAM)

London, United Kingdom
Coralogix, inc
success across all regions. Partner closely with R&D, Customer Success, Product, Sales, and Support to drive holistic customer outcomes. Hands-On Technical Expertise Maintain hands-on fluency in observability tooling, logging infrastructure, and cloud environments. Act as a senior technical escalation point for complex deployments or architectural challenges. Provide in-depth technical guidance on customer environments, use cases, and … performance analytics. Collaborate on the development of tools and dashboards to ensure visibility and impact tracking. Requirements Technical Experience 10+ years of technical experience in Cloud DevOps, SaaS, or observability, with 5+ years in leadership roles. Strong hands-on experience with AWS, GCP, Azure, K8S, Terraform and observability tools: Prometheus, Grafana, OpenTelemetry, ELK, Splunk, Datadog, and similar. Proficiency with metrics … team members are encouraged to challenge the status quo and contribute to our shared mission. If you thrive in dynamic environments and are eager to shape the future of observability solutions, we'd love to hear from you. Coralogix is an equal opportunity employer and encourages applicants from all backgrounds to apply. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Platform Engineer

London, United Kingdom
Hybrid / WFH Options
Fruition Group
Platform Engineer, you'll be creating, iterating, and solving genuine engineering challenges. You'll collaborate closely with product and engineering teams across the stack to improve developer experience, strengthen observability, and troubleshoot complex distributed systems. A focus on clean, maintainable code, cloud infrastructure, and strong security practices is essential. Senior Platform Engineer - Key Skills & Experience: Solid background in software engineering …/CD pipelines and modern deployment practices Familiarity with infrastructure-as-code tools such as Terraform Strong understanding of security best practices in application and infrastructure design Exposure to observability tools (e.g. Prometheus, Grafana, structured logging) Confident debugging and resolving issues in complex distributed systems Product-oriented mindset with a collaborative approach to improving developer experience Bonus: experience with Kafka More ❯
Employment Type: Contract, Work From Home
Posted:

Senior Software Engineer

London, United Kingdom
Hybrid / WFH Options
Orgvue
Overview Orgvue is a leading organizational design and planning software platform that captures the power of data visualization and modelling to build more adaptable, and better performing organizations. HR, finance and business leaders use Orgvue for actionable insight and analysis More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Python Engineer £100k benefits

Manchester, Lancashire, England, United Kingdom
Hybrid / WFH Options
Interquest
Great opportunity for Senior Python Engineers to work remotely for a UK based AI scale-up. You'd join a large engineering department and would work within a cross functional product-based team responsible for building cloud-native, event-driven More ❯
Employment Type: Full-Time
Salary: £80,000 - £100,000 per annum
Posted:

Senior Python Engineer (£100k + benefits)

Manchester, North West, United Kingdom
Hybrid / WFH Options
InterQuest Group (UK) Limited
Great opportunity for Senior Python Engineers to work remotely for a UK based AI scale-up. You'd join a large engineering department and would work within a cross functional product-based team responsible for building cloud-native, event-driven More ❯
Employment Type: Permanent, Work From Home
Posted:

Principle Engineer Sporting Solutions Sporting Solutions London

London, United Kingdom
Hybrid / WFH Options
Betsson AB
teams. Lead our migration efforts from legacy .NET Framework apps to .NET 8/9, containerised and orchestrated with Kubernetes. Champion best practices in software delivery, CI/CD, observability, and infrastructure-as-code. Drive improvements in telemetry and observability , helping us move from log-centric metrics to first-class telemetry using OpenTelemetry and modern observability stacks. Optimise for performance … Deep expertise in Kubernetes (on-prem and cloud) and Terraform . Experience with real-time, event-driven systems and message brokers (e.g., RabbitMQ, Kafka). Strong grasp of telemetry, observability, and performance monitoring in distributed systems. Track record of technical leadership and setting engineering standards. Nice to Have: Experience with OpenTelemetry , Prometheus, Grafana, or similar observability tooling. Exposure to hybrid More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AWS Senior Platform Engineer

London, United Kingdom
CACI Limited
at scale, leveraging AWS Organizations, Landing Zones, and multi-account best practices. Develop and maintain Infrastructure as Code solutions using Terraform, CloudFormation, and AWS CDK. Champion security, compliance, and observability by integrating services like AWS Security Hub, GuardDuty, and Inspector. Design CI/CD pipelines to enable seamless deployments and self-service models for customers. Innovate with AWS Networking, KMS … Proficiency in Python, Go, or similar languages for automation and scripting. Expert-level knowledge of AWS Networking, TLS, and security best practices. Experience with container orchestration (Kubernetes, EKS) and observability tools (Grafana, ELK). A passion for innovation, problem-solving, and delivering high-impact solutions. Why Work For Us? 25 days holiday + bank holidays Up to 5% employer pension More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

ENET Engineer

London, United Kingdom
Whitehall Resources Ltd
and execute the Global Markets connectivity roadmap, from project delivery through to operational handoff and life cycle management. Partner with business stakeholders and platform owners to ensure infrastructure and observability tooling meets evolving trading requirements. Monitor and manage capacity and performance of global connectivity systems, working with regional teams to aggregate local intelligence. Conduct deep-dive post-incident analysis and … and automation using Python, Bash, or PowerShell to streamline monitoring, alerting, and recovery workflows. Knowledge of FIX, market data, and order routing protocols in a trading environment. Exposure to observability platforms such as ITRS Geneos, Prometheus, Grafana, or custom telemetry stacks. Comfortable working across Linux systems, hybrid infrastructure, and global production environments. Excellent communication and reporting skills, with ability to More ❯
Employment Type: Contract
Rate: GBP Annual
Posted:

Application Services Engineer - Trading - Low Latency

London, United Kingdom
Square One Resources
Own and execute the Global Markets connectivity roadmap, from project delivery through to operational handoff and lifecycle management. Partner with business stakeholders and platform owners to ensure infrastructure and observability tooling meets evolving trading requirements. Monitor and manage capacity and performance of global connectivity systems, working with regional teams to aggregate local intelligence. Conduct deep-dive post-incident analysis and … and automation using Python, Bash, or PowerShell to streamline monitoring, alerting, and recovery workflows. Knowledge of FIX, market data, and order routing protocols in a trading environment. Exposure to observability platforms such as ITRS Geneos, Prometheus, Grafana, or custom telemetry stacks. Comfortable working across Linux systems, hybrid infrastructure, and global production environments. Excellent communication and reporting skills, with ability to More ❯
Employment Type: Contract
Rate: £500 - £550/day
Posted:

Senior Software Engineer in Test

London, United Kingdom
Hybrid / WFH Options
LinuxRecruit
This is a fast-expanding company at the forefront of odds comparison, where innovation converges with excitement. Here you can experience the best of both worlds, working within a close-knit team with autonomy while enjoying substantial financial backing from More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Data Scientist- Gen AI

London, United Kingdom
Scrumconnect Limited
London, United Kingdom Posted on 12/09/2025 We're hiring a Data Scientist with strong Generative-AI experience to design, build, and ship AI-powered tools end-to-end. You'll work in a small, multi-disciplinary More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Software Engineer / SRE

Leeds, West Yorkshire, Yorkshire, United Kingdom
Hybrid / WFH Options
Fruition Group
Software Engineer/SRE JavaScript/TypeScript, Node.js, AWS, Observability Leeds/Hybrid, c. 2x per week Salary up to £65,000 We're looking for a Software Engineer with strong AWS and Observability experience to join a growing engineering team in Leeds. This is a hybrid role, giving you the flexibility to split your time between home and a … improving platform performance and automation, while collaborating with developers, product teams, and operations. What you'll be doing: Building and maintaining scalable cloud infrastructure in AWS Implementing and improving observability tools (monitoring, logging, tracing) Automating deployments and improving CI/CD pipelines Driving reliability, availability and performance across systems Working with developers and SREs to solve complex problems What we … re looking for: Strong experience with AWS (EC2, ECS, Lambda, RDS etc.) Good knowledge of observability tools (Grafana, Prometheus, OpenTelemetry, Datadog, or similar) Background in software engineering (JavaScript/TypeScript & Node.js, although any language is fine) Experience with Infrastructure as Code (Terraform, CloudFormation, or similar) CI/CD pipelines and automation experience What's on offer: Salary up to More ❯
Employment Type: Permanent, Work From Home
Salary: £65,000
Posted:

Lead Azure Platform Engineer

Potters Bar, Hertfordshire, South East, United Kingdom
Searchstone Ltd
Platform Engineer . This is a hands-on, high-impact role where you will design, build, and operate next-generation cloud platforms, with a strong focus on operational resilience, observability, and Infrastructure as Code (IaC) . This is a player-coach role : you will lead by example, delivering complex Azure solutions while mentoring and developing other engineers. What Youll Do … and operation of Azure platforms , ensuring security, scalability, and reliability. Define and implement strategies for operational resilience , including high availability, disaster recovery, and business continuity. Establish end-to-end observability across workloads (logging, metrics, tracing, alerting) to proactively detect and resolve issues. Champion Infrastructure as Code (IaC) using Terraform, Bicep, or ARM templates for repeatable, reliable deployments. Promote native Azure … templates. Proficiency in Azure DevOps, CI/CD pipelines, and automation frameworks . Solid understanding of cloud security, governance, and compliance . Ability to design for reliability, scalability, and observability . Excellent communication and leadership skills, with a proven ability to influence technical direction. Nice to Have Familiarity with multi-region and hybrid cloud architectures . Knowledge of SRE (Site More ❯
Employment Type: Permanent
Salary: £95,000
Posted:

Staff Site Reliability Engineer / DevOps

London, United Kingdom
Almedia
engineer with hands-on experience in high-traffic production systems Strong in Linux, databases (MySQL, Postgres, MongoDB, Redis), and networking fundamentals Comfortable with Kubernetes, CI/CD pipelines, and observability tools like Datadog A self-starter who thrives in scaling environments and can work independently without PMs Pragmatic, able to balance prevention, maintenance, and firefighting when needed Your mission is … reliability Bring initiatives that make the platform automatically reliable, cost-efficient, and scalable Your impact Collaborate with engineering teams to improve operational workflows and resilience Design smart alerts, improve observability, and drive better performance monitoring Lead incident response, including on-call, and drive improvement with blameless postmortems Build safer delivery methods and improve deployments with Kubernetes and GitLab pipelines Report … reliability leader in the company Your toolkit Linux, networking (TCP/IP), and distributed systems troubleshooting Databases: MySQL, Postgres, MongoDB, Redis Kubernetes, GitLab pipelines, CI/CD best practices Observability tools like Datadog, OpenTelemetry, or ELK stack Nice-to-haves: RabbitMQ, Kafka, Terraform, Ansible, GCP, Datadog What makes this role exciting Be the first senior SRE hire with ownership of More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Network Engineer

London, United Kingdom
Experis
Own and execute the Global Markets connectivity roadmap, from project delivery through to operational handoff and lifecycle management. Partner with business stakeholders and platform owners to ensure infrastructure and observability tooling meets evolving trading requirements. Monitor and manage capacity and performance of global connectivity systems, working with regional teams to aggregate local intelligence. Develop and maintain automated alerting, health checks … and automation using Python, Bash, or PowerShell to streamline monitoring, alerting, and recovery workflows. Knowledge of FIX, market data, and order routing protocols in a trading environment. Exposure to observability platforms such as ITRS Geneos, Prometheus, Grafana, or custom telemetry stacks. Comfortable working across Linux systems, hybrid infrastructure, and global production environments. Excellent communication and reporting skills, with ability to More ❯
Employment Type: Contract
Rate: £450 - £506/day
Posted:

Full Stack AI Software Engineer

London, South East, England, United Kingdom
Ada Meher
explanations, citations) clear and accessible. Architecture: Shape a modular, scalable platform on AWS (ECS), separating ingestion, retrieval, reasoning, and delivery. Quality & reliability: Ensure reliability through testing, CI/CD, observability (metrics/tracing for LLM and retrieval paths), and performance optimisation. Collaboration: Partner with product and leadership teams, mentor peers, and play a role in shaping technical direction. Innovation: Explore … to have Experience with rerankers (e.g., cross-encoders), hybrid retrieval (SQL + vectors), query expansion, or lightweight knowledge graphs. Familiarity with LLM evaluation tooling (LangChain, LlamaIndex, OpenAI Evals) and observability for cost, relevance, and latency. Background in B2B data products or fintech. More ❯
Employment Type: Full-Time
Salary: Competitive salary
Posted:

Principal AWS Platform Engineer

London, United Kingdom
CACI Limited
at scale, leveraging AWS Organizations, Landing Zones, and multi-account best practices. Develop and maintain Infrastructure as Code solutions using Terraform, CloudFormation, and AWS CDK. Champion security, compliance, and observability by integrating services like AWS Security Hub, GuardDuty, and Inspector. Design CI/CD pipelines to enable seamless deployments and self-service models for customers. Innovate with AWS Networking, KMS … Proficiency in Python, Go, or similar languages for automation and scripting. Expert-level knowledge of AWS Networking, TLS, and security best practices. Experience with container orchestration (Kubernetes, EKS) and observability tools (Grafana, ELK). A passion for innovation, problem-solving, and delivering high-impact solutions. Experience leading/managing junior engineers Significant experience with Control Tower and deploying landing zones. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer

London, United Kingdom
Hybrid / WFH Options
Circle Internet Services Inc
to deployment and beyond-digging into production issues using tools like Honeycomb, Datadog, Grafana, and Rollbar to ensure system health. Write clear, maintainable, and well-documented Go code, with observability and long-term maintainability built in. Participate in architectural decisions and technical strategy development. Lead complex projects and initiatives from inception to completion. Mentor junior and mid-level engineers and … workflows. Knowledge of using machine learning for test selection, build optimization, or predictive CI/CD insights. Background in frontend development with frameworks like React and TypeScript. Familiarity with observability and performance optimization practices. United Kingdom Base Pay Range We will ensure that individuals with disabilities are provided reasonableaccommodation to participate in the job application or interview process, to performessential More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AppSec Lead

Central London, London, United Kingdom
Hybrid / WFH Options
Halian Technology Limited
A leading fintech company is seeking a Lead AppSec Engineer to join their established team. Youll be instrumental in embedding security into every stage of the software development lifecycleguiding engineers, shaping best practices, and driving secure, scalable solutions across our More ❯
Employment Type: Permanent, Work From Home
Posted:

Staff Engineer - Web Analytics & Infrastructure

London, United Kingdom
Hybrid / WFH Options
Intercom
testing infrastructure , making it easy for teams to run experiments, track results, and iterate quickly. Partner with full stack engineers to integrate backend services into frontend experiences, ensuring reliability, observability, and performance. Improve our infrastructure using Terraform, AWS, and CI/CD pipelines , focusing on developer productivity, testability, and system health. Contribute to strategic technical direction for Team Web - identifying … opportunities to improve code quality, reduce tech debt, and scale our systems. Coach and mentor other engineers - sharing best practices around service architecture, observability, and data engineering. What skills do I need? 10+ years of software engineering experience , with a strong emphasis on analytics and infrastructure work . Proven ability to design, build, and maintain scalable APIs, services, and data … systems in a production environment. Familiarity with data pipeline technologies , streaming/batch processing, and event instrumentation best practices. Strong grasp of CI/CD, observability, and system reliability practices in web-scale environments. Experience with Vercel , cloud infrastructure (AWS) and Infrastructure as Code (e.g., Terraform). Comfortable with frontend architecture and tooling , while this is not a UI-heavy More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Principal Software Engineer - Investments

Bristol, Avon, South West, United Kingdom
Hybrid / WFH Options
Hargreaves Lansdown
documentation practices, including Architectural Decision Records, Solution Memos, and C4 diagrams. Guide cloud architecture choices, particularly around container orchestration and the use of AWS services. Champion best practices for observability, logging, security, and networking. Identify opportunities to enhance Developer Experience and efficiency through smarter tooling and frameworks. Support engineering teams with mentoring, pairing, and skills development. Lead conversations around Event … every level of the organisation. Proven ability to balance trade-offs, costs, and technical constraints. Experience coaching teams towards engineering and architecture best practices. Deep understanding of security, networking, observability, and system flows. Adept at producing clear, concise architectural documentation. Desirable: Previous experience as a Solution or Enterprise Architect. Background in enterprise systems and legacy-to-modern transitions. Familiarity with More ❯
Employment Type: Permanent, Part Time, Work From Home
Salary: £95,000
Posted:

Software Engineer, Data

London, United Kingdom
Integer, LLC
TL;DR Kharon is seeking a full-time, London or Madrid-based Software Engineer with proficiency in data engineering practices . Occasional in office attendance is required for this role. RESPONSIBILITIES: Design, develop, test, deploy, and maintain scalable backend services More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineer

London, United Kingdom
Hybrid / WFH Options
Ravelin
Who are we? Hi! We are Ravelin! We're a fraud detection company using advanced machine learning and network analysis technology to solve big problems. Our goal is to make online transactions safer and help our clients feel confident serving More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
Observability
England
10th Percentile
£56,250
25th Percentile
£70,000
Median
£80,000
75th Percentile
£105,000
90th Percentile
£132,500