451 to 475 of 505 Observability Jobs in England

Software Integration Apprentice

Hiring Organisation
SPIRAX-SARCO LIMITED
Location
9-15 Runnings Road, Kingsditch Trading Estate, Cheltenham, England, United Kingdom
Employment Type
Higher Apprenticeship
Salary
£25,000 a year
This is a unique opportunity to gain hands-on experience in enterprise integration, data analytics and software design. Working alongside experienced professionals to support the design, development, testing, and monitoring of integration solutions across our ...

Python Developer

Hiring Organisation
Arcus Search
Location
City of London, London, United Kingdom
Python Developer - Observability Engineering London | Hybrid | Perm I’m working with a leading quantitative research and trading firm looking to expand their Observability Engineering team. This team sits at the centre of engineering productivity, owning the systems that allow teams to produce, move and consume telemetry at scale. The focus … making observability seamless across a large, high-performance environment handling cloud-level volumes of data. The role • Build and extend observability tooling across telemetry pipelines and backend systems • Develop and maintain OpenTelemetry collectors, SDKs and exporters • Define and promote “golden paths” for instrumentation across a wide range of services • Work ...

Lead Dev Ops Engineer

Hiring Organisation
Birketts LLP
Location
Ipswich, Suffolk, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
secret-handling patterns aligned to Birketts expectations Implement and enforce PR/branch policies and release controls to reduce variability and operational risk Platform observability and operational readiness Provide and evolve platform observability foundations: monitoring, logging, metrics, dashboards and alerting (using the agreed toolset) Define and improve incident response ...

SRE - Site Reliability Engineer

Hiring Organisation
Randstad Technologies Recruitment
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£55 - £62/hour
Senior Site Reliability Engineer (Observability) Location: London/UK (Remote) Contract: 12 Months Initial Day rate : £55 Per Hour - £62 Per Hour Inside IR35 Job Overview We are looking for a Senior Site Reliability Engineer with strong experience in Observability, Monitoring and Distributed Systems to support large-scale cloud infrastructure … focuses on building and scaling monitoring, logging and alerting platforms to ensure high availability and performance of cloud services. Responsibilities Design, deploy and scale observability platforms Manage and scale Prometheus monitoring systems Deploy and maintain large Elasticsearch clusters Build and maintain data pipelines using Kafka Develop alerting and monitoring frameworks ...

Senior Data Engineer

Hiring Organisation
develop
Location
City of London, London, United Kingdom
Deploy pipelines into ontology-aware platforms (e.g. graph databases, semantic layers, Foundry-style systems) Ensure semantic compliance, data integrity and reasoning-readiness Data Quality, Observability & Lineage Implement robust data quality frameworks (validation, profiling, anomaly detection) Build observability into pipelines (lineage tracking, logging, freshness monitoring, schema drift detection) Ensure alignment with … thinker – brings clarity and rigour to ambiguous, messy data domains Collaborative – works effectively with ontology architects, AI engineers and consultants Quality-driven – prioritises correctness, observability, maintainability and semantic integrity Clear communicator – able to explain semantic concepts and data reasoning to non-technical stakeholders Low ego, high ownership – focused on outcomes ...

Lead Platform Engineer

Hiring Organisation
REVYBE IT RECRUITMENT LIMITED
Location
City of London, London, United Kingdom
Employment Type
Permanent, Work From Home
LeadPlatformEngineer-FinTech £110,000+Bonus(£15k+) CentralLondon-Hybridworking,2/3daysperweekintheoffice WereworkingwithahighlysuccessfulFinTechbusinessinCentralLondonwhoarelookingtohireaLeadPlatformEngineertohelpshapethefutureoftheirinfrastructureandplatformstrategy. Thisisahigh-impactrolewithinagrowingengineeringteamwhereyoullhavetheopportunitytoinfluencearchitecturaldecisions,mentorengineers,andremaindeeplyhands-onwithmoderninfrastructuretooling.Thecompanybuildsallit'ssoftwarein-houseandhasbeeninvestingheavilyinitsplatform,observability,andcloudcapabilitiesastheycontinuetoscale. TheOpportunity: YoulljoinastheLeadPlatformEngineer,workingcloselywithengineeringleadershiptodriveimprovementsacrossinfrastructure,reliability,anddeveloperexperience.Thisrolesitsattheintersectionofhands-onengineering,mentoring,andstrategy.Youllguideplatformdirectionwhilecontinuingtobuildandimprovetheinfrastructurethatpowersthebusiness. Youllalsomentoroneplatformengineer,helpingthemgrowwhileensuringtheteamcontinuesdeliveringhigh-qualityinfrastructureandautomation. Environment: Theplatformcurrentlyoperatesinahybridenvironment: ~60%on-premiseinfrastructure ~40%MicrosoftAzure Thelong-termstrategyisfocusedonmodernisingtheplatform,improvingobservability,andevolvingcloudcapabilities,makingthisanexcellentopportunityforsomeonewhoenjoysbuildingandshapingsystems. TechStack: YoullbeworkingacrossamodernDevOpsandplatformstackincluding: Kubernetes Terraform Hybridcloudinfrastructure(on-premise+Azure …/CD&Automation GitHubActions Python AzureServices AzureKubernetesService(AKS) AzureVirtualMachines AzureVirtualNetworks AzureLoadBalancer AzureApplicationGateway AzureStorageAccounts AzureBlobStorage AzureKeyVault AzureMonitor AzureLogAnalytics AzureActiveDirectory AzureContainerRegistry AzureDNS AzureDevOpsintegrations Observability Logging,monitoring,andtracingacrossdistributedsystems Buildingmeaningfultelemetryandplatformvisibility Whatyou'llbedoing: Leadingtheevolutionofthecompanysplatformandinfrastructurestrategy DesigningandimprovinghybridAzure+on-premiseenvironments DrivingKubernetesplatformimprovements BuildingautomationwithTerraformandPython Improvingobservabilityandmonitoringacrosssystems MentoringaPlatformEngineerandhelpingshapeplatformbestpractices Workingcloselywithengineeringteamstoimprovedeveloperexperienceandreliability Whythisroleisexciting: Hugeimpactonthefutureplatformarchitecture Opportunitytoshapethehybridcloudstrategy Combinationoftechnicalleadershipandhands-onengineering ModernDevOpstoolingandcloudtechnologies Directinfluenceonplatformreliabilityandscalability Package: Salary:Upto£110,000 Bonus:15k+ ...

Lead Software Engineer

Hiring Organisation
5V Video
Location
City of London, London, United Kingdom
+ AWS (Lambda, API Gateway, S3, DynamoDB) Handling event-driven architectures (Kafka, SNS/SQS, etc.) Driving system design decisions across distributed systems Improving observability, reliability, and performance in production Debugging complex issues and leading resolution across teams Staying hands-on while setting technical direction and standards Tech Stack Python … Lambda, API Gateway, S3, DynamoDB, IAM) Event-driven systems (Kafka, SNS/SQS) CI/CD (Concourse, Git workflows) Databases (Postgres, DynamoDB, Couchbase) Observability (Prometheus, Grafana, CloudWatch) What You’ll Bring Strong backend engineering experience (Python preferred) Proven experience building distributed systems at scale Deep understanding of microservices + event ...

Platform Engineer (Outside IR35) - MOD SC

Hiring Organisation
Talent Locker
Location
Farnborough, Hampshire, South East, United Kingdom
Employment Type
Contract
Contract Rate
£475 - £500 per day
secure platforms. Key Responsibilities Develop and enhance platform services across hybrid environments Improve and standardise automated deployment and CI/CD pipelines Strengthen observability, monitoring, and proactive operations Support incident response, troubleshooting, and service improvements Provide guidance on platform patterns, tooling, and best practices Contribute to architectural decisions and technical … managing cloud infrastructure OpenShift & Kubernetes - configuring clusters, building containers, and managing repositories CI/CD pipelines - building and improving automated deployment processes Monitoring & Observability - implementing proactive monitoring across platforms Automation of platform components - ensuring reliable, repeatable operations Secure by Design - experience delivering platforms aligned with MOD security standards MOD Applications ...

Lead DevOps Engineer (Azure)

Hiring Organisation
Reed Technology
Location
East Anglia, United Kingdom
Employment Type
Permanent
Salary
£75,000
pipeline templates, PR/branch policies, approvals and gated releases * Creating 'golden path' delivery patterns so teams can deploy without bespoke pipelines Operational readiness & observability * Defining monitoring, logging, alerting and dashboards * Improving incident response, runbooks and recovery processes * Shaping DR and operational processes (no on-call at present) Ways …/CD engineering experience * Experience implementing governance, security guardrails and delivery controls * Comfortable operating without an existing DevOps team Desirable * Azure Policy at scale * Observability, SRE or platform engineering practices * Container/AKS experience * Cost governance and showback/chargeback experience Why this role? * Opportunity to own and shape DevOps ...

Cloud Security and Platform Engineer

Hiring Organisation
RealityMine
Location
Trafford Park, England, United Kingdom
mainly focused on AWS, with growing involvement in other cloud and SaaS platforms. You’ll improve existing environments—managing identity and access, governance, security, observability, and lifecycle—by reducing risks, eliminating unsafe configurations, validating ownership, and ensuring the cloud estate is clearly governed and auditable. You will take an active … role in improving RealityMine’s security posture by improving and operating security scanning, improving monitoring and observability, and ensuring risks, vulnerabilities, and end of life components are identified and addressed in a timely and pragmatic way. You will also develop automation used to support security and operational hygiene, reducing manual ...

Principal Architect - Platforms

Hiring Organisation
Jobleads-UK
Location
Southampton, England, United Kingdom
cost effectiveness. • Define and govern integration patterns, including APIs, messaging, event streaming, and cross-platform data movement. • Ensure platforms are designed for operability and observability, including monitoring, logging, alerting, and performance management. • Influence DevOps and platform engineering practices, including automation, CI/CD pipelines, and infrastructure as code. • Embed security … production environments. • Deep experience with DevOps and platform engineering practices, including automation, CI/CD, and infrastructure as code. • Strong understanding of reliability engineering, observability, and operational excellence. • Security-minded with practical experience embedding security controls into platform designs. • Experience acting as a trusted technical advisor to senior stakeholders. • Strong ...

Forward Deployed Engineer

Hiring Organisation
Novatus Global
Location
City of London, London, United Kingdom
Novatus Global is a Series B scale-up RegTech SaaS provider and boutique advisory firm, helping financial institutions manage their most complex regulatory requirements. We combine deep consulting expertise with cutting-edge SaaS solutions, enabling ...

Cloud Advisory - Agentic Focused Architecture Consultant

Hiring Organisation
Accenture
Location
London Area, United Kingdom
where GenAI and Agentic play a role. Champion system performance, resilience, and efficiency: Proactively identifying and addressing consumption and scalability challenges. Champion full stack observability using modern full stack observability, SRE and AIOps. Manage & Mentor: Lead teams of architects and engineers, providing technical coaching, career counselling, performance management, and coaching ...

AI Product Developer

Hiring Organisation
DCV Technologies
Location
Leeds, West Yorkshire, United Kingdom
Employment Type
Contract
Contract Rate
£400 - £500/day
pipelines, embeddings and vector search Integrate AI solutions with enterprise systems, APIs and cloud platforms (Azure) Implement secure-by-design AI engineering, monitoring and observability Develop reusable components including prompt libraries, agent frameworks and connectors Prototype, test and improve model performance, reliability and scalability Key Skills Strong experience with LLMs …/Azure OpenAI/cloud-native development Experience building production AI systems and API integrations Knowledge of DevOps, CI/CD, monitoring and observability Experience with LangChain, agent frameworks or Copilot Studio is highly desirable Desirable Experience in financial services or regulated environments Understanding of responsible AI, security and governance ...

Lead Site Reliability Engineer

Hiring Organisation
McGregor Boyall
Location
Leeds, West Yorkshire, England, United Kingdom
Employment Type
Full-Time
Salary
£90,000 - £105,000 per annum
they migrate services to the Cloud. Work with Product Owners and Engineering Leads to balance feature delivery with system reliability, performance and health. Use observability tooling, performance metrics and SRE principles to proactively identify issues and reduce operational toil. Implement Incident and problem management practices, ensuring strong root cause analysis … Technical Skills required: Strong cloud engineering background, ideally across Azure and GCP. Experience building or operating large-scale, resilient cloud platforms. Deep understanding of observability tooling (metrics, logs, traces). Hands-on experience with modern SRE practices: SLOs/SLIs Error budgets Automation to reduce toil Production readiness and robust ...

Senior SRE (Java)

Hiring Organisation
Morgan McKinley
Location
City of London, London, England, United Kingdom
Employment Type
Full-Time
Salary
Salary negotiable
Software-First Approach to Reliability I am currently partnering with a major FTSE 100 FinTech company that is undergoing a massive modernisation and observability overhaul. They aren't looking for a traditional, infrastructure-heavy SRE; they need a Senior Java Developer who has recently transitioned into the SRE space. … Foundation: 5+ years of Java development experience with a deep understanding of JVM internals. The SRE Pivot: Recent experience in a Site Reliability or Observability role, with hands-on knowledge of OpenTelemetry , Jaeger, or similar tracing tools. The Mindset: A strong philosophy on what makes a "good ...

Gen AI Engineer

Hiring Organisation
Wave Group
Location
England, United Kingdom
applications in production environments Evidence of debugging real issues such as incorrect outputs, latency spikes, retrieval failures or agent misbehaviour Experience with monitoring and observability of LLM systems, for example Langfuse, Prometheus, Grafana, OpenTelemetry or similar Strong understanding of RAG systems, retrieval pipelines and evaluation workflows Experience with agentic frameworks … application and infrastructure layers Multimodal experience across text and image or video is beneficial Tech stack Python, AWS, LangGraph, LangChain, vector databases, evaluation tooling, observability platforms, Docker Why join Small, senior team with high ownership Systems already in production with real customers Bi-weekly shipping cycles with fast feedback loops ...

Senior Software Engineer (DevSecOps)

Hiring Organisation
CBSbutler Holdings Limited trading as CBSbutler
Location
Skipton, North Yorkshire, United Kingdom
Employment Type
Contract
Contract Rate
£550 - £580/day
measurable outcomes. The role You will take ownership of the full delivery lifecycle: from pipeline design and environment architecture through to release-linked observability and incident readiness. Day to day, you can expect to be shipping small, frequent changes using trunk-based development and feature flags, embedding security and quality … DAST, IaC scanning, SBOM, WAF configuration, and pipeline attestations Experience building and managing ephemeral, production-like environments with data-on-demand capability Strong observability skills - tracing, metrics, logs, SLO/error budget management, and deployment annotations Familiarity with DORA metrics and a track record of removing flow constraints at squad ...

Founding Engineer

Hiring Organisation
Omnam Investment Group
Location
London Area, United Kingdom
environments Lead integrations with external systems and support early data onboarding Establish engineering standards, tooling, documentation, and technical processes from the start Set up observability, monitoring, and performance systems Jump in wherever needed, from quick scripts and data cleaning to debugging production issues What You Bring 5+ years of engineering … with backend frameworks (FastAPI, Django, Node.js, Rails, etc.) Strong SQL, data modeling, and database design knowledge Familiarity with IaC, containers, CI/CD, and observability tools Bonus : experience in ETL, or hospitality/proptech/real-estate technology Why Join Us We work together in the heart of London ...

Senior Technical Delivery Manager

Hiring Organisation
Stackstudio Digital Ltd
Location
Norwich, Norfolk, East Anglia, United Kingdom
Employment Type
Contract
Contract Rate
From £500 to £550 per day
/Tableau), ML feature pipelines, self-service data products. Oversee architecture conformance, security/compliance (PII/PHI, GDPR), and cost optimization. Ensure observability (logging/metrics), DQ SLAs, lineage, and platform SLOs. Align data initiatives to insurance value streams: Policy Admin Claims Underwriting Pricing/Actuarial Distribution/Broker … engineering (lakehouse, ETL/ELT, orchestration), and BI/analytics platforms. Strong understanding of architecture alignment, data privacy/security (PII/PHI, GDPR), observability, operational SLOs, and cost optimization. Broad insurance domain knowledge across policy administration, claims, underwriting, pricing/actuarial, distribution, fraud, and regulatory reporting, with ability ...

Senior Java Engineer (reliability & observability)

Hiring Organisation
GCS
Location
Northampton, Northamptonshire, United Kingdom
Employment Type
Permanent
Salary
£45000 - £60000/annum
Boot development experience in high-throughput systems Deep understanding of event-driven and messaging architectures (Kafka, JMS, AMQP or similar) Experience engineering reliability and observability at scale (monitoring, tracing, SLIs/SLOs) Desirable Skills: Experience building notification delivery infrastructure (webhooks, push, SMS) Awareness of the payments domain, including processing flows ...

Senior Enterprise Sales

Hiring Organisation
Harrington Starr
Location
City Of London, England, United Kingdom
Senior Enterprise Sales Financial Markets | Observability & Infrastructure Software London/Hybrid £120k–£140k base | £250k–£280k+ OTE uncapped This is a high-growth, PE-backed technology platform used by Tier-1 financial institutions to monitor, analyse and optimise mission-critical trading and infrastructure environments. The software sits deep within … Infrastructure & SRE teams Production Engineering Trading Technology Capacity & Performance Engineering Enterprise Architecture You will be positioning a platform that sits at the heart of observability, operational resilience and infrastructure intelligence across complex financial ecosystems. Commercial Scope Deals are large, strategic and multi-year: £100k+ minimum entry point £500k typical deal ...

SRE Lead (Banking/Financial)

Hiring Organisation
Ascendion
Location
City of London, London, United Kingdom
across production systems. Key Responsibilities: Lead the SRE function across the engineering organisation and drive operational excellence across production systems. Define and implement the observability and monitoring strategy, including dashboards, alerting, SLOs, SLAs, and error budgets. Establish comprehensive monitoring coverage to ensure visibility into system health, infrastructure, and business-critical … engineering teams. Manage incident response processes, including on-call management and post-incident reviews. Collaborate with product and engineering teams to build reliability and observability into new systems. Monitor UI behaviour and end-to-end system performance, not just infrastructure metrics. Essential Skills & Experience: Proven experience as an SRE Lead ...

ML & AI -Engineers/Architect/Lead

Hiring Organisation
KBC Technologies Group
Location
England, United Kingdom
version control, and ensuring production-ready AI systems . You’ll also play a key role in integrating AI/LLM agents with strong observability and rollback mechanisms. Location : Leeds/Manchester Client : IT End Client :Banking domain Work Mode: Hybrid Contract : Inside IR 35 Salary : Market Standards Key Responsibilities … workflows Manage model versioning and release processes Monitor inference cost, latency, and model drift Safely integrate AI/LLM agents into production systems Implement observability, alerting, and rollback mechanisms Experience Levels We’re hiring across multiple seniority levels: Senior Developer: 3–6 years (ML/AI Engineering) Lead Engineer ...

Rust Engineer

Hiring Organisation
Huxley Associates
Location
London, United Kingdom
Employment Type
Permanent
Salary
£150000 - £180000/annum
from systems that actually matter. ETrading, you will build the infrastructure that sits between our traders and the market - execution paths, data pipelines, and observability tooling that power trillions in annual notional volume. When a system performs at 3am under peak load, you will be one of the reasons why. … kernel bypass awareness (DPDK, io_uring) Distributed messaging and event streaming: Kafka, NATS, or equivalent; ordering guarantees, exactly-once semantics, consumer group management Production observability: metrics (Prometheus/OpenTelemetry), distributed tracing, structured logging, and alert design CI/CD pipeline design including benchmarking gates, automated performance regression detection, and reproducible ...