451 to 475 of 567 Observability Jobs in the UK

Splunk Developer

Hiring Organisation
Infoplus Technologies UK Ltd
Location
Central, Inverclyde, UK
application teams to deliver scalable monitoring, service health, and analytics solutions.________________________________________ Key Responsibilities Technical Leadership Act as Technical Lead for Splunk implementations across monitoring, observability, and service intelligence use cases.Own end to end Splunk solution design including data onboarding, data models, dashboards, alerts, and ITSI objects.Review and govern Splunk development … Studio/Classic dashboardsDesign meaningful alerts using:oCorrelation searchesoRisk based alerting principlesTranslate operational and business requirements into actionable insights.Observability & Production SupportIntegrate Splunk with enterprise observability tools (APM, infrastructure monitoring, cloud platforms).Support production incidents using Splunk, driving root cause analysis and post incident reviews.Improve alert quality by reducing noise ...

Splunk Developer

Hiring Organisation
Infoplus Technologies UK Ltd
Location
Livingston, West Lothian, UK
application teams to deliver scalable monitoring, service health, and analytics solutions.________________________________________ Key Responsibilities Technical Leadership Act as Technical Lead for Splunk implementations across monitoring, observability, and service intelligence use cases.Own end to end Splunk solution design including data onboarding, data models, dashboards, alerts, and ITSI objects.Review and govern Splunk development … Studio/Classic dashboardsDesign meaningful alerts using:oCorrelation searchesoRisk based alerting principlesTranslate operational and business requirements into actionable insights.Observability & Production SupportIntegrate Splunk with enterprise observability tools (APM, infrastructure monitoring, cloud platforms).Support production incidents using Splunk, driving root cause analysis and post incident reviews.Improve alert quality by reducing noise ...

Splunk Developer

Hiring Organisation
Infoplus Technologies UK Ltd
Location
North East, Glasgow, UK
application teams to deliver scalable monitoring, service health, and analytics solutions.________________________________________ Key Responsibilities Technical Leadership Act as Technical Lead for Splunk implementations across monitoring, observability, and service intelligence use cases.Own end to end Splunk solution design including data onboarding, data models, dashboards, alerts, and ITSI objects.Review and govern Splunk development … Studio/Classic dashboardsDesign meaningful alerts using:oCorrelation searchesoRisk based alerting principlesTranslate operational and business requirements into actionable insights.Observability & Production SupportIntegrate Splunk with enterprise observability tools (APM, infrastructure monitoring, cloud platforms).Support production incidents using Splunk, driving root cause analysis and post incident reviews.Improve alert quality by reducing noise ...

DevOps Engineer

Hiring Organisation
Reed Technology
Location
Durham, County Durham, North East, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£60,000
pipelines using Azure DevOps Supporting monitoring, reliability, and operational readiness Working alongside engineers to embed better DevOps and platform practices Contributing to security, observability, and continuity planning What they're looking for Proven experience in an Azure-focused DevOps or platform engineering role Hands-on Terraform experience used in live … essential) DevSecOps exposure Cloud cost management/FinOps awareness Understanding of .NET/C# based platforms Scripting with PowerShell, Bash or Python Experience with observability and monitoring tools Interest in using AI tools to improve engineering productivity Working setup & culture Hybrid working with a flexible, trust-based approach Supportive, inclusive ...

Java Software Engineer

Hiring Organisation
Addition
Location
Cheltenham, England, United Kingdom
secure coding principles to meet strict performance and security standards Contribute to architecture decisions, code quality, testing and continuous improvement Implement monitoring, logging and observability to support live environments Main Skills Needed: Strong Java development experience (Java 11+) using frameworks like Spring Boot Solid understanding of software architecture … building microservices and distributed systems Proven ability to deliver scalable, high-performance backend applications Familiarity with DevSecOps tools (Docker, Kubernetes, CI/CD, testing, observability) Confident working with stakeholders to translate requirements into technical solutions Active enhanced DV clearance What’s in It for You: Work on meaningful projects that ...

Senior Java Software Engineer

Hiring Organisation
Addition
Location
United Kingdom
TypeScript, aligned to shared design systems Taking ownership of technical decisions, documenting approaches clearly through Architecture Decision Records (ADRs) Improving system reliability by driving observability, SLOs, incident response practices, and operational excellence Enhancing CI/CD pipelines, testing strategies, and deployment workflows to improve developer experience Collaborating closely with Product … scalability, and system resilience Knowledge of secure coding practices, OWASP principles, and regulatory/compliance considerations Hands-on experience with CI/CD pipelines, observability tooling, and incident management Ability to write clear technical documentation including architectural decisions Experience mentoring engineers and contributing to team development and delivery capability Familiarity ...

Cloud DevOps Engineer - Derby- £70K

Hiring Organisation
Akkodis
Location
Derbyshire, United Kingdom
Employment Type
Permanent
Salary
£50000 - £70000/annum
where there's genuinely a lot going on, in a good way. They're moving away from legacy infrastructure, modernising their cloud estate, improving observability, and continuing to build out their platform engineering capability. So if you enjoy being part of real change rather than just keeping the lights … collaboration too, you'll be working closely with Dev, QA and Product, helping teams release software reliably while also pushing forward things like monitoring, observability and overall platform resilience. Tech-wise? It's an Azure-first setup, but they're open to people who've worked across ...

Software Engineering Manager Forecasting & Rostering, Capacity Management, Platform Enablement and APIM

Hiring Organisation
Centrica - CHP
Location
Windsor, Berkshire, South East, United Kingdom
Employment Type
Permanent
best practice, reduce duplication, and promote maintainable, secure and performant systems. Enhance delivery capability through platform reliability and DevOps maturity - Continuously improve deployment pipelines, observability, alerting, incident handling, recovery procedures and operational readiness across Field Ops engineering teams. Manage stakeholders and ensure transparent communications - Build strong relationships across product, operations … decisions Funding for technical enablers Field Ops workflow design and data requirements Use of Data/Insight/Automation Uses engineering metrics, performance insights, observability data and AI[1]assisted diagnostics to guide decisions. Ensures human judgement remains central. Constraints Centrica architectural principles, engineering guardrails, data privacy/security policies ...

Client Service Delivery

Hiring Organisation
Accenture
Location
Birmingham, England, United Kingdom
Service Delivery Management Own full lifecycle service delivery across infrastructure and cloud environments, ensuring alignment to SLAs, KPIs, scope, and cost. Leverage AIOps and observability tools (e.g. Dynatrace, Datadog, New Relic, Elastic) to proactively monitor service health and performance. Utilise predictive alerting and anomaly detection to prevent incidents and optimise … infrastructure and cloud environments Strong understanding of IT Managed Services frameworks Hands-on experience with AIOps tools such as Dynatrace and ServiceNow Familiarity with observability tools (e.g. Datadog, New Relic, Elastic) Knowledge of event analytics tools such as Splunk IT Service Intelligence and Moogsoft Experience in stakeholder and client management ...

Head of Infrastructure

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
platform and infrastructure strategy Design and evolve cloud architecture to support scale, resilience, and performance Set standards for infrastructure, CI/CD, environments, and observability Make architectural decisions and trade‐offs Developer Experience (DevEx) Provide infrastructure for the development team to code, test and deploy efficiently Advise during design sessions … growing company Ability to operate production systems under pressure Deep hands‐on experience with the AWS cloud platform Strong background in reliability, observability, and incident management Experience leading or mentoring engineers What we offer in return 💰 Competitive salary depending on experience 🏝️ 27 days of annual leave (including 3 days Christmas ...

Software Engineering Manager - Knowledge/AI and Platform Enablement Squads

Hiring Organisation
Centrica - CHP
Location
Windsor, Berkshire, South East, United Kingdom
Employment Type
Permanent, Work From Home
best practice, reduce duplication, and promote maintainable, secure and performant systems. Enhance delivery capability through platform reliability and DevOps maturity - Continuously improve deployment pipelines, observability, alerting, incident handling, recovery procedures and operational readiness Manage stakeholders and ensure transparent communications - Build strong relationships across product, operations, delivery and business teams … management, data modelling and data quality controls. Ability to produce high level and detailed design specifications. Experience running DevOps practices including CI/CD, observability, monitoring and incident management. Multi-squad engineering leadership Proven experience leading software engineering delivery in a complex, multi team environment Experience providing technical leadership ...

Software Engineering Manager - In-Life Domain

Hiring Organisation
Centrica - CHP
Location
Windsor, Berkshire, South East, United Kingdom
Employment Type
Permanent, Work From Home
practice, reduce duplication, and promote maintainable, secure and performant systems. 4. Enhance delivery capability through platform reliability and DevOps maturity Continuously improve deployment pipelines, observability, alerting, incident handling, recovery procedures and operational readiness across Field Ops engineering teams. 5. Manage stakeholders and ensure transparent communications Build strong relationships across product … management, data modelling and data quality controls. Ability to produce high level and detailed design specifications. Experience running DevOps practices including CI/CD, observability, monitoring and incident management. Demonstrated capability in leading multi squad engineering execution in a product led organisation. Mindset & Ways of Working Comfortable working in iterative ...

Lead Full Stack Developer

Hiring Organisation
Couch Heroes
Location
United Kingdom
design that balance immediate delivery needs with long-term scalability. Establish monitoring, alerting, and incident response practices using Grafana and Uptime Kuma or equivalent observability tools. Frontend Engineering Build polished, responsive, and accessible interfaces using TypeScript, React, and Next.js, with Redux for state management and Zod for schema validation. Collaborate … with real-time communication systems: WebSocket at scale, pub/sub architectures, or event-driven service design. Experience with Grafana, Uptime Kuma, or equivalent observability and monitoring tools. Experience with OAuth2/JWT authentication flows and identity management. Experience with Golang or additional backend languages. Contributions to open-source projects ...

Senior Site Reliability Engineer

Hiring Organisation
Realm
Location
City of London, London, United Kingdom
High-growth infrastructure company focused on delivering large-scale compute, data centre capacity, and power solutions for advanced machine learning workloads. Platforms support leading research and industry teams requiring high-performance computing at significant scale. ...

Technical Lead - Full Stack - AWS - Microservices - East Kilbride/Hybrid (4 DPW On-Site)

Hiring Organisation
Curo Services
Location
East Kilbride, Lanarkshire, United Kingdom
Employment Type
Permanent
Salary
GBP 70,000 - 75,000 Annual
Subject - Technical Lead - Full Stack - AWS - Microservices - East Kilbride/Hybrid (4 DPW On-Site) - £70-75K Per Annum Job Title: Engineering Technical Lead Location: East Kilbride Salary: £70-75K Per Annum Benefits ...

SRE Managing Consultant - Cloud Operating Model

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
Budgets : Establish service measures and targets (SLIs/SLOs) and introduce Error Budgets to enable data‐driven trade‐offs between reliability and delivery velocity. Observability & Operational Insight: Shape observability approaches (metrics/logs/traces) and operational monitoring models that make reliability risks visible and actionable, improving operational decision‐making. … large‐scale delivery contexts; associate‐level certifications are desirable but not mandatory. Design, establish, and evolve SRE‐led centres of excellence (e.g. Reliability, Observability, or Operational Excellence), setting enterprise‐level standards for SLIs/SLOs, incident management, observability, and continuous improvement across cloud and hybrid platforms. Exposure to modern observability ...

DevOps Engineer

Hiring Organisation
Infinity Quest
Location
Halifax, England, United Kingdom
Actions, Harness, Jenkins). • Networking & Security: Experience with GCP Cloud Armor, GCP Networking, and embedding secure-by-design controls from design to runtime. • Automation & Observability: Implementing actionable observability, performance tuning, and automation to reduce toil. Defining and operating against SLOs/SLIs. • Scripting & Tooling: Scripting in Bash, PowerShell, or Python. … Performance & Reliability: Define, monitor, and operate against service level objectives (SLOs/SLIs), ensuring high availability, performance, and fault tolerance. • Continuous Improvement: Drive automation, observability, and performance tuning to reduce manual effort and improve platform reliability. • Collaboration: Work closely with architecture and feature teams to evolve the cloud roadmap ...

SRE Observability Engineer

Hiring Organisation
Access Computer Consulting
Location
City of London, London, United Kingdom
Employment Type
Contract
Contract Rate
£350 - £450/day
recruiting for an SRE Observability Engineer to work in London 2-3 days a week, remaining time remote. The role falls inside IR35 so you will be required to work through an umbrella company for the duration of the contract. This is a 6 month contract which will transfer … permanent role after the initial contract term. You will be responsible for collaborating across various organisations within the client to understand and develop observability solutions for enterprise-wide deployment at scale. You will also manage the legacy monitoring stack across the Production Management organisation within the client. You must have ...

Principal Engineer

Hiring Organisation
Centrica - CHP
Location
Windsor, Berkshire, South East, United Kingdom
Employment Type
Permanent, Work From Home
enhance safety, compliance, customer experience and productivity. 5. Establish engineering excellence across teams Champion high engineering standards: clean architecture, CI/CD automation, observability, testing strategies, release processes, telemetry, performance tuning and secure-by-design principles. 6. Lead platform performance, reliability & offline capability Ensure the mobile environment performs reliably … Quality and Platform wide capabilities Shape quality, resilience, and security strategies across teams-ensuring teams adopt shift left testing, strong security hygiene, consistent observability, and reliable operational processes. 8. Improve how work is done (template requirement) Continuously identify opportunities to automate, simplify, reduce cycle time, improve developer experience, adopt ...

DevOps Engineer ID46327

Hiring Organisation
Humand Talent
Location
Oxfordshire, England, United Kingdom
your typical DevOps role. You’ll be working across a mix of cloud-connected and fully isolated environments , tackling unique challenges around deployment, observability, and infrastructure at scale. You’ll play a key role in designing how complex systems are commissioned, deployed, and maintained in both standard and highly controlled … What you’ll be doing Building and automating infrastructure using modern IaC tools Developing and improving CI/CD pipelines (self-hosted environments) Designing observability across distributed systems Supporting deployments across both connected and air-gapped environments Contributing to the evolution of a hybrid cloud/on-prem platform What ...

Lead Integration Engineer & Developer

Hiring Organisation
Ashdown Group
Location
Liverpool, Merseyside, North West, United Kingdom
Employment Type
Permanent, Work From Home
Gateway, EventBridge, SQS, SNS) Node.js/Javascript/TypeScript and Python Data & Infrastructure DynamoDB, RDS Infrastructure as Code (Terraform, CDK, CloudFormation) CloudWatch and observability tooling Integrations HubSpot (CRM) Internal microservices and external APIs Required Experience 7+ years in backend or platform engineering Strong hands-on AWS experience (serverless preferred) Proven … APIs End-to-end ownership of systems (design build operate) Technical Expertise Event-driven architecture (EventBridge, SQS, SNS, Kafka) Reliability patterns (retries, idempotency, DLQs) Observability and debugging in distributed systems Data modelling and schema evolution Leadership & Collaboration Ability to lead technical design and influence architecture Experience mentoring engineers Strong communication ...

Data Platform Engineer

Hiring Organisation
PRISM DIGITAL LIMITED
Location
Milton Keynes, Buckinghamshire, South East, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£75,000
availability Own incident resolution, root cause analysis, and continuous improvement Collaborate with engineers and third-party providers to mature the platform Contribute to monitoring, observability, and cost optimisation strategies Support projects and business initiatives through robust platform delivery What Theyre Looking For: Microsoft Fabric experience Terraform experience Cloud platform engineering … delivery environments What Youll Work With: Microsoft Fabric Terraform (Infrastructure as Code) Azure cloud technologies SQL Server GitHub/CI/CD tooling Monitoring & observability tools Platform design patterns (scalability, resilience, cost control) Nice to Haves: GitHub Actions/CI/CD pipelines Zero Trust architecture Cloud cost monitoring & reporting ...

Infrastructure / DevOps Engineer

Hiring Organisation
rmg digital
Location
England, United Kingdom
Managing and optimising AWS services, including ECS, Lambda, VPC, and Aurora Postgres Building and maintaining CI/CD pipelines using GitHub Actions Implementing monitoring, observability, and alerting using Datadog Supporting development teams with deployment, automation, and operational best practices Improving infrastructure security, scalability, reliability, and cost-efficiency Monitoring system performance … Infrastructure as Code tools such as Terraform and/or CDK Understanding of CI/CD pipelines and GitHub Actions Familiarity with monitoring and observability tooling, such as Datadog Knowledge of containerisation concepts and infrastructure best practices Some experience with TypeScript or JavaScript for scripting and CDK purposes Strong troubleshooting ...

Senior Platform Engineer (Fully Remote) - GKE, GCP, Terraform

Hiring Organisation
Sanderson Recruitment
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
manage workloads using Helm with strong isolation and configuration practices Own and improve CI/CD pipelines using Azure DevOps and GitOps Embed observability across the platform (monitoring, logging, alerting, tracing) Define and enforce platform standards, patterns and best practices Produce and maintain high-quality documentation, diagrams and runbooks Lead … expertise, particularly Azure DevOps Git-based workflows, GitOps and tools such as Argo CD Experience with service mesh technologies (e.g. Istio) Exposure to observability/APM tooling Confident technical leader with experience setting standards and mentoring others Comfortable working in shared platform environments Reasonable Adjustments: Respect and equality are core ...

Site Reliability Engineer

Hiring Organisation
EQUALS
Location
Greater London, England, United Kingdom
recommendation engine that matches people by musical taste. THE ROLE We're looking for a Site Reliability Engineer to own the infrastructure, observability, and operational health of the Equals platform. You'll be the person who monitors systems needs and health to provide a seamless user experience while providing traceability … 1B+ rows) - Manage Cloudflare (WAF, bot management, DNS, firewall rules) - Make cost-conscious infrastructure decisions - right-sizing instances, storage tiering, optimizing spend Monitoring & Observability - Own the Datadog APM setup: tracing, alerting, dashboards, log management - Maintain and tune alert channels integrated with Slack - Reduce alert fatigue by tuning thresholds, suppressing false ...