576 to 600 of 819 Observability Jobs in the UK

Senior / Principal DevOps Engineer

Hiring Organisation
Hays Specialist Recruitment Limited
Location
Bury, Lancashire, England, United Kingdom
Employment Type
Contractor
Contract Rate
£700 - £800 per day
best practices across engineering teams and onboard products onto shared platforms. Build and maintain secure, scalable, and high-performing cloud infrastructure in AWS. Implement observability, monitoring, and operational insights across multiple environments. Improve deployment processes, reduce friction, and enable self-service capabilities for development teams. Support cloud and infrastructure incident … focus on automation. Experience with containerisation and workload orchestration technologies. Scripting and programming experience with tools such as Python and Bash. Strong understanding of observability, reliability, and operational best practices. Knowledge of information security principles and experience embedding security throughout the software delivery lifecycle. If you're interested in this ...

Senior / Principal DevOps Engineer

Hiring Organisation
Hays Technology
Location
Bury, Greater Manchester, United Kingdom
Employment Type
Contract
Contract Rate
£700 - £800/day £700 - £800 p/d (depending on level)
best practices across engineering teams and onboard products onto shared platforms. Build and maintain secure, scalable, and high-performing cloud infrastructure in AWS. Implement observability, monitoring, and operational insights across multiple environments. Improve deployment processes, reduce friction, and enable self-service capabilities for development teams. Support cloud and infrastructure incident … focus on automation. Experience with containerisation and workload orchestration technologies. Scripting and programming experience with tools such as Python and Bash. Strong understanding of observability, reliability, and operational best practices. Knowledge of information security principles and experience embedding security throughout the software delivery lifecycle. If you're interested in this ...

Forward Deployed AI Engineer

Hiring Organisation
WTW
Location
Greater London, United Kingdom
Employment Type
Full Time
enabled systems. You’ll bring deep expertise across modern full-stack technologies (.NET, Azure, SQL, React/Angular), along with experience in distributed systems, observability, and AI tooling such as LLMs, retrieval pipelines, and agentic workflows. Acting as a bridge between business and technology, you’ll work across product, data … orchestration, evaluation loops, and human-in-the-loop controls. Enterprise integration: Integrate AI solutions with enterprise systems, APIs, data platforms, document repositories, workflow tools, observability platforms, and identity and access management services. Production engineering: Ensure AI solutions meet enterprise standards for reliability, scalability, latency, maintainability, cost control, logging, monitoring ...

Software Engineering Manager

Hiring Organisation
Centrica - CHP
Location
United Kingdom
Employment Type
Permanent
best practice, reduce duplication, and promote maintainable, secure and performant systems. Enhance delivery capability through platform reliability and DevOps maturity - Continuously improve deployment pipelines, observability, alerting, incident handling, recovery procedures and operational readiness across Field Ops engineering teams. Manage stakeholders and ensure transparent communications - Build strong relationships across product, operations … decisions Funding for technical enablers Field Ops workflow design and data requirements Use of Data/Insight/Automation Uses engineering metrics, performance insights, observability data and AI[1]assisted diagnostics to guide decisions. Ensures human judgement remains central. Constraints Centrica architectural principles, engineering guardrails, data privacy/security policies ...

Software Engineering Manager

Hiring Organisation
17918
Location
London, United Kingdom
best practice, reduce duplication, and promote maintainable, secure and performant systems. Enhance delivery capability through platform reliability and DevOps maturity - Continuously improve deployment pipelines, observability, alerting, incident handling, recovery procedures and operational readiness across Field Ops engineering teams. Manage stakeholders and ensure transparent communications - Build strong relationships across product, operations … decisions Funding for technical enablers Field Ops workflow design and data requirements Use of Data/Insight/Automation Uses engineering metrics, performance insights, observability data and AI[1]assisted diagnostics to guide decisions. Ensures human judgement remains central. Constraints Centrica architectural principles, engineering guardrails, data privacy/security policies ...

Software Engineering Manager

Hiring Organisation
Centrica - CHP
Location
Leicester, Leicestershire, East Midlands, United Kingdom
Employment Type
Permanent
best practice, reduce duplication, and promote maintainable, secure and performant systems. Enhance delivery capability through platform reliability and DevOps maturity - Continuously improve deployment pipelines, observability, alerting, incident handling, recovery procedures and operational readiness across Field Ops engineering teams. Manage stakeholders and ensure transparent communications - Build strong relationships across product, operations … decisions Funding for technical enablers Field Ops workflow design and data requirements Use of Data/Insight/Automation Uses engineering metrics, performance insights, observability data and AI[1]assisted diagnostics to guide decisions. Ensures human judgement remains central. Constraints Centrica architectural principles, engineering guardrails, data privacy/security policies ...

Senior Site Reliability Engineer

Hiring Organisation
17918
Location
United Kingdom
Perform detailed post-incident investigations to identify underlying causes. Document findings and share learnings to prevent recurrence. Implement preventive measures and continuous improvement processes. Observability Champion monitoring, logging, and alerting strategies using tools like Prometheus, Grafana, ELK, and AWS CloudWatch. Build real-time dashboards to visualize system health and reliability … culture of shared responsibility for uptime and performance across engineering teams. Qualifications Deep expertise with various AWS services. Advanced knowledge of monitoring and observability tools. Strong leadership capabilities with a focus on setting clear direction, aligning team efforts with organizational goals, and maintaining high levels of motivation and engagement across ...

Senior Site Reliability Engineer

Hiring Organisation
Experian Ltd
Location
Nottingham, Nottinghamshire, East Midlands, United Kingdom
Employment Type
Permanent, Work From Home
Perform detailed post-incident investigations to identify underlying causes. Document findings and share learnings to prevent recurrence. Implement preventive measures and continuous improvement processes. Observability Champion monitoring, logging, and alerting strategies using tools like Prometheus, Grafana, ELK, and AWS CloudWatch. Build real-time dashboards to visualize system health and reliability … culture of shared responsibility for uptime and performance across engineering teams. Qualifications Deep expertise with various AWS services. Advanced knowledge of monitoring and observability tools. Strong leadership capabilities with a focus on setting clear direction, aligning team efforts with organizational goals, and maintaining high levels of motivation and engagement across ...

Software Engineering Manager - Tooling and Optimisations

Hiring Organisation
Centrica - CHP
Location
Windsor, Berkshire, South East, United Kingdom
Employment Type
Permanent
practice, reduce duplication, and support maintainable, secure and high-performing systems. Improve delivery capability through platform reliability and DevOps maturity Continuously strengthen deployment pipelines, observability, alerting, incident response, recovery procedures and operational readiness across Field Ops engineering teams. Manage stakeholders and maintain clear communication Build trusted relationships across product, operations … data modelling and data quality controls. Ability to produce both high-level and detailed design specifications. Experience leading DevOps practices, including CI/CD, observability, monitoring and incident management. Demonstrated capability leading multi-squad engineering delivery in a product-led organisation. Mindset & Ways of Working Comfortable working in iterative, outcome ...

AI Engineer

Hiring Organisation
Elsevier
Location
Greater London, United Kingdom
Employment Type
Full Time
within a defined problem, building and testing tool use, retrieval pipelines and agent workflows, integrating AI capabilities into enterprise systems, and contributing to evaluation, observability and guardrails. You will hold a high bar on code quality, flag risks and blockers early, and work alongside host-function stakeholders to make sure … agentic AI solutions to production standard within a defined technical approach. Implement and test tool use, retrieval pipelines, and agent workflows. Contribute to evaluation, observability and guardrails for agentic systems. Integrate AI capabilities into existing enterprise workflows and systems. Maintain high code quality and documentation so patterns can be reused. ...

QA Test Infrastructure Engineer

Hiring Organisation
Talent Locker
Location
Cheltenham, Gloucestershire, South West, United Kingdom
Employment Type
Contract
QA Test Infrastructure Engineer - Tauton, Onsite - Outside IR35 - Highest Security Clearance As a QA Test Infrastructure Engineer, you'll help design, build, and deliver secure digital solutions in highly secure environments. You'll work alongside ...

SRE Managing Consultant - Cloud Operating Model

Hiring Organisation
Capgemini
Location
Manchester, United Kingdom
Employment Type
Full Time
Budgets : Establish service measures and targets (SLIs/SLOs) and introduce Error Budgets to enable data-driven trade-offs between reliability and delivery velocity. Observability & Operational Insight: Shape observability approaches (metrics/logs/traces) and operational monitoring models that make reliability risks visible and actionable, improving operational decision-making. … large‐scale delivery contexts; associate‐level certifications are desirable but not mandatory. Design, establish, and evolve SRE‐led centres of excellence (e.g. Reliability, Observability, or Operational Excellence), setting enterprise‐level standards for SLIs/SLOs, incident management, observability, and continuous improvement across cloud and hybrid platforms. Exposure to modern observability ...

DevOps Engineer

Hiring Organisation
Oscar Associates (UK) Limited
Location
Manchester, North West, United Kingdom
Employment Type
Permanent
Salary
£70,000
scalable, reliable and cost-efficient as it moves into full production. Working closely with engineering teams, you'll drive automation, improve deployment pipelines, strengthen observability and ensure the platform performs under high-volume, real-time workloads. This is a hands-on position with genuine ownership and plenty of opportunity … enhancing CI/CD pipelines with blue/green deployments and automated rollback Driving platform reliability, resilience and scalability Developing monitoring, alerting and observability across the environment Managing cloud costs and implementing best FinOps practices Participating in a small production on-call rota Technology AWS ECS Fargate Terraform Aurora ...

Data Engineer-Must have strong GCP experience-Inside IR35

Hiring Organisation
Reed Technology
Location
London, United Kingdom
Employment Type
Temporary
Salary
£425/day POSSIBLY NEGOTIABLE
Standardise ingestion and transformation using configuration-driven frameworks Embed data quality checks by default (schema validation, completeness, freshness, thresholds, alerting) Improve pipeline resilience, monitoring, observability and recovery mechanisms Integrate AI/ML capabilities where appropriate (e.g. anomaly detection, intelligent monitoring) Support delivery of a wider Data Strategy programme , improving consistency … Cloud Run/App Engine Experience with CI/CD, automated testing and infrastructure as code Data Quality & Monitoring Experience implementing data quality frameworks, observability tooling and monitoring solutions Preferred Experience Building reusable pipeline frameworks for large, multi-domain platforms Delivery within enterprise data transformation programmes with strong SLAs Exposure ...

Site Reliability Engineer's

Hiring Organisation
F5 consultants
Location
Reading, Berkshire, South East, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£70,000
support, shared ownership, and continuous improvement. You'll work hands-on in a modern cloud-native environment leveraging Kubernetes, OpenShift, GitOps, service mesh, and observability tooling There is genuine investment in your development through training, certifications, and the expertise of those around you. You'll also be part … Ability to work within complex multi-cloud or hybrid environments with a solid foundation in distributed systems Expertise in observability tooling such as Prometheus, Grafana, Loki, and Tempo Proficiency in IaC tools such as Kustomize and Helm, with scripting skills in Bash/Python Experience managing GitOps pipelines using Tekton ...

Global DevOps Lead

Hiring Organisation
Stott & May Professional Search Limited
Location
United Kingdom
Employment Type
Permanent, Work From Home
Salary
£95,000
with engineering, cloud, and operations teams to deliver a modern, automated, and scalable platform. You'll drive DevOps strategy across infrastructure, CI/CD, observability, SRE, and cloud optimisation while influencing senior stakeholders across the business. Key Responsibilities - Define and implement a global DevOps operating model, including governance, standards … initiatives. - Partner with engineering and cloud teams to establish clear ownership across DevOps and infrastructure. - Lead the implementation and optimisation of enterprise monitoring and observability using Datadog. - Build scalable deployment pipelines that improve release quality and speed. - Establish and monitor DORA metrics, driving improvements in deployment frequency, lead time, change ...

Site Reliability Engineer (SRE) - Cloud & Automation

Hiring Organisation
Spencer Rose Ltd
Location
London, United Kingdom
Employment Type
Permanent
Salary
GBP 60,000 - 70,000 Annual
implementation of SRE practices across the organisation, working closely with infrastructure teams to optimise deployment processes and embed automation and operational excellence. Enhance observability and reliability , defining and implementing SLAs, SLOs and SLIs to improve alerting, monitoring, and capacity planning. Identify and eliminate toil , developing frameworks to analyse recurring issues … beneficial). Experience supporting and building multi-environment, multi-region cloud platforms (AWS or GCP), using IaC and GitOps workflows. Hands-on experience with observability/APM tooling such as Grafana, Datadog or Dynatrace. Background working in regulated financial services or banking environments. Excellent troubleshooting, analytical and communication skills, able ...

Vice President, DevOps Production Services

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
enterprise applications and ensure platform stability, resiliency, and availability. Monitor application health, system performance, batch jobs, interfaces, and alerts using enterprise monitoring and observability tools. Investigate, troubleshoot, and resolve production incidents within defined SLAs. Perform root cause analysis (RCA) for recurring issues and drive permanent fixes. Analyze production logs, identify … Cloud experience preferred. Knowledge of automation/scripting using Python, Shell, or PowerShell. Exposure to DevOps/SRE practices, CI/CD pipelines, and observability tooling. Strong communication skills with the ability to provide concise incident and executive status updates. #J-18808-Ljbffr ...

Site Reliability Engineer

Hiring Organisation
Connells Limited
Location
Milton Keynes, Buckinghamshire, South East, United Kingdom
Employment Type
Permanent, Work From Home
hands-on role in ensuring it is reliable, scalable, and observable. You will help establish and mature SRE practices, focusing on: Monitoring and observability Incident response Post-incident review Reliability testing and capacity planning Toil reduction Enabling development velocity We offer a hybrid working arrangement with one day per week … Build dashboards, alerts, and runbooks to improve visibility Automate repetitive tasks to reduce operational toil Collaborate with cross-functional teams to enhance reliability and observability Support performance testing and capacity planning Proactively identify and prioritise reliability improvements Experience & Skills Required: Hands-on experience with Azure Monitoring (Application Insights, Alerts, Action ...

Site Reliability Engineer

Hiring Organisation
Connells Group HQ
Location
Milton Keynes, Buckinghamshire, England, United Kingdom
Employment Type
Full-Time
Salary
£40,000 - £55,000 per annum
hands-on role in ensuring it is reliable, scalable, and observable. You will help establish and mature SRE practices, focusing on: Monitoring and observability Incident response Post-incident review Reliability testing and capacity planning Toil reduction Enabling development velocity We offer a hybrid working arrangement with one day per week … Build dashboards, alerts, and runbooks to improve visibility Automate repetitive tasks to reduce operational toil Collaborate with cross-functional teams to enhance reliability and observability Support performance testing and capacity planning Proactively identify and prioritise reliability improvements Experience & Skills Required: Hands-on experience with Azure Monitoring (Application Insights, Alerts, Action ...

Vice President Software Engineering

Hiring Organisation
Jobleads-UK
Location
City of Edinburgh, Scotland, United Kingdom
where 80–90% of code is AI‐generated, with a roadmap to 95%+. Embed modern engineering excellence (CI/CD, trunk‐based development, observability, and automated testing). Partner cross‐functionally across Product, DevOps, Security, and Platform teams. Build a high‐performance culture grounded in accountability, innovation, and continuous … where software is shipped to production frequently or daily. Expertise in modern practices including CI/CD pipelines, trunk‐based development, automated testing strategies, observability and system reliability. Proven ability to use engineering metrics to drive performance and continuous improvement. Organisational Design & Methodologies Experience designing and evolving engineering organisations using ...

Senior Java Engineer - FX eTrading

Hiring Organisation
Pontoon
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£800 - £900/day
data, order/risk workflows, and real-time streaming capabilities. Optimise Performance: Focus on improving latency, throughput, and reliability across the entire stack. Implement observability practises (metrics, tracing, logging) and conduct performance profiling. Establish Best Practises: Champion engineering excellence through code standards, testing strategies (unit/integration/… including market data, order flows, and execution workflows. Hands-On Skills: Proficiency with CI/CD, containerisation, cloud/on-prem deployments, and observability practises. AI Integration: Comfortable integrating AI coding tools into daily development workflows. Communication Skills: Excellent communication and stakeholder engagement abilities, with a track record of leading ...

Principal Data Engineer

Hiring Organisation
WTW
Location
Greater London, United Kingdom
Employment Type
Full Time
foundational governance capabilities: access security (Entra ID, Unity Catalog), data lineage tooling, CI/CD for data (Github Actions, Terraform, DBT Cloud), and observability practices. AI Fluency AI fluency is a core requirement of this role — in two distinct dimensions. First, you will design and build data infrastructure that powers … connect, where coupling creates risk, and how today's decisions constrain tomorrow's options. You hold a high bar for engineering quality — correctness, testability, observability, and documentation are non-negotiable, not nice-to-haves. You are pragmatic under pressure; you know when to build the right thing and when ...

Engineering Manager

Hiring Organisation
RWS
Location
Sheffield, United Kingdom
Employment Type
Full Time
what they build Building strong partnerships with product managers, designers, domain experts, and senior business stakeholders Establishing engineering practices: CI/CD, automated testing, observability, security scanning, and deployment workflows Creating an environment where engineers can do their best work, with clear expectations, regular feedback, and genuine development opportunities Helping … software engineering Experience with AWS and cloud-native architectures Strong understanding of containers and modern deployment practices A DevSecOps mindset with experience embedding security, observability, and operational ownership into engineering culture Track record of delivering complex software products or platforms Ability to balance strategic thinking with day-to-day execution ...

Software Engineer

Hiring Organisation
RWS
Location
Sheffield, United Kingdom
Employment Type
Full Time
security, reliability, and operation of the services you build, with a DevSecOps approach throughout Improving engineering practices including CI/CD pipelines, automated testing, observability, security scanning, and deployment workflows Collaborating closely with product managers, designers, domain experts, and other engineers to deliver meaningful outcomes Mentoring and supporting engineers across … DevSecOps mindset with experience owning the security and operation of services you build Understanding of modern delivery practices: CI/CD, automated testing, observability, and production ownership Ability to work across the full development lifecycle, from early design through to deployment and operations Clear communication skills and a collaborative working ...