1 to 25 of 28 Observability Jobs in the East of England

Lead Azure Platform Engineer

Hiring Organisation
Canada Life UK
Location
Potters Bar, Hertfordshire, South East, United Kingdom
Employment Type
Part Time
Champion consistent patterns for networking, identity, security and landing zones. Lead the development of CI/CD pipelines and automated infrastructure delivery. Promote strong observability, monitoring and alerting practices. Take part in incident response, root cause analysis and platform stability improvements. Balance build-and-run responsibilities with a focus ...

Site Reliability Engineer

Hiring Organisation
Anglian Water
Location
Huntingdon, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent
looking for * Experience in site reliability engineering or DevOps roles * Strong scripting and automation skills (e.g., Python, Bash) * Knowledge of monitoring tools and observability practices * Understanding of cloud infrastructure and containerisation * Excellent problem-solving and analytical abilities * Commitment to continuous improvement and operational excellence Benefits As a valued employee ...

Data Engineer

Hiring Organisation
Saffron Housing
Location
Norwich, Norfolk, England, United Kingdom
Employment Type
Full-Time
Salary
£56,000 per annum
Engineering, Pipelines). Apply CI/CD practices (e.g., Azure DevOps) for version control, deployment automation, and environment management. Implement data quality checks, pipeline observability, alerting, and automated monitoring to ensure consistent platform reliability. Work collaboratively with data owners and the wider data team to ensure data definitions, lineage ...

Lead Engineer, Site Reliability

Hiring Organisation
GÉANT
Location
Cambridge, England, United Kingdom
challenges your team faces. Keep Europe online: Guarantee 99.9%+ uptime for identity services used by millions. Design resilience: Build monitoring and observability that spots issues before they happen. Automate at scale: Replace manual tasks with smart automation and CI/CD pipelines. Champion security: Apply best practices and compliance ...

Senior Software Engineer

Hiring Organisation
Method Resourcing
Location
St Albans, England, United Kingdom
real-time/event-driven architectures in complex environments Collaborating with Product, Architecture, and Engineering leadership to shape solutions Improving system performance, reliability, and observability in production Contributing to engineering standards, best practices, and technical direction Required Skills/Experience: This isn’t about ticking every box but you should ...

Head of Software Engineering - Peterborough

Hiring Organisation
Circle Group
Location
Peterborough, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£90,000
reduce manual effort. Improve system resilience and reduce operational fragility through structural, strategic improvements rather than reactive firefighting. Lead the evolution of cloud foundations, observability, security, and recovery capabilities to support a modern, scalable technology estate. They are looking to pay a starting salary of £75,000 - £90,000 + ...

Senior Machine Learning Engineer - Agentic AI Platform

Hiring Organisation
Robert Half Limited
Location
Cambridge, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent, Work From Home
within the agent framework. Inference & Performance: Optimize LLM integration, latency, and cost efficiency. State & Reliability: Strengthen Redis-backed persistence and ensure system consistency. Evaluation & Observability: Build regression frameworks and implement monitoring and tracing. What We're Looking For Strong Python engineering experience with production-grade systems Hands-on with ...

Founding Engineer

Hiring Organisation
RedTech Recruitment Ltd
Location
Cambridge, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent
Salary
£95,000
develop high-quality frontend interfaces that make complex AI outputs intuitive and actionable for users Build and maintain deployment pipelines, testing frameworks, monitoring, and observability systems Design and implement secure data pipelines with appropriate access controls and auditability Ensure the platform meets enterprise-grade security and compliance requirements ...

Domain Architect

Hiring Organisation
TALENT INTERNATIONAL UK LTD
Location
Norwich, Norfolk, UK
ADRs). Key Requirements: Technical Foundation: Solid experience with Microservices, Distributed Systems, and Cloud-native design. Architecture Principles: Understanding of DDD, resilience patterns, and observability (logging/tracing). Reliability: Familiarity with SLO/SLI concepts and availability modeling. Bonus: Knowledge of the energy sector or trading platforms. Essential Skills ...

Domain Architect

Hiring Organisation
TALENT INTERNATIONAL UK LTD
Location
Colchester, Essex, UK
ADRs). Key Requirements: Technical Foundation: Solid experience with Microservices, Distributed Systems, and Cloud-native design. Architecture Principles: Understanding of DDD, resilience patterns, and observability (logging/tracing). Reliability: Familiarity with SLO/SLI concepts and availability modeling. Bonus: Knowledge of the energy sector or trading platforms. Essential Skills ...

Domain Architect

Hiring Organisation
TALENT INTERNATIONAL UK LTD
Location
Luton, Bedfordshire, UK
ADRs). Key Requirements: Technical Foundation: Solid experience with Microservices, Distributed Systems, and Cloud-native design. Architecture Principles: Understanding of DDD, resilience patterns, and observability (logging/tracing). Reliability: Familiarity with SLO/SLI concepts and availability modeling. Bonus: Knowledge of the energy sector or trading platforms. Essential Skills ...

Domain Architect

Hiring Organisation
TALENT INTERNATIONAL UK LTD
Location
Peterborough, Cambridgeshire, UK
ADRs). Key Requirements: Technical Foundation: Solid experience with Microservices, Distributed Systems, and Cloud-native design. Architecture Principles: Understanding of DDD, resilience patterns, and observability (logging/tracing). Reliability: Familiarity with SLO/SLI concepts and availability modeling. Bonus: Knowledge of the energy sector or trading platforms. Essential Skills ...

Domain Architect

Hiring Organisation
TALENT INTERNATIONAL UK LTD
Location
Ipswich, Suffolk, UK
ADRs). Key Requirements: Technical Foundation: Solid experience with Microservices, Distributed Systems, and Cloud-native design. Architecture Principles: Understanding of DDD, resilience patterns, and observability (logging/tracing). Reliability: Familiarity with SLO/SLI concepts and availability modeling. Bonus: Knowledge of the energy sector or trading platforms. Essential Skills ...

Domain Architect

Hiring Organisation
TALENT INTERNATIONAL UK LTD
Location
Hemel Hempstead, Hertfordshire, UK
ADRs). Key Requirements: Technical Foundation: Solid experience with Microservices, Distributed Systems, and Cloud-native design. Architecture Principles: Understanding of DDD, resilience patterns, and observability (logging/tracing). Reliability: Familiarity with SLO/SLI concepts and availability modeling. Bonus: Knowledge of the energy sector or trading platforms. Essential Skills ...

IT Expert Principal

Hiring Organisation
Hays Specialist Recruitment Limited
Location
Hatfield, Hertfordshire, England, United Kingdom
Employment Type
Contractor
Contract Rate
£500 - £600 per day
. Hours: 37.5 hours a week. Monday - Friday. Time: 9:00 AM - 5:30 PM Job Description: The client is looking for an Enterprise Observability Consultant with strong experience across vendor and open-source observability platforms such as Dynatrace, Splunk, Grafana, Cribl, OpenTelemetry, and Prometheus. Responsibilities Lead observability assessments, discovery … workshops, and roadmap creation Advise on observability strategy, tooling, and best practices Design end-to-end observability architectures (logs, metrics, traces, RUM, synthetics, APM) Implement and integrate platforms including Dynatrace, Splunk, Grafana Cloud, Elastic, and Cribl Build telemetry pipelines, dashboards, alerting, and automation Perform root-cause analysis and performance optimisation ...

Principal Engineer

Hiring Organisation
Synergetic Recruitment Group Limited
Location
Chelmsford, Essex, United Kingdom
Employment Type
Permanent
Salary
GBP 100,000 Annual
client is scaling a large, distributed cloud platform and is looking for a Principal Engineer to act as the Subject Matter Expert (SME) across observability and cloud infrastructure. Youll be working at serious scale managing thousands of Kubernetes nodes, handling tens of terabytes of logs daily, and supporting millions ...

Principal Engineer

Hiring Organisation
Synergetic Recruitment Group Limited
Location
Chelmsford, Essex, South East, United Kingdom
Employment Type
Permanent
client is scaling a large, distributed cloud platform and is looking for a Principal Engineer to act as the Subject Matter Expert (SME) across observability and cloud infrastructure. Youll be working at serious scale managing thousands of Kubernetes nodes, handling tens of terabytes of logs daily, and supporting millions … highly distributed environment. The Role This is a senior, hands-on role where you will own the technical direction and standards of the observability ecosystem. As the SME, youll define best practice, guide architectural decisions, and act as the go-to expert across engineering teams, ensuring scalable, cost-efficient ...

Principal Engineer

Hiring Organisation
Synergetic
Location
Cambridgeshire, England, United Kingdom
client is scaling a large, distributed cloud platform and is looking for a Principal Engineer to act as the Subject Matter Expert (SME) across observability and cloud infrastructure. You’ll be working at serious scale managing thousands of Kubernetes nodes, handling tens of terabytes of logs daily, and supporting millions … highly distributed environment. The Role This is a senior, hands-on role where you will own the technical direction and standards of the observability ecosystem. As the SME, you’ll define best practice, guide architectural decisions, and act as the go-to expert across engineering teams, ensuring scalable, cost-efficient ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Luton, Bedfordshire, UK
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Cambridge, Cambridgeshire, UK
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Ipswich, Suffolk, UK
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Norwich, Norfolk, UK
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Stevenage, Hertfordshire, UK
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Basildon, Essex, UK
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

DevSecOps Security Engineer - AWS, Security

Hiring Organisation
Adecco
Location
Cambridge, Cambridgeshire, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £100,000 per annum
prioritisation.* Partner with engineering teams to resolve issues efficiently and pragmatically.* Refine detection tooling by tuning logic and reducing unnecessary or inaccurate alerts.Operational Readiness & Observability* Strengthen visibility across systems through improved log pipelines, alerting pathways, and monitoring strategies.* Contribute to updating response guidelines, runbooks, and incident-handling materials.* Support initiatives … Kubernetes Security, Infrastructure as Code, Terraform, CloudFormation, Pipeline Security, Cloud Governance, Policy as Code, Secrets Management, Identity and Access Management, Vulnerability Remediation, Threat Detection, Observability, Logging, Automation Engineering, Python, Bash, Zero Trust, Security Hardening, Cloud Monitoring, Least Privilege, Compliance Automation, Security Orchestration About AdeccoAdecco is acting as an Employment Agency. ...