1 to 25 of 27 Observability Jobs in the East of England

Lead Azure Platform Engineer

Hiring Organisation
Canada Life UK
Location
Potters Bar, Hertfordshire, South East, United Kingdom
Employment Type
Part Time
Champion consistent patterns for networking, identity, security and landing zones. Lead the development of CI/CD pipelines and automated infrastructure delivery. Promote strong observability, monitoring and alerting practices. Take part in incident response, root cause analysis and platform stability improvements. Balance build-and-run responsibilities with a focus ...

Senior DevOps Engineer

Hiring Organisation
Broster Buchanan
Location
Peterborough, Cambridgeshire, England, United Kingdom
Employment Type
Full-Time
Salary
£50,000 - £65,000 per annum
pipelines and automate processes to improve reliability and reduce manual work Strong understanding of cloud and hybrid infrastructure, with a focus on enhancing observability, logging, and operational tooling Experience in incident response, root cause investigations, and implementing fixes to improve stability and prevent recurrence Preferred Qualifications: Proven track record ...

Senior AWS Engineer

Hiring Organisation
Omnigen Biodata
Location
Cambridge, England, United Kingdom
maintain GraphQL APIs Work with DynamoDB and Postgres Write production-quality Python Manage infrastructure with Terraform/Terraform Cloud Improve reliability, scalability, and observability Tech Stack AWS (Lambda, ECS, SQS, Step Functions, Cognito) · Terraform · Terraform Cloud · Docker · GraphQL · Athena · Parquet · DynamoDB · Postgres · GitLab · Python What We’re Looking For Senior ...

Data Engineer

Hiring Organisation
Saffron Housing
Location
Norwich, Norfolk, East Anglia, United Kingdom
Employment Type
Permanent
Salary
£55,000
Data Engineering, Pipelines). Apply CI/CD practices (e.g., Azure DevOps) for version control, deploymentautomation, and environment management. Implement data quality checks, pipeline observability, alerting, and automatedmonitoring to ensure consistent platform reliability. Work collaboratively with data owners and the wider data team to ensure data definitions, lineage, and ownership ...

Data Engineer

Hiring Organisation
Saffron Housing
Location
Norwich, Norfolk, England, United Kingdom
Employment Type
Full-Time
Salary
£56,000 per annum
Engineering, Pipelines). Apply CI/CD practices (e.g., Azure DevOps) for version control, deployment automation, and environment management. Implement data quality checks, pipeline observability, alerting, and automated monitoring to ensure consistent platform reliability. Work collaboratively with data owners and the wider data team to ensure data definitions, lineage ...

Lead Engineer, Site Reliability

Hiring Organisation
GÉANT
Location
Cambridge, England, United Kingdom
challenges your team faces. Keep Europe online: Guarantee 99.9%+ uptime for identity services used by millions. Design resilience: Build monitoring and observability that spots issues before they happen. Automate at scale: Replace manual tasks with smart automation and CI/CD pipelines. Champion security: Apply best practices and compliance ...

Head of Software Engineering - Peterborough

Hiring Organisation
Circle Group
Location
Peterborough, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£90,000
reduce manual effort. Improve system resilience and reduce operational fragility through structural, strategic improvements rather than reactive firefighting. Lead the evolution of cloud foundations, observability, security, and recovery capabilities to support a modern, scalable technology estate. They are looking to pay a starting salary of £75,000 - £90,000 + ...

Senior Machine Learning Engineer - Agentic AI Platform

Hiring Organisation
Robert Half Limited
Location
Cambridge, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent, Work From Home
within the agent framework. Inference & Performance: Optimize LLM integration, latency, and cost efficiency. State & Reliability: Strengthen Redis-backed persistence and ensure system consistency. Evaluation & Observability: Build regression frameworks and implement monitoring and tracing. What We're Looking For Strong Python engineering experience with production-grade systems Hands-on with ...

Senior Developer

Hiring Organisation
Addition
Location
Watford, Hertfordshire, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 per annum
Doing: Designing, deploying and managing automation and monitoring platforms that support large-scale applications and services Building and maintaining monitoring, alerting and observability tooling across the platform Creating dashboards that translate complex technical data into meaningful insights for stakeholders Developing automation to integrate new systems using existing frameworks Managing … Docker) Strong Python development skills , including scripting and Lambda functions Experience building and managing CI/CD pipelines , ideally with GitHub Actions Monitoring and observability tooling such as AppDynamics, Grafana, InfluxDB, Graphite, Sensu or similar Experience working with serverless architectures (Lambda, API Gateway, DynamoDB, EventBridge) Solid understanding of Linux/ ...

IT Expert Principal

Hiring Organisation
Hays Specialist Recruitment Limited
Location
Hatfield, Hertfordshire, England, United Kingdom
Employment Type
Contractor
Contract Rate
£500 - £600 per day
. Hours: 37.5 hours a week. Monday - Friday. Time: 9:00 AM - 5:30 PM Job Description: The client is looking for an Enterprise Observability Consultant with strong experience across vendor and open-source observability platforms such as Dynatrace, Splunk, Grafana, Cribl, OpenTelemetry, and Prometheus. Responsibilities Lead observability assessments, discovery … workshops, and roadmap creation Advise on observability strategy, tooling, and best practices Design end-to-end observability architectures (logs, metrics, traces, RUM, synthetics, APM) Implement and integrate platforms including Dynatrace, Splunk, Grafana Cloud, Elastic, and Cribl Build telemetry pipelines, dashboards, alerting, and automation Perform root-cause analysis and performance optimisation ...

Principal Engineer

Hiring Organisation
Synergetic Recruitment Group Limited
Location
Chelmsford, Essex, United Kingdom
Employment Type
Permanent
Salary
GBP 100,000 Annual
client is scaling a large, distributed cloud platform and is looking for a Principal Engineer to act as the Subject Matter Expert (SME) across observability and cloud infrastructure. Youll be working at serious scale managing thousands of Kubernetes nodes, handling tens of terabytes of logs daily, and supporting millions ...

Senior Software Engineer

Hiring Organisation
Retelligence
Location
Cambridge, England, United Kingdom
Senior Software Engineer (.NET & Hybrid Cloud) Location: Cambridge, Hybrid Salary: £75,000 – £95,000 + Benefits The Opportunity We are seeking a Senior Software Engineer to join a team building high-integrity, scalable platforms. Our ...

Principal Engineer

Hiring Organisation
Synergetic Recruitment Group Limited
Location
Chelmsford, Essex, South East, United Kingdom
Employment Type
Permanent
client is scaling a large, distributed cloud platform and is looking for a Principal Engineer to act as the Subject Matter Expert (SME) across observability and cloud infrastructure. Youll be working at serious scale managing thousands of Kubernetes nodes, handling tens of terabytes of logs daily, and supporting millions … highly distributed environment. The Role This is a senior, hands-on role where you will own the technical direction and standards of the observability ecosystem. As the SME, youll define best practice, guide architectural decisions, and act as the go-to expert across engineering teams, ensuring scalable, cost-efficient ...

Principal Engineer

Hiring Organisation
Synergetic
Location
Cambridgeshire, England, United Kingdom
client is scaling a large, distributed cloud platform and is looking for a Principal Engineer to act as the Subject Matter Expert (SME) across observability and cloud infrastructure. You’ll be working at serious scale managing thousands of Kubernetes nodes, handling tens of terabytes of logs daily, and supporting millions … highly distributed environment. The Role This is a senior, hands-on role where you will own the technical direction and standards of the observability ecosystem. As the SME, you’ll define best practice, guide architectural decisions, and act as the go-to expert across engineering teams, ensuring scalable, cost-efficient ...

Technical Lead

Hiring Organisation
Broster Buchanan
Location
Peterborough, Cambridgeshire, England, United Kingdom
Employment Type
Full-Time
Salary
£95,000 per annum
discipline. Review and drive new innovations, ensuring they are engineered for reliability, recoverability and scale. Remove the divide between build and run, strengthening automation, observability and disaster recovery across the platform. Lead the evolution of service operations, working with teams to ensure incidents and operational feedback are used to improve ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Chelmsford, Essex, UK
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Ipswich, Suffolk, UK
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Norwich, Norfolk, UK
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Watford, Hertfordshire, UK
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Peterborough, Cambridgeshire, UK
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Luton, Bedfordshire, UK
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

DevSecOps Security Engineer - AWS, Security

Hiring Organisation
Adecco
Location
Cambridge, Cambridgeshire, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £100,000 per annum
prioritisation.* Partner with engineering teams to resolve issues efficiently and pragmatically.* Refine detection tooling by tuning logic and reducing unnecessary or inaccurate alerts.Operational Readiness & Observability* Strengthen visibility across systems through improved log pipelines, alerting pathways, and monitoring strategies.* Contribute to updating response guidelines, runbooks, and incident-handling materials.* Support initiatives … Kubernetes Security, Infrastructure as Code, Terraform, CloudFormation, Pipeline Security, Cloud Governance, Policy as Code, Secrets Management, Identity and Access Management, Vulnerability Remediation, Threat Detection, Observability, Logging, Automation Engineering, Python, Bash, Zero Trust, Security Hardening, Cloud Monitoring, Least Privilege, Compliance Automation, Security Orchestration About AdeccoAdecco is acting as an Employment Agency. ...

Data Platform Solution Architect

Hiring Organisation
MarkJames 🌍
Location
Essex, England, United Kingdom
Define and implement data lakehouse solutions using Apache Iceberg and S3 Lead performance tuning across Snowflake, Airflow, and Iceberg environments Ensure platform reliability, observability, and scalability Drive adoption of cloud-native design patterns and best practices Collaborate with engineering, DevOps, and business stakeholders Requirements Strong experience in Solution Architecture … architectures (Iceberg preferred) Expertise in performance tuning and optimisation Nice to Have CI/CD and DevOps practices Terraform/Infrastructure as Code Monitoring & observability tools (APM) Data governance & catalog tools Cloud security best practices Data modelling and ingestion frameworks ...

Senior System Reliability Engineer - REMOTE FROM IRELAND

Hiring Organisation
Caspian One Ltd
Location
Ireland, Bedfordshire, United Kingdom
Employment Type
Permanent
Salary
EUR 125,000 - 175,000 Annual
responsible for the reliability, performance, and operational excellence of a large-scale, bare-metal trading platform. This is a hybrid role combining systems engineering, observability, automation, and Real Time operational support. You'll work across the full stack - (Linux, networking, applications, hardware) and play a key role in building … resolve issues across OS, network, hardware, and application layers Build and improve automation, tooling, and configuration management (Ansible or similar) Develop and maintain observability dashboards, alerts, and telemetry pipelines Participate in deployments, start-up/shutdown procedures, and change management Contribute to engineering projects such as OS tuning, Kernel-level ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
Birchanger, Hertfordshire, United Kingdom
Employment Type
Permanent
Salary
GBP 40,000 - 50,000 Annual
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, modern technology estate. Responsibilities … improve speed, accuracy and consistency Supporting major changes, deployments and post-incident reviews with data-driven evidence Qualifications Strong experience with monitoring and observability tools (LogicMonitor, Azure Monitor, App Insights, Log Analytics, Defender for Cloud) Excellent understanding of cloud performance, IaaS/PaaS, networking fundamentals, API performance and capacity modelling ...