22 of 22 Observability Jobs in the East of England

Site Reliability Engineer (Cambridge)

Hiring Organisation
Visa
Location
Cambridge, Cambridgeshire, UK
Employment Type
Part-time
Kubernetes (deploying or operating services). Exposure to service mesh technologies (e.g., Istio). Experience building or operating cloudnative or serverless applications. Familiarity with observability and data platforms such as Prometheus, Grafana, MongoDB, Elasticsearch, Kafka, and HashiCorp Vault. Understanding of application and data security fundamentals (authentication, authorization, encryption, TLS). ...

Site Reliability Engineer

Hiring Organisation
17918
Location
Cambridge, Cambridgeshire, United Kingdom
Kubernetes (deploying or operating services). Exposure to service mesh technologies (e.g., Istio). Experience building or operating cloudnative or serverless applications. Familiarity with observability and data platforms such as Prometheus, Grafana, MongoDB, Elasticsearch, Kafka, and HashiCorp Vault. Understanding of application and data security fundamentals (authentication, authorization, encryption, TLS). ...

Site Reliability Engineer

Hiring Organisation
Anglian Water
Location
Huntingdon, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent
Salary
£40,000
looking for * Experience in site reliability engineering or DevOps roles * Strong scripting and automation skills (e.g., Python, Bash) * Knowledge of monitoring tools and observability practices * Understanding of cloud infrastructure and containerisation * Excellent problem-solving and analytical abilities * Commitment to continuous improvement and operational excellence Benefits As a valued employee ...

Senior Solution Architect - Secure Networks

Hiring Organisation
Jobleads-UK
Location
Peterborough, England, United Kingdom
multiple priorities and deadlines Nice to Have Experience with Zero Trust architectures and modern security frameworks Exposure to cloud networking (AWS, Azure) Exposure within observability and AIOps (Splunk, Logic Monitor, Big Panda) Experience with automation and AI‐driven operations (Python, NetBox, Ansible, Terraform) Relevant certifications (CCNP/CCIE, JNCIP/ ...

Software Engineer

Hiring Organisation
Accountancy Action
Location
Hertfordshire, England, United Kingdom
Employment Type
Full-Time
Salary
£70,000 - £90,000 per annum
Exposure to AI systems, LLMs, or conversational workflows Experience in healthcare or regulated environments Knowledge of infrastructure-as-code (e.g. Terraform, CDK) Experience with observability, monitoring, and scaling production systems ...

System Performance Engineer (Cambridge)

Hiring Organisation
Visa
Location
Cambridge, Cambridgeshire, UK
Employment Type
Part-time
/JVM tuning or diagnostic experience. Basic Kubernetes setup or cluster experimentation experience. Understanding of networking fundamentals. Experience with opensource tools used for diagnostics, observability, or system performance. Basic programming ability (any language) to support automation or small tooling improvements. Exposure to distributed systems concepts or cloud environments ...

System Performance Engineer

Hiring Organisation
17918
Location
Cambridge, Cambridgeshire, United Kingdom
/JVM tuning or diagnostic experience. Basic Kubernetes setup or cluster experimentation experience. Understanding of networking fundamentals. Experience with opensource tools used for diagnostics, observability, or system performance. Basic programming ability (any language) to support automation or small tooling improvements. Exposure to distributed systems concepts or cloud environments ...

Senior Machine Learning Engineer - Agentic AI Platform

Hiring Organisation
Robert Half Limited
Location
Cambridge, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent, Work From Home
within the agent framework. Inference & Performance: Optimize LLM integration, latency, and cost efficiency. State & Reliability: Strengthen Redis-backed persistence and ensure system consistency. Evaluation & Observability: Build regression frameworks and implement monitoring and tracing. What We're Looking For Strong Python engineering experience with production-grade systems Hands-on with ...

Founding Engineer

Hiring Organisation
RedTech Recruitment Ltd
Location
Cambridge, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent
Salary
£95,000
develop high-quality frontend interfaces that make complex AI outputs intuitive and actionable for users Build and maintain deployment pipelines, testing frameworks, monitoring, and observability systems Design and implement secure data pipelines with appropriate access controls and auditability Ensure the platform meets enterprise-grade security and compliance requirements ...

Lead Site Reliability Engineer SRE Azure SaaS

Hiring Organisation
Client Server
Location
Cambridge, Cambridgeshire, United Kingdom
Employment Type
Permanent
Salary
GBP 100,000 Annual
Lead Site Reliability Engineer (SRE Azure SaaS) Cambridge/WFH to £100k Do you have expertise with observability and monitoring within a SaaS environment? You could be progressing your career in a hands-on, influential role at a global InsurTech business, working on a flagship product that has recently been ...

Principal Engineer

Hiring Organisation
Synergetic Recruitment Group Limited
Location
Chelmsford, Essex, United Kingdom
Employment Type
Permanent
Salary
GBP 100,000 Annual
client is scaling a large, distributed cloud platform and is looking for a Principal Engineer to act as the Subject Matter Expert (SME) across observability and cloud infrastructure. Youll be working at serious scale managing thousands of Kubernetes nodes, handling tens of terabytes of logs daily, and supporting millions ...

Lead Site Reliability Engineer SRE Azure SaaS

Hiring Organisation
Client Server
Location
Cherry Hinton, Cambridgeshire, UK
Employment Type
Full-time
Lead Site Reliability Engineer (SRE Azure SaaS) Cambridge/WFH to £100k Do you have expertise with observability and monitoring within a SaaS environment? You could be progressing your career in a hands-on, influential role at a global InsurTech business, working on a flagship product that has recently been ...

Lead Site Reliability Engineer SRE Azure SaaS

Hiring Organisation
Client Server
Location
Cambridge, Cambridgeshire, UK
Lead Site Reliability Engineer (SRE Azure SaaS) Cambridge/WFH to £100k Do you have expertise with observability and monitoring within a SaaS environment? You could be progressing your career in a hands-on, influential role at a global InsurTech business, working on a flagship product that has recently been ...

Principal Engineer

Hiring Organisation
Synergetic Recruitment Group Limited
Location
Chelmsford, Essex, South East, United Kingdom
Employment Type
Permanent
client is scaling a large, distributed cloud platform and is looking for a Principal Engineer to act as the Subject Matter Expert (SME) across observability and cloud infrastructure. Youll be working at serious scale managing thousands of Kubernetes nodes, handling tens of terabytes of logs daily, and supporting millions … highly distributed environment. The Role This is a senior, hands-on role where you will own the technical direction and standards of the observability ecosystem. As the SME, youll define best practice, guide architectural decisions, and act as the go-to expert across engineering teams, ensuring scalable, cost-efficient ...

Principal Engineer

Hiring Organisation
Synergetic
Location
Cambridgeshire, England, United Kingdom
client is scaling a large, distributed cloud platform and is looking for a Principal Engineer to act as the Subject Matter Expert (SME) across observability and cloud infrastructure. You’ll be working at serious scale managing thousands of Kubernetes nodes, handling tens of terabytes of logs daily, and supporting millions … highly distributed environment. The Role This is a senior, hands-on role where you will own the technical direction and standards of the observability ecosystem. As the SME, you’ll define best practice, guide architectural decisions, and act as the go-to expert across engineering teams, ensuring scalable, cost-efficient ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Stevenage, Hertfordshire, UK
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

DevSecOps Security Engineer - AWS, Security

Hiring Organisation
Adecco
Location
Cambridge, Cambridgeshire, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £100,000 per annum
prioritisation.* Partner with engineering teams to resolve issues efficiently and pragmatically.* Refine detection tooling by tuning logic and reducing unnecessary or inaccurate alerts.Operational Readiness & Observability* Strengthen visibility across systems through improved log pipelines, alerting pathways, and monitoring strategies.* Contribute to updating response guidelines, runbooks, and incident-handling materials.* Support initiatives … Kubernetes Security, Infrastructure as Code, Terraform, CloudFormation, Pipeline Security, Cloud Governance, Policy as Code, Secrets Management, Identity and Access Management, Vulnerability Remediation, Threat Detection, Observability, Logging, Automation Engineering, Python, Bash, Zero Trust, Security Hardening, Cloud Monitoring, Least Privilege, Compliance Automation, Security Orchestration About AdeccoAdecco is acting as an Employment Agency. ...

Lead Site Reliability Engineer SRE Azure SaaS

Hiring Organisation
Client Server
Location
Cambridge, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent, Work From Home
Lead Site Reliability Engineer (SRE Azure SaaS) Cambridge/WFH to £100k Do you have expertise with observability and monitoring within a SaaS environment? You could be progressing your career in a hands-on, influential role at a global InsurTech business, working on a flagship product that has recently been … happy to mentor and coach others, sharing your SRE expertise with software engineers and DevOps You have a strong knowledge of Azure including observability, monitoring, scaling, security and Azure DevOps pipelines You have experience with observability tools, Datadog preferred You have a good knowledge of automation, scripting (Python or PowerShell ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
Birchanger, Hertfordshire, United Kingdom
Employment Type
Permanent
Salary
GBP 40,000 - 50,000 Annual
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, modern technology estate. Responsibilities … improve speed, accuracy and consistency Supporting major changes, deployments and post-incident reviews with data-driven evidence Qualifications Strong experience with monitoring and observability tools (LogicMonitor, Azure Monitor, App Insights, Log Analytics, Defender for Cloud) Excellent understanding of cloud performance, IaaS/PaaS, networking fundamentals, API performance and capacity modelling ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
Stansted, Birchanger, Essex, United Kingdom
Employment Type
Permanent
Salary
£40000 - £50000/annum
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, modern technology estate. Responsibilities … improve speed, accuracy and consistency Supporting major changes, deployments and post-incident reviews with data-driven evidence Qualifications Strong experience with monitoring and observability tools (LogicMonitor, Azure Monitor, App Insights, Log Analytics, Defender for Cloud) Excellent understanding of cloud performance, IaaS/PaaS, networking fundamentals, API performance and capacity modelling ...

Lead AI Consultant

Hiring Organisation
Jobleads-UK
Location
Cambridge, England, United Kingdom
Well‐formed opinions on and experience of AI‐enhanced SDLCs Set teams up for sustainable success through: Strong engineering practices (testing, CI/CD, observability, quality) Clear system boundaries and maintainable architecture Safe and secure deployment patterns for AI components at scale AI performance, accuracy, and reliability monitoring through evals … automated testing strategies (unit/integration/e2e where appropriate) good CI/CD hygiene and code review practicesclean boundaries and readable, extensible code observability and operational readiness (logging, monitoring, failure modes) Confidence working with AI uncertainty and risk You understand that AI systems behave differently to deterministic software ...

Head of IT Service Management

Hiring Organisation
Deerfoot Recruitment Solutions
Location
Hatfield, Hertfordshire, South East, United Kingdom
Employment Type
Permanent
cyber incident response with internal security teams and external partners Manage third-party suppliers, SLAs, and commercial performance Define and deliver strategy across automation, observability, and AIOps Lead and develop a high-performing team while influencing senior stakeholders Key requirements Proven experience in a Head of IT Service Management/… approach with the ability to lead under pressure Desirable Cyber incident response experience ServiceNow, Jira Service Management or similar ITSM tools Exposure to AIOps, observability, automation Advanced ITIL or relevant certifications Package and benefits Base salary £110,000 - £130,000 (DOE) Up to 30% discretionary bonus Up to 7% matched ...