21 of 21 Observability Jobs in the East of England

Site Reliability Engineer

Hiring Organisation
Visa
Location
Cambridge, Cambridgeshire, UK
Kubernetes (deploying or operating services). Exposure to service mesh technologies (e.g., Istio). Experience building or operating cloudnative or serverless applications. Familiarity with observability and data platforms such as Prometheus, Grafana, MongoDB, Elasticsearch, Kafka, and HashiCorp Vault. Understanding of application and data security fundamentals (authentication, authorization, encryption, TLS). ...

Site Reliability Engineer

Hiring Organisation
Anglian Water
Location
Huntingdon, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent
Salary
£40,000
looking for * Experience in site reliability engineering or DevOps roles * Strong scripting and automation skills (e.g., Python, Bash) * Knowledge of monitoring tools and observability practices * Understanding of cloud infrastructure and containerisation * Excellent problem-solving and analytical abilities * Commitment to continuous improvement and operational excellence Benefits As a valued employee ...

Senior Solution Architect - Secure Networks

Hiring Organisation
Jobleads-UK
Location
Peterborough, England, United Kingdom
multiple priorities and deadlines Nice to Have Experience with Zero Trust architectures and modern security frameworks Exposure to cloud networking (AWS, Azure) Exposure within observability and AIOps (Splunk, Logic Monitor, Big Panda) Experience with automation and AI‐driven operations (Python, NetBox, Ansible, Terraform) Relevant certifications (CCNP/CCIE, JNCIP/ ...

Software Engineer

Hiring Organisation
Accountancy Action - Your Specialist Finance Recruitment Partner
Location
Hertfordshire, England, United Kingdom
Exposure to AI systems, LLMs, or conversational workflows Experience in healthcare or regulated environments Knowledge of infrastructure-as-code (e.g. Terraform, CDK) Experience with observability, monitoring, and scaling production systems ...

Senior Software Engineer

Hiring Organisation
OAG
Location
Luton, England, United Kingdom
models, data pipelines, and event-driven systems on Databricks and Snowflake into the Intelligence layer Maintain the quality bar through code reviews, automated testing, observability, and CI/CD Support junior and mid-level engineers and help them grow WHAT WE'RE LOOKING FOR Extensive full-stack experience building ...

System Performance Engineer

Hiring Organisation
17918
Location
Cambridge, Cambridgeshire, United Kingdom
/JVM tuning or diagnostic experience. Basic Kubernetes setup or cluster experimentation experience. Understanding of networking fundamentals. Experience with opensource tools used for diagnostics, observability, or system performance. Basic programming ability (any language) to support automation or small tooling improvements. Exposure to distributed systems concepts or cloud environments ...

Senior Machine Learning Engineer - Agentic AI Platform

Hiring Organisation
Robert Half Limited
Location
Cambridge, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent, Work From Home
within the agent framework. Inference & Performance: Optimize LLM integration, latency, and cost efficiency. State & Reliability: Strengthen Redis-backed persistence and ensure system consistency. Evaluation & Observability: Build regression frameworks and implement monitoring and tracing. What We're Looking For Strong Python engineering experience with production-grade systems Hands-on with ...

Founding Engineer

Hiring Organisation
RedTech Recruitment Ltd
Location
Cambridge, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent
Salary
£95,000
develop high-quality frontend interfaces that make complex AI outputs intuitive and actionable for users Build and maintain deployment pipelines, testing frameworks, monitoring, and observability systems Design and implement secure data pipelines with appropriate access controls and auditability Ensure the platform meets enterprise-grade security and compliance requirements ...

Lead Site Reliability Engineer SRE Azure SaaS

Hiring Organisation
Client Server
Location
Cambridge, Cambridgeshire, United Kingdom
Employment Type
Permanent
Salary
GBP 100,000 Annual
Lead Site Reliability Engineer (SRE Azure SaaS) Cambridge/WFH to £100k Do you have expertise with observability and monitoring within a SaaS environment? You could be progressing your career in a hands-on, influential role at a global InsurTech business, working on a flagship product that has recently been ...

Lead Site Reliability Engineer SRE Azure SaaS

Hiring Organisation
Client Server
Location
Cherry Hinton, Cambridgeshire, UK
Employment Type
Full-time
Lead Site Reliability Engineer (SRE Azure SaaS) Cambridge/WFH to £100k Do you have expertise with observability and monitoring within a SaaS environment? You could be progressing your career in a hands-on, influential role at a global InsurTech business, working on a flagship product that has recently been ...

Lead Site Reliability Engineer SRE Azure SaaS

Hiring Organisation
Client Server
Location
Cambridge, Cambridgeshire, UK
Description Lead Site Reliability Engineer (SRE Azure SaaS) Cambridge/WFH to £100k Do you have expertise with observability and monitoring within a SaaS environment? You could be progressing your career in a hands-on, influential role at a global InsurTech business, working on a flagship product that has recently ...

Lead Site Reliability Engineer SRE Azure SaaS

Hiring Organisation
Client Server
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
Lead Site Reliability Engineer (SRE Azure SaaS) Cambridge/WFH to £100k Do you have expertise with observability and monitoring within a SaaS environment? You could be progressing your career in a hands-on, influential role at a global InsurTech business, working on a flagship product that has recently been ...

Senior Platform Engineer

Hiring Organisation
PayPoint plc
Location
Welwyn Garden City, England, United Kingdom
GitOps workflows to enable safe, fast, and repeatable delivery. Championing DevSecOps principles, embedding security and compliance into the software delivery lifecycle. Establishing and improving observability, monitoring, and incident response practices, including vulnerability management and remediation. Mentoring engineers and contributing to a strong engineering culture through knowledge sharing, documentation, and technical … would be great if you have the following Experience with Helm, Kustomize, and Kubernetes ecosystem tooling. Familiarity with Azure and Azure DevOps. Experience with observability platforms and Kubernetes policy enforcement tools. Proficiency in scripting or programming (e.g. Bash, Python, PowerShell, C#). Experience designing multi-region or highly available systems. ...

Principal Engineer

Hiring Organisation
Synergetic Recruitment Group Limited
Location
Chelmsford, Essex, South East, United Kingdom
Employment Type
Permanent
client is scaling a large, distributed cloud platform and is looking for a Principal Engineer to act as the Subject Matter Expert (SME) across observability and cloud infrastructure. Youll be working at serious scale managing thousands of Kubernetes nodes, handling tens of terabytes of logs daily, and supporting millions … highly distributed environment. The Role This is a senior, hands-on role where you will own the technical direction and standards of the observability ecosystem. As the SME, youll define best practice, guide architectural decisions, and act as the go-to expert across engineering teams, ensuring scalable, cost-efficient ...

Principal Engineer

Hiring Organisation
Synergetic
Location
Cambridgeshire, England, United Kingdom
client is scaling a large, distributed cloud platform and is looking for a Principal Engineer to act as the Subject Matter Expert (SME) across observability and cloud infrastructure. You’ll be working at serious scale managing thousands of Kubernetes nodes, handling tens of terabytes of logs daily, and supporting millions … highly distributed environment. The Role This is a senior, hands-on role where you will own the technical direction and standards of the observability ecosystem. As the SME, you’ll define best practice, guide architectural decisions, and act as the go-to expert across engineering teams, ensuring scalable, cost-efficient ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Stevenage, Hertfordshire, UK
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Lead Site Reliability Engineer SRE Azure SaaS

Hiring Organisation
Client Server
Location
Cambridge, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent, Work From Home
Lead Site Reliability Engineer (SRE Azure SaaS) Cambridge/WFH to £100k Do you have expertise with observability and monitoring within a SaaS environment? You could be progressing your career in a hands-on, influential role at a global InsurTech business, working on a flagship product that has recently been … happy to mentor and coach others, sharing your SRE expertise with software engineers and DevOps You have a strong knowledge of Azure including observability, monitoring, scaling, security and Azure DevOps pipelines You have experience with observability tools, Datadog preferred You have a good knowledge of automation, scripting (Python or PowerShell ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
Birchanger, Hertfordshire, United Kingdom
Employment Type
Permanent
Salary
GBP 40,000 - 50,000 Annual
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, modern technology estate. Responsibilities … improve speed, accuracy and consistency Supporting major changes, deployments and post-incident reviews with data-driven evidence Qualifications Strong experience with monitoring and observability tools (LogicMonitor, Azure Monitor, App Insights, Log Analytics, Defender for Cloud) Excellent understanding of cloud performance, IaaS/PaaS, networking fundamentals, API performance and capacity modelling ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
Stansted, Birchanger, Essex, United Kingdom
Employment Type
Permanent
Salary
£40000 - £50000/annum
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, modern technology estate. Responsibilities … improve speed, accuracy and consistency Supporting major changes, deployments and post-incident reviews with data-driven evidence Qualifications Strong experience with monitoring and observability tools (LogicMonitor, Azure Monitor, App Insights, Log Analytics, Defender for Cloud) Excellent understanding of cloud performance, IaaS/PaaS, networking fundamentals, API performance and capacity modelling ...

Lead AI Consultant

Hiring Organisation
Jobleads-UK
Location
Cambridge, England, United Kingdom
Well‐formed opinions on and experience of AI‐enhanced SDLCs Set teams up for sustainable success through: Strong engineering practices (testing, CI/CD, observability, quality) Clear system boundaries and maintainable architecture Safe and secure deployment patterns for AI components at scale AI performance, accuracy, and reliability monitoring through evals … automated testing strategies (unit/integration/e2e where appropriate) good CI/CD hygiene and code review practicesclean boundaries and readable, extensible code observability and operational readiness (logging, monitoring, failure modes) Confidence working with AI uncertainty and risk You understand that AI systems behave differently to deterministic software ...

Head of IT Service Management

Hiring Organisation
Deerfoot Recruitment Solutions
Location
Hatfield, Hertfordshire, South East, United Kingdom
Employment Type
Permanent
cyber incident response with internal security teams and external partners Manage third-party suppliers, SLAs, and commercial performance Define and deliver strategy across automation, observability, and AIOps Lead and develop a high-performing team while influencing senior stakeholders Key requirements Proven experience in a Head of IT Service Management/… approach with the ability to lead under pressure Desirable Cyber incident response experience ServiceNow, Jira Service Management or similar ITSM tools Exposure to AIOps, observability, automation Advanced ITIL or relevant certifications Package and benefits Base salary £110,000 - £130,000 (DOE) Up to 30% discretionary bonus Up to 7% matched ...