Observability Jobs in the East of England

16 of 16 Observability Jobs in the East of England

DevOps Engineer (Kubernetes-Focused, Mid-Level)

Ipswich, England, United Kingdom
Core Cognitics
including load balancers, networking, firewalls, and system updates. Set up, configure, and maintain bare-metal or lightweight Kubernetes environments (e.g., kubeadm, K3s, MicroK8s). Monitor performance and reliability using observability tools (Prometheus, Grafana, Loki, ELK, etc.). Troubleshoot deployment, networking, and container runtime issues. Collaborate with development teams to ensure smooth delivery of applications and services. Maintain good documentation and More ❯
Posted:

Technology Lead

Luton, England, United Kingdom
Hybrid/Remote Options
easyJet
driven architecture, and microservices. Familiarity with airline or transport operations systems (crew, flight planning, disruption, scheduling) highly desirable. Strong understanding of DevOps practices (CI/CD pipelines, automated testing, observability). Strong understanding of scrum and agile ways of working, and able to help the team way of working Excellent problem-solving skills, able to assess trade-offs between time More ❯
Posted:

DevOps Engineer

Cambridge, England, United Kingdom
Propel
EC2, VPC, etc.) ⚙️ Strong IaC skills with Terraform and CI/CD pipelines 🐳 Kubernetes operations expertise on AWS (EKS) 🔒 Solid grounding in Linux, networking, and cloud security 📊 Familiarity with observability stacks (Prometheus, Grafana, Loki) If you’re ready to shape the infrastructure behind cutting-edge AI used by global enterprises, we’d love to hear from you. More ❯
Posted:

DevOps Engineer

Cheshire East, England, United Kingdom
Xpertise Recruitment
and Argo Troubleshoot production issues and keep uptime high What You’ll Bring Strong commercial DevOps experience with AWS, Docker, and Kubernetes Solid grounding in IaC (Terraform) and modern observability tools Understanding of cloud security, access control, and network resilience Collaborative, proactive, and solutions-driven mindset — you get things done Why Join Scale with a fast-growing, global tech business More ❯
Posted:

Software Engineer - Security Platforms AI

Cambridge, Cambridgeshire, United Kingdom
Hybrid/Remote Options
Arm Limited
methodologies Familiarity withSQLfor building and querying relational databases. Clear technical writing todocumentdata schemas, APIs, and dashboard usage. "Nice to Have" Skills and Experience Experience with Grafana, Prometheus, or similar observability platforms. Familiarity with SAST and SCA tools (e.g., Coverity, Black Duck) and experience understanding their findings. Experience defining and visualizing key security and performance metrics within dashboard solutions. Experience with More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Infrastructure Engineer

cambridge, east anglia, united kingdom
Luminance
through initiatives to remove single points of failure and improve autoscaling, high availability and managed service adoption across the platform. Collaborate with SRE, Security and Engineering teams to enhance observability, monitoring and alerting through tools like Prometheus, Grafana and CloudWatch. Partner with Security to embed best practices for IAM, secrets management, WAF, and posture management. Optimise performance and cloud spend … CodePipeline). Strong knowledge of Kubernetes operations on AWS (EKS), including cluster scaling, deployment automation, and monitoring. Solid background in Linux administration, networking, and cloud security principles. Familiarity with observability tools (Prometheus, Grafana, Loki) and structured alerting practices. Experience with database migrations, HA configurations, backups, and DR strategies. Strong scripting and automation skills (Terraform, Python, Bash, or similar). Excellent More ❯
Posted:

Data DevOps Engineer

St Albans, England, United Kingdom
Addition+
Oversee data pipelines and big data workflows (EMR, Spark) for high-performance analytics. Optimize code for ETL and Power BI (DAX, data models, refresh scheduling) to enhance performance. Implement observability and logging (CloudWatch, Grafana, ELK) for proactive system monitoring. Collaborate cross-functionally with BI, Platform, and Data teams on releases and issue resolution. Enforce security & compliance (RBAC, encryption, GDPR/… on with Docker and Kubernetes; experienced in scalable, portable BI and data environments. Environment Management: Managed Dev/QA/UAT freshness, data synchronisation, and Jira-integrated release workflows. Observability & Monitoring: Implemented CloudWatch, Datadog, Prometheus, and Grafana for logging, metrics, and alerting. Troubleshooting & Problem Solving: Strong analytical and cross-functional collaboration skills; effective under pressure. Project Delivery: Managed multiple concurrent More ❯
Posted:

Senior Cloud Engineer

Croydon, Cambridgeshire, UK
Morson Edge
of investment into the latest tech & AWS tools What they're looking for... Strong experience within AWS & AWS services within networking and security Proficient within Terraform, CloudFormation or Ansible Observability tools like Cloud Watch, CloudTrail, OpenSearch Grafana/Kinesis Have a background within core infrastructure services like networking, security, patching and has transitioned to a Platform/Cloud focused Engineer More ❯
Posted:

Cyber Security Engineer

Stevenage, Hertfordshire, South East, United Kingdom
Hybrid/Remote Options
MBDA
stakeholders to meet the ever-evolving challenges of the cyber threat landscape. Key responsibilities include; Act as the subject matter expert (SME) for Splunk across all cyber security and observability use cases. Lead SOC automation initiatives using scripting and SOAR tools, optimising processes through AI and ML technologies. Support alert tuning, connectivity, and visibility across monitored networks and infrastructure. Maintain More ❯
Employment Type: Permanent, Work From Home
Salary: £60,000
Posted:

Data Steward FTC

Luton, England, United Kingdom
Hybrid/Remote Options
easyJet
and platforms to automate and optimise data management steps and gateways into data and analytical pipelines. • Expertise in implementing and managing statistical process controls for data quality measurement, continuous observability, and data quality remediation. • Strong SQL background – comfortable writing efficient SQL (Transact-SQL, Hive -HQL) to meet the requirement, having had exposure to working with large datasets on a distributed More ❯
Posted:

Director, Infrastructure & Security Operations

Chelmsford, Essex, United Kingdom
Hybrid/Remote Options
Brooks Automation, Inc
infrastructure and security services, ensuring operational excellence and incident response readiness. Partner with the CISO to shape long-term strategy and roadmap for secure, resilient IT services. Drive automation, observability, and scalability across the infrastructure and security stack. Serve as a key escalation point for technical troubleshooting and security event resolution. Guide vendor selection, contract negotiations, and service-level adherence More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

Cambridge, Cambridgeshire, United Kingdom
Hybrid/Remote Options
Willis Towers Watson
Description We are looking for an experienced Site Reliability Engineer to join the Igloo team in Cambridge to champion observability and delivery. The candidate should have strong communication skills, experience in coaching or sharing knowledge, and proficiency in Azure and Observability platforms. Join Insurance Consulting and Technology (ICT) during a transformative period aimed at enhancing customer and business value. You … new and exciting uses of their technology. This role will have the opportunity to help the team and product deal with exciting, complex and large-scale client propositions where observability will be essential and help transform how the product is designed and deployed. You will join a cross-team guild of Site Reliability Engineers, which enables you to not only … influence direction within your product family, but to also help shape how we handle observability and monitoring across ICT. This role is open to flexible and hybrid working arrangements, with presence in the Cambridge office a minimum of two days per week. The Role: Collaborate with cross-functional teams to ensure the reliability, availability, and performance of our client-facing More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Platform Engineer – Cloud Infrastructure (AWS | Kubernetes | IaC)

Cambridge, England, United Kingdom
SoCode Recruitment
using Terraform and integrate them into CI/CD pipelines. Drive continuous improvement in platform reliability, scalability, and cost optimisation. Collaborate with SRE, Security, and Engineering teams to strengthen observability, monitoring, and alerting using tools like Prometheus, Grafana, and CloudWatch. What You’ll Bring Proven experience designing and automating AWS infrastructure . Strong experience building IaC pipelines with Terraform and …/CD tools Deep understanding of Kubernetes operations on AWS , including scaling, deployment automation, and monitoring. Solid background in Linux administration , networking, and cloud security. Hands-on experience with observability stacks (Prometheus, Grafana, Loki). Knowledge of database reliability . Strong scripting skills. A collaborative approach with a passion for improving systems through automation and consistency. The role: Pay More ❯
Posted:

Data Engineer - 12 month FTC

Alconbury, Cambridgeshire, UK
MM Flowers
of the digital landscape across the business. We’re looking to grow our data team with a skilled, proactive Data Engineer who is keen to establish strong data governance, observability practices-ensuring datasets are versioned, catalogued and fully traceable from source to output while sharing their knowledge within the data team. Working closely with the IT Team, vendors and business … . Maintain a medallion architecture (Bronze–Gold) for trusted, refined datasets. Develop, optimize, and maintain complex SQL queries to support analytics and reporting requirements. Implement data quality, testing and observability; ensure lineage, accuracy and compliance. Enable self-serve analytics through well-documented models and transformation logic. Integrate internal/external sources. Manage data infrastructure (warehouses, data lakes, storage); tune performance … and monitor health. Troubleshoot incidents, run root-cause analysis and deploy fixes and provide technical support. You will apply best practices for data quality, testing, and observability, helping to ensure the data delivered to stakeholders is accurate and trustworthy. Contribute to CI/CD practices, documentation and engineering standards. Partner cross-functionally to deliver fit-for-purpose data solutions. Proactively More ❯
Posted:

Director - Performance and Reliability

Cambridgeshire, East Anglia, United Kingdom
Sanderson Recruitment
lead performance testing and chaos engineering initiatives, and embed reliability best practices across engineering, DevOps, and infrastructure teams. This is a senior, strategic leadership role focused on system excellence, observability, and continuous improvement. Ideal Candidate: Proven experience leading Performance Engineering, Reliability, or SRE functions Deep expertise in performance testing methodologies (load, stress, spike, soak) Strong hands-on background with LoadRunner … strategy across critical platforms and services Oversee load, stress, and chaos testing initiatives to ensure systems perform and recover under real-world conditions Define and drive best practices for observability, monitoring, and APM adoption using tools like Dynatrace Drive incident reduction, faster recovery (MTTR) , and continuous reliability improvements Champion a culture of performance ownership , ensuring teams build with scalability, stability More ❯
Employment Type: Permanent
Salary: £95,000
Posted:

(Senior) EMEA AI Product Owner - Hemel Hempstead

Hemel Hempstead, Hertfordshire, United Kingdom
Boston Scientific Gruppe
by design. You'll groom the roadmap and write user stories to add to the backlog, break down epics, refine acceptance criteria, and balance new features with technical debt, observability, and cost efficiency. You'll demo recent increments, gather user feedback, and turn it into testable stories that enhance usability, trust, and performance. You'll brief sponsors on the value … solutions for the EMEA region. Key Responsibilities: Backlog ownership & delivery: Convert outcomes into prioritized epics/stories, lead refinement and planning, uphold DoR/DoD, balance features with reliability, observability, cost, and sustainability. Value & metrics: Establish VIPs/KPIs/OKRs (adoption, time to data, data trust, NPS, ROI); run quarterly value reviews, iterate the roadmap based on evidence. Privacy More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
Observability
the East of England
10th Percentile
£70,300
25th Percentile
£81,250
Median
£92,500
75th Percentile
£100,000