351 to 375 of 473 Observability Jobs in England

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Peterborough, Cambridgeshire, UK
Employment Type
Full-time
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Derby, Derbyshire, UK
Employment Type
Full-time
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Exeter, Devon, UK
Employment Type
Full-time
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Norwich, Norfolk, UK
Employment Type
Full-time
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Bedford, Bedfordshire, UK
Employment Type
Full-time
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Colchester, Essex, UK
Employment Type
Full-time
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Cheltenham, Gloucestershire, UK
Employment Type
Full-time
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Ipswich, Suffolk, UK
Employment Type
Full-time
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
York, North Yorkshire, UK
Employment Type
Full-time
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Hemel Hempstead, Hertfordshire, UK
Employment Type
Full-time
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Milton Keynes, Buckinghamshire, UK
Employment Type
Full-time
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Newcastle upon Tyne, UK
Employment Type
Full-time
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Engineering Manager

Hiring Organisation
La Fosse
Location
London Area, United Kingdom
understanding of CI/CD pipelines , including build automation, testing, and deployment Familiarity with modern engineering practices: automated testing, infrastructure as code, monitoring, and observability Technology Stack Backend development across modern JVM frameworks including Spring , Spring Boot , and Micronaut , primarily using Java Cloud-native services deployed on Azure , with orchestration … Kubernetes and system monitoring/observability using tools such as Dynatrace Data persistence and storage using a mix of relational and NoSQL technologies, including SQL Server and MongoDB Frontend applications built with contemporary JavaScript frameworks and languages such as React , Next.js , Angular , and TypeScript In-memory data grids and caching ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£55,000 - £65,000 per annum
performant infrastructure that underpins critical public-sector services. You’ll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. You’ll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
Stoke-on-Trent, Staffordshire, UK
Employment Type
Full-time
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

DevOps Engineer

Hiring Organisation
Adria Solutions
Location
Manchester, North West, United Kingdom
Employment Type
Contract, Work From Home
Contract Rate
£400 per day (Outside)
partial service) Google Cloud Platform (BigQuery for analytics) DevOps & CI/CD GitHub repositories & workflows Shared GitHub Actions pipelines Public and private repositories Monitoring & Observability Prometheus, Grafana, Alertmanager Logit.io StatusCake Azure Monitor Alerts Sentry (service-level monitoring) What Were Looking For Strong hands-on experience with Azure DevOps tooling … Kubernetes (AKS preferred) Experience with CI/CD pipelines (GitHub Actions) Familiarity with multi-cloud environments (AWS/GCP beneficial) Experience with monitoring and observability tools Ability to work in a collaborative, fast-paced environment Contract Details Start Date: ASAP Duration: Initial 3 months Location: Remote/Hybrid If youre ...

Senior Data Platform Engineer

Hiring Organisation
ed Resourcing Ltd
Location
London, United Kingdom
Employment Type
Permanent
Salary
GBP 90,000 Annual
Snowflake platform components Building and maintaining Infrastructure as Code (Terraform) across environments Creating and optimising CI/CD pipelines (Azure DevOps, GitHub Actions) Implementing observability practices (logging, monitoring, alerting) Ensuring platform security, scalability, and performance Collaborating with architects and senior engineers on platform standards Mentoring engineers and promoting engineering best … experience with Terraform or similar IaC tooling Proven ability to build and manage CI/CD pipelines Solid understanding of cloud security and observability Scripting skills (PowerShell, Bash, Python) Strong communicator with experience working across teams Ideal Backgrounds Platform Engineers working in data environments DevOps/Platform Engineers with exposure ...

Azure Engineering Manager - Fully Remote

Hiring Organisation
GBV Ltd
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
engineers. What youll be doing: Leading a distributed engineering team focused on platform reliability and scalability Driving SRE best practices (SLOs, automation, observability, incident management) Partnering with product, security, and engineering teams to shape infrastructure strategy Improving CI/CD, developer experience, and system performance Championing a culture of continuous … teams Deep Azure expertise (Terraform/IaC preferred) Background in software engineering (C#, Java, Python, or Ruby) Experience with Kubernetes, CI/CD, and observability tooling Passion for automation, reliability, and scalable systems Package highlights: Salary around £110-130k Private healthcare, pension, and strong benefits Clear progression and development ...

Platform Engineering Manager

Hiring Organisation
Prism Digital
Location
London Area, United Kingdom
cloud environments Architecture governance and design authority Security-by-design and Zero Trust Terraform or Bicep (production IaC) CI/CD and infrastructure automation Observability (SLOs, monitoring, incident management) Disaster recovery and resilience planning Vendor and third-party management Strong stakeholder communication What You’ll Work With Azure (landing zones … shared services) Terraform/Bicep CI/CD pipelines Kubernetes (AKS) Observability tooling (logs, metrics, tracing) Networking (VNets, ExpressRoute, private endpoints) Security controls and compliance frameworks Event Hubs, Service Bus, API Management Hybrid Windows/Linux infrastructure Nice to Haves FinOps (cost control, budgeting, optimisation) Financial services or regulated environments ...

Typescript Developer

Hiring Organisation
Get2Talent
Location
South East London, London, United Kingdom
Employment Type
Permanent
Salary
£85,000
performance trading platform . Work primarily with: TypeScript ( Node. js & React) Monorepo tooling, GitHub, GitHub Actions Jest, Playwright Redis, MS SQL, WebSockets Docker, Kubernetes Observability tools ( Grafana, Prometheus, SonarQube) Take end- to- end ownership of features from design to production. Collaborate closely with platform and DevOps engineers on build pipelines … observability, and operational concerns. Communicate directly with clients to clarify requirements and propose solutions. Contribute to and improve automated testing practices. Participate in peer code reviews and maintain high engineering standards. Leverage LLM/AI- enabled development tools as part of day- to- day development. Requirements 8+ years of professional ...

Lead React Developer

Hiring Organisation
Get2Talent
Location
South East London, London, United Kingdom
Employment Type
Permanent
Salary
£85,000
performance trading platform . Work primarily with: TypeScript ( Node. js & React) Monorepo tooling, GitHub, GitHub Actions Jest, Playwright Redis, MS SQL, WebSockets Docker, Kubernetes Observability tools ( Grafana, Prometheus, SonarQube) Take end- to- end ownership of features from design to production. Collaborate closely with platform and DevOps engineers on build pipelines … observability, and operational concerns. Communicate directly with clients to clarify requirements and propose solutions. Contribute to and improve automated testing practices. Participate in peer code reviews and maintain high engineering standards. Leverage LLM/AI- enabled development tools as part of day- to- day development. Requirements 8+ years of professional ...

Full Stack Typescript Engineer

Hiring Organisation
Get2Talent
Location
Oxford, Oxfordshire, South East, United Kingdom
Employment Type
Permanent
Salary
£70,000
performance trading platform . Work primarily with: TypeScript ( Node. js & React) Monorepo tooling, GitHub, GitHub Actions Jest, Playwright Redis, MS SQL, WebSockets Docker, Kubernetes Observability tools ( Grafana, Prometheus, SonarQube) Take end- to- end ownership of features from design to production. Collaborate closely with platform and DevOps engineers on build pipelines … observability, and operational concerns. Communicate directly with clients to clarify requirements and propose solutions. Contribute to and improve automated testing practices. Participate in peer code reviews and maintain high engineering standards. Leverage LLM/AI- enabled development tools as part of day- to- day development. Requirements 8+ years of professional ...

Senior Lead Engineer

Hiring Organisation
Investigo
Location
City of London, London, United Kingdom
change management tools like Liquibase into automated pipelines Apply DevSecOps best practices across the lifecycle: static analysis, dependency scanning, and secure credential management Ensure observability, monitoring, and performance using GCP Operations Suite or New Relic Mentor engineers and collaborate across global, distributed teams What We’re Looking For Proven experience … expertise : BigQuery, Dataproc, Cloud Composer Deep data architecture and engineering knowledge : Spark, DBT, Oracle, BigQuery Experience designing scalable architectures (Microservices, Monoliths, Batch) Skilled in observability, monitoring, and DevSecOps integration Excellent communication with a record of collaborating globally Why You’ll Love It Combine architecture, coding, and leadership in one role ...

Senior / Lead Data Engineer (AI-Focused)

Hiring Organisation
PaymentGenes
Location
City of London, London, United Kingdom
inference (batch and real-time) Evaluate and integrate emerging AI tooling where strategically valuable 🔧 Technical Leadership Set best practices for testing, documentation, lineage, and observability Lead code reviews and mentor data & analytics engineers Drive CI/CD and infrastructure-as-code adoption Own platform reliability, performance optimisation, and cost efficiency … Infrastructure Feature engineering architecture ML pipeline and deployment workflows Experience supporting production ML systems Familiarity with embeddings, vector databases, LLM orchestration (desirable) Data observability and model monitoring Platform & DevOps CI/CD for data workflows Git-based engineering standards Docker/containerisation Infrastructure-as-code (e.g., Terraform) Monitoring and alerting ...

Cloud Engineer

Hiring Organisation
Spectrum It Recruitment Limited
Location
Southampton, Hampshire, South East, United Kingdom
Employment Type
Permanent
Salary
£65,000
secure, resilient cloud infrastructure across AWS and Azure . You'll play a key role in modernising platforms, migrating legacy services, and improving automation, observability and security across a multi-cloud estate. Cloud Engineer (AWS & Azure) Hybrid (2 days per month onsite) Location: Southampton What you'll be doing Designing … similar), Azure DevOps, PowerShell, Azure CLI Scripting: PowerShell, Python, Bash Containers: Docker, container registries (e.g., ACR) CI/CD: Azure DevOps Pipelines, YAML automation Observability: Datadog, Grafana Cloud, OpenTelemetry, CloudWatch, Prometheus, Loki Benefits (from day one) Up to 15% Bonus scheme 25 days annual leave + bank holidays Pension ...