Observability Jobs in the UK excluding London

26 to 50 of 582 Observability Jobs in the UK excluding London

Infrastructure Engineer

York, Yorkshire, United Kingdom
Polo's Point S Tire
performance issues Managing regular patching and upgrade cycles for Infrastructure and Software Managing security vulnerabilities and performing platform hardening activities Developing automation to remove manual tasks Developing and maintaining observability dashboards and alerting Collaborating with Software Engineers and Users across the business Required skills and experience: Strong knowledge of at least one Public Cloud provider: Azure, AWS or GCP (Managed … Compute, Networking, RBAC/IAM) Prior experience in Linux system administration in a production environment Prior experience in provisioning and operating Kubernetes clusters in a production environment Experience in observability with Grafana with a good understanding of PromQL and LogQL Good knowledge of using Infrastructure-as-Code solutions such as Terraform Comfortable with scripting for automation using Bash and Python More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Messaging Administrator - Solace

South East London, England, United Kingdom
Marlin Selection Recruitment
For: 3+ years’ hands-on experience with Solace PubSub+ in a production environment Strong knowledge of WAN-based distributed systems and networking fundamentals Experience with Prometheus and Grafana for observability and alerting Confident in Linux/Unix systems and scripting (Bash, Python, etc.) Excellent problem-solving instincts and attention to detail Strong communicator who works well across technical teams Bonus More ❯
Posted:

Remote Senior Site Reliability Engineer Manager (Remote)

Cambourne, Cambridgeshire, United Kingdom
Hybrid / WFH Options
Remotestar
strong track record of building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with cloud platforms such as AWS, Azure, or GCP, including infrastructure as code tools like Terraform or CloudFormation. Strong scripting More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

DevOps Engineer

Macclesfield, England, United Kingdom
Revolent Group
a minimum of two years working with us post training Nice to have: Domain knowledge: Banking, Financial Services, Lending (Very nice to have – understanding the wholesale lending lifecycle) Monitoring & Observability: Experience in managing Tools like APPD, ELK stack, Grafana Security Practices: DevSecOps principles, vulnerability scanning, compliance automation, Certificate/vault/user role management. Strong attention to detail a passion More ❯
Posted:

Software Engineer

Cheltenham, England, United Kingdom
Hybrid / WFH Options
Argo DevOps Solutions Ltd
BDD approaches (e.g., Cucumber, Gherkin) for test automation Containerisation & Microservices Container Technologies: Practical understanding of Docker or equivalent solutions Microservice Patterns: Experience architecting microservice-based systems with built-in observability and security Cloud Services & Environments Cloud Providers: Demonstrable experience with AWS or Azure Security & Configuration: Ability to build, configure, and secure cloud environments effectively Security & CI/CD Security Integration More ❯
Posted:

AWS Engineer

Manchester, United Kingdom
Hybrid / WFH Options
BAE Systems (New)
or DevOps Expertise in microservices and API design Docker, and container runtime platforms such as Kubernetes, EKS, ECS etc Strong understand of operational concepts on AWS, particularly monitoring and observability, FinOps UtilisingCI/CD tools, such as Bamboo, Jenkins, TeamCity, Bitbucket, in order to streamline delivery of new features and fixes Continual testing of code using Automated Testing Frameworks A More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AWS Engineer

Manchester, Lancashire, United Kingdom
Hybrid / WFH Options
BAE Systems (New)
or DevOps Expertise in microservices and API design Docker, and container runtime platforms such as Kubernetes, EKS, ECS etc Strong understand of operational concepts on AWS, particularly monitoring and observability, FinOps UtilisingCI/CD tools, such as Bamboo, Jenkins, TeamCity, Bitbucket, in order to streamline delivery of new features and fixes Continual testing of code using Automated Testing Frameworks A More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

Hereford, Herefordshire, West Midlands, United Kingdom
Hybrid / WFH Options
Twinstream Limited
Work Scheme Key Responsibilities of the Site Reliability Engineer: Partner with developers to improve performance and reliability across systems Automate toil and reduce unnecessary alerts with smart tooling Evolve observability so we can prevent issues before they become incidents Improve CI/CD pipelines and support development teams in delivering quality faster Explore new technologies, tools, and services that improve … plus) Experience with Terraform and modern IaC practices Hands-on with Docker and orchestration tools (Kubernetes, OpenShift, or Docker Swarm) CI/CD experience (Jenkins or equivalent) Monitoring/observability tools: Grafana , Prometheus , or InfluxDB Event-driven messaging: RabbitMQ or similar Strong Linux skills, scripting, and understanding of network security protocols Experience with AWS: EC2, S3, RDS, Lambda Desirable: Experience … coding in Python, Java, or Go Exposure to cross-domain solutions Experience in a service management environment Observability best practices and metric-driven reliability improvement Security Requirements Due to the sensitive nature of our work, candidates must be eligible for Developed Vetting (DV) clearance. All offers are subject to security screening. Ready to Engineer Systems That Matter? If youre a More ❯
Employment Type: Permanent, Work From Home
Posted:

Senior Site Reliability Engineer

Manchester, United Kingdom
Hybrid / WFH Options
Embarcaderomediagroup
ll sit at the heart of our engineering operations, bringing together SRE principles and modern platform engineering practices. This includes combining principles of SRE - such as service-level reliability, observability, incident response - with platform engineering practices like GitOps, Infrastructure as Code, DevSecOps automation, and self-service enablement, to help development teams ship faster, safer, and more cost-efficiently. What you … ll be doing: Designing and operating highly reliable, scalable, and secure Azure-based platforms Applying SRE principles like SLOs, observability, and incident management to drive service reliability Building Infrastructure as Code using Terraform (v1.7+) and GitOps workflows Enabling teams through platform tools, reusable Terraform modules, and self-service infrastructure Enhancing CI/CD pipelines (Azure DevOps, YAML-based) with security … knowledge (AKS, Functions, SQL, Cosmos DB, etc.) Strong Infrastructure as Code skills with Terraform (v1.7+) Experience with CI/CD pipelines, GitOps, and automation tools (PowerShell, Bash) Familiarity with observability and incident tools like Datadog, ELK, and synthetic monitoring Solid understanding of networking (TCP/IP, Load Balancing, DNS, Routing) Good knowledge of DevSecOps practices - including security scanning, IAM, and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Platform Engineer

Crewe, Cheshire, United Kingdom
Hybrid / WFH Options
Manchester Digital
platform security, reliability, and performance across systems deployed in Canada, the UK, and AWS cloud environments Contribute to key projects, platform optimizations, and ongoing maintenance initiatives Help drive scalability, observability, and operational excellence If you're passionate about infrastructure, cloud, and systems engineering-and want to help shape the future of mobility-we want to hear from you! Requirements We … configurations (Azure AD , Ory, Cognito, Firebase) - Understanding of Site Reliability Engineering and key concepts - Proficient in Infrastructure as Code pipeline deployments and pipeline version control within Terraform or CloudFormation. - Observability Systems, e.g., Nagios, New Relic - Able to troubleshoot/work under pressure, meet deadlines. - Previous experience in a cloud engineering role. - AWS certified as SysOps Administrator/Solutions Architect/… understanding of Infrastructure as Code principles and related tech such as Terraform or CloudFormation - Enhanced experience of AWS cloud technologies, e.g., ECS, EC2, VPC, Lambda, CFS. Ideally AWS certified. - Observability Systems, e.g., New Relic, CloudWatch, SquadCast - ITIL Qualified or awareness of the framework. Bonus Qualifications: -Experience with Linux system administration and troubleshooting. -Basic knowledge of AWS cloud technologies such as More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Python Developer

Northern Ireland, United Kingdom
Hybrid / WFH Options
Ocho
cross-functional teams to design and deliver full-featured software components • Drive a “security-first” mindset across development practices, including OAuth2 and IAM policies • Lead operational efforts using modern observability frameworks to monitor and debug production systems • Mentor junior engineers and contribute to a culture of continuous improvement Essential Criteria: • Strong commercial experience in Golang and Python • Proven track record … secure application design principles • Hands-on experience designing and consuming RESTful and GraphQL APIs • Strong SQL skills and familiarity with data warehouses like Snowflake • Day-2 operations experience including observability, debugging, and triage Desirable Skills: • Experience with Auth0 , AWS Cognito , or similar identity platforms • Familiarity with Helm , Prometheus , Grafana , or OpenTelemetry • Exposure to other cloud platforms (GCP, Azure) • CI/ More ❯
Posted:

Loan IQ DevOps Engineer

Manchester Area, United Kingdom
Hybrid / WFH Options
Revolent Group
related processes like data migrations and environment setup. ✅ Preferred (Nice to Have): Banking/Financial Services knowledge — especially around wholesale lending and Loan IQ . Experience with monitoring and observability tools such as APPD, ELK Stack, or Grafana. Understanding of DevSecOps principles , including vulnerability scanning, secrets management, and compliance automation. Further experience with CI/CD integration and pipeline automation More ❯
Posted:

Platform Engineer - DevOps Specialist

Knutsford, Cheshire, United Kingdom
Square One Resources
this role, you will assist in upgrading the Elastic DP estate to Kubernetes, moving away from obsolete technology (Cloudera), upgrading to RHEL 8, and contributing to improving stability and observability of the platform. You will provide advanced analytics tooling and services for modeling analytics, working across continuous integration, development, build, and deployment using automation and cloud technologies to support the More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

Bristol, Gloucestershire, United Kingdom
Hybrid / WFH Options
Twinstream Limited
Socials & Events Cycle to Work Scheme & Life Assurance Key Responsibilities of the Site Reliability Engineer: Work closely with engineers and sysadmins to increase performance and reduce toil Advance system observability, monitoring and alerting Automate, troubleshoot, and proactively resolve issues before they escalate Improve development environments to meet delivery and quality targets Research and evaluate tools and platforms to support scale More ❯
Employment Type: Permanent
Salary: GBP 80,000 - 110,000 Annual
Posted:

Site Reliability Engineer

BS1, Bristol, City of Bristol, United Kingdom
Hybrid / WFH Options
Twinstream Limited
Socials & Events Cycle to Work Scheme & Life Assurance Key Responsibilities of the Site Reliability Engineer: Work closely with engineers and sysadmins to increase performance and reduce toil Advance system observability, monitoring and alerting Automate, troubleshoot, and proactively resolve issues before they escalate Improve development environments to meet delivery and quality targets Research and evaluate tools and platforms to support scale More ❯
Employment Type: Permanent
Salary: £80000 - £110000/annum Hybrid, Great Benefits
Posted:

Site Reliability Engineer

Bristol, Avon, South West, United Kingdom
Hybrid / WFH Options
Twinstream Limited
Socials & Events Cycle to Work Scheme & Life Assurance Key Responsibilities of the Site Reliability Engineer: Work closely with engineers and sysadmins to increase performance and reduce toil Advance system observability, monitoring and alerting Automate, troubleshoot, and proactively resolve issues before they escalate Improve development environments to meet delivery and quality targets Research and evaluate tools and platforms to support scale More ❯
Employment Type: Permanent, Work From Home
Posted:

Solutions Architect [UAE Based]

South East London, England, United Kingdom
AI71
multi-tenant SaaS or large enterprise application. Certifications: AWS Certified Solutions Architect, Google Professional Cloud Architect, Azure Solutions Architect Expert. Experience in data architecture, AI/ML integration, and observability frameworks . More ❯
Posted:

Senior DevOps Engineer

Belfast, United Kingdom
Menlo Ventures
Terraform). Experience in software development in general, with skills in a high-level language (e.g., Python, JavaScript, TypeScript, Java) and familiarity with modern development practices Understanding of Cloud Observability, Monitoring, and Tracing tools (Datadog, CloudWatch, Jaeger, ELK) and how best to leverage to support effective MTTR and mitigate high CFR Our UK benefits: Stock Options Annual Performance Bonus or More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

DevOps Engineer - GammaLabs

Manchester, United Kingdom
Hybrid / WFH Options
Gamma Communications plc
position will align to a discipline where you will be expected to build and support solutions aligned with SDLC principles, providing technical excellence with a focus on scripting and observability coupled with a security mindset. What will you be doing day-to-day? Automation and Orchestration: Streamline the delivery and support processes by leveraging automation and IaC principles. Support and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Principal MLOps/GenAI Infrastructure Engineer

Glasgow, United Kingdom
Hybrid / WFH Options
BBC Group and Public Services
/CD pipelines using GitHub Actions, AWS CodePipeline, Jenkins, and other tools, with an emphasis on reliability, reusability, and performance. Contribute to the design and integration of monitoring and observability solutions (CloudWatch, Prometheus, Grafana) to ensure infrastructure and model health. Champion software engineering excellence through Test-Driven Development (TDD), rigorous test automation, and continuous quality assurance practices. Support architectural decisions More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Principal MLOps/GenAI Infrastructure Engineer

Salford, Manchester, United Kingdom
Hybrid / WFH Options
BBC Group and Public Services
/CD pipelines using GitHub Actions, AWS CodePipeline, Jenkins, and other tools, with an emphasis on reliability, reusability, and performance. Contribute to the design and integration of monitoring and observability solutions (CloudWatch, Prometheus, Grafana) to ensure infrastructure and model health. Champion software engineering excellence through Test-Driven Development (TDD), rigorous test automation, and continuous quality assurance practices. Support architectural decisions More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Principal MLOps/GenAI Infrastructure Engineer

Cardiff, South Glamorgan, United Kingdom
Hybrid / WFH Options
BBC Group and Public Services
/CD pipelines using GitHub Actions, AWS CodePipeline, Jenkins, and other tools, with an emphasis on reliability, reusability, and performance. Contribute to the design and integration of monitoring and observability solutions (CloudWatch, Prometheus, Grafana) to ensure infrastructure and model health. Champion software engineering excellence through Test-Driven Development (TDD), rigorous test automation, and continuous quality assurance practices. Support architectural decisions More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Principal MLOps/GenAI Infrastructure Engineer

Newcastle Upon Tyne, Tyne And Wear, United Kingdom
Hybrid / WFH Options
BBC Group and Public Services
/CD pipelines using GitHub Actions, AWS CodePipeline, Jenkins, and other tools, with an emphasis on reliability, reusability, and performance. Contribute to the design and integration of monitoring and observability solutions (CloudWatch, Prometheus, Grafana) to ensure infrastructure and model health. Champion software engineering excellence through Test-Driven Development (TDD), rigorous test automation, and continuous quality assurance practices. Support architectural decisions More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer - Content & Personalisation

Bristol, Avon, South West, United Kingdom
Hybrid / WFH Options
Hargreaves Lansdown
Excited to grow your career? Our purpose is to empower people to save and invest with confidence. We are looking for great people to join us, so please come and invest in YOUR future at HL. We know that sometimes More ❯
Employment Type: Permanent, Part Time
Salary: £75,000
Posted:

Observability Engineer - Grafana Dashboarding

South East London, England, United Kingdom
Levy Global
We’re seeking an experienced contractor to support the delivery of observability solutions for a new, large-scale infrastructure environment. This role focuses on developing insightful and automated Grafana dashboards, with a strong emphasis on data integration and actionable telemetry. Required Skills Excellent, concise communication skills - essential for collaborating with technical teams to shape observability outputs. Deep experience with Grafana … dashboard creation, templating, and performance optimization. Strong understanding of PromQL, VictoriaMetrics, or VictoriaLogs query languages. Ability to interpret and map RESTful API data into observability pipelines and dashboards. Familiarity with IaC outputs and tooling (e.g., Terraform) as data sources for observability. Solid programming ability in Golang (preferred) or Python for automation and integration. Strong collaboration skills to work with cross More ❯
Posted:
Observability
the UK excluding London
10th Percentile
£49,563
25th Percentile
£61,563
Median
£74,500
75th Percentile
£85,000
90th Percentile
£98,500