376 to 400 of 486 Permanent Observability Jobs

Senior Lead Engineer

Hiring Organisation
Investigo
Location
City of London, London, United Kingdom
change management tools like Liquibase into automated pipelines Apply DevSecOps best practices across the lifecycle: static analysis, dependency scanning, and secure credential management Ensure observability, monitoring, and performance using GCP Operations Suite or New Relic Mentor engineers and collaborate across global, distributed teams What We’re Looking For Proven experience … expertise: BigQuery, Dataproc, Cloud Composer Deep data architecture and engineering knowledge: Spark, DBT, Oracle, BigQuery Experience designing scalable architectures (Microservices, Monoliths, Batch) Skilled in observability, monitoring, and DevSecOps integration Excellent communication with a record of collaborating globally Why You’ll Love It Combine architecture, coding, and leadership in one role ...

Senior / Lead Data Engineer (AI-Focused)

Hiring Organisation
PaymentGenes
Location
City of London, London, United Kingdom
inference (batch and real-time) Evaluate and integrate emerging AI tooling where strategically valuable 🔧 Technical Leadership Set best practices for testing, documentation, lineage, and observability Lead code reviews and mentor data & analytics engineers Drive CI/CD and infrastructure-as-code adoption Own platform reliability, performance optimisation, and cost efficiency … Infrastructure Feature engineering architecture ML pipeline and deployment workflows Experience supporting production ML systems Familiarity with embeddings, vector databases, LLM orchestration (desirable) Data observability and model monitoring Platform & DevOps CI/CD for data workflows Git-based engineering standards Docker/containerisation Infrastructure-as-code (e.g., Terraform) Monitoring and alerting ...

Software Engineer

Hiring Organisation
Hydrogen Group
Location
City of London, London, United Kingdom
Code Scaling and managing large fleets of IoT devices in the field Developing CI/CD pipelines and automation across the stack Implementing observability, monitoring, and telemetry (cloud + edge) Supporting security and compliance standards (e.g. SOC2, HIPAA) Improving developer workflows and engineering productivity What We Are Looking For 5+ … Docker & Kubernetes (EKS preferred) Proficiency in Python, Go, or another modern language Experience building CI/CD pipelines and automation Hands-on experience with observability tools (Grafana, Prometheus) Nice to Have Experience with IoT/edge infrastructure (device provisioning, OTA updates) Hybrid or multi-cloud environments SOC2 compliance exposure High ...

Cloud Engineer

Hiring Organisation
Spectrum IT Recruitment Limited
Location
Southampton, Hampshire, South East, United Kingdom
Employment Type
Permanent
Salary
£65,000
secure, resilient cloud infrastructure across AWS and Azure. You'll play a key role in modernising platforms, migrating legacy services, and improving automation, observability and security across a multi-cloud estate. Cloud Engineer (AWS & Azure) Hybrid (2 days per month onsite) Location: Southampton What you'll be doing Designing … similar), Azure DevOps, PowerShell, Azure CLI Scripting: PowerShell, Python, Bash Containers: Docker, container registries (e.g., ACR) CI/CD: Azure DevOps Pipelines, YAML automation Observability: Datadog, Grafana Cloud, OpenTelemetry, CloudWatch, Prometheus, Loki Benefits (from day one) Up to 15% Bonus scheme 25 days annual leave + bank holidays Pension ...

Staff Engineer

Hiring Organisation
Xapien
Location
London Area, United Kingdom
workflows and MongoDB running on Kubernetes in GCP. You'll work with modern patterns including event-driven architectures, gRPC and REST APIs, and comprehensive observability with Grafana Cloud. We're an AI-native engineering team. We use Claude Code daily and we're investing heavily in AI-assisted development … team — establishing shared conventions, measuring impact, and helping engineers level up Background in SaaS platforms or B2B products at scale Expert-level knowledge of observability tools (Grafana, Prometheus, etc.) Deep understanding of authorization patterns, security, and multi-tenancy Experience with protobuf and gRPC Our Tech Stack Languages: Go Databases: MongoDB ...

DevOps Engineer

Hiring Organisation
Tata Consultancy Services
Location
Belfast, Northern Ireland, United Kingdom
/CD pipelines, environment provisioning, and release automation, enabling secure, scalable, and reliable software delivery across enterprise platforms. The role focuses on automation, observability, security, and reliability engineering. Your responsibilities: Own and maintain CI/CD pipelines for build, test, and deployment automation. Provision and manage environments using Infrastructure … Docker and Kubernetes. Implement release automation and deployment strategies. Integrate security scanning and compliance checks into CI/CD pipelines. Implement monitoring, logging, and observability solutions. Drive reliability practices including SLO/SLI definitions and incident response. Monitor platform health, performance, and availability. Collaborate with development, QA, and security teams ...

Senior Python Engineer (£100k + benefits)

Hiring Organisation
Morson Edge
Location
Manchester, North West, United Kingdom
Employment Type
Permanent, Work From Home
Great opportunity for Senior Python Engineers to work remotely for a UK based AI scale-up. You'd join a large engineering department and would work within a cross functional product-based team responsible for ...

Lead Backend Engineer - Assistance

Hiring Organisation
N26 GmbH
Location
Berlin, Germany
Employment Type
Permanent
Salary
EUR Annual
About the opportunity You will be part of the Intelligent Operations Platforms (IOP) or the Assistance segments, whose cross-functional teams power the operations at N26 and in some cases enable other tech teams through ...

Lead Backend Engineer - Assistance

Hiring Organisation
N26 GmbH
Location
Potsdam, Brandenburg, Germany
Employment Type
Permanent
Salary
EUR Annual
About the opportunity You will be part of the Intelligent Operations Platforms (IOP) or the Assistance segments, whose cross-functional teams power the operations at N26 and in some cases enable other tech teams through ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
London, United Kingdom
Employment Type
Permanent
Salary
GBP 50,000 Annual
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large ...

Hybrid Domain Consolidation Analyst | IT Infrastructure

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
leading observability solutions provider in Greater London is seeking a Domain Consolidation Analyst for a 6-month full-time contract with hybrid work. The role involves managing a project to consolidate IT domains, coordinating with third parties, and ensuring compliance with ISO 27001 standards. Candidates must have at least ...

Partner Manager

Hiring Organisation
Timebeat
Location
London Area, United Kingdom
written communication and CRM discipline Nice to have Experience with channel models (reseller, referral, MSP, SI), co-sell motions, or marketplace partnerships Familiarity with observability/monitoring, networking, infrastructure tooling, or developer-facing products Experience building partner programs from scratch (tiering, enablement, certification, MDF) Success metrics (examples) Number ...

GCP DevOps Lead

Hiring Organisation
Infoplus Technologies UK Ltd
Location
Bristol, Somerset, United Kingdom
Employment Type
Permanent
Salary
GBP Annual
Actions, Harness, Jenkins). Networking & Security: Experience with GCP Cloud Armor, GCP Networking, and embedding secure-by-design controls from design to runtime. Automation & Observability: Implementing actionable observability, performance tuning, and automation to reduce toil. Defining and operating against SLOs/SLIs. Scripting & Tooling: Scripting in Bash, PowerShell, or Python. … Performance & Reliability: Define, monitor, and operate against service level objectives (SLOs/SLIs), ensuring high availability, performance, and fault tolerance. Continuous Improvement: Drive automation, observability, and performance tuning to reduce manual effort and improve platform reliability. Collaboration: Work closely with architecture and feature teams to evolve the cloud roadmap ...

Site Reliability Engineer

Hiring Organisation
McGregor Boyall Associates Limited
Location
Leeds, West Yorkshire, Yorkshire, United Kingdom
Employment Type
Permanent
Salary
£80,000
implementation across the GCP/Azure platforms. They are looking for several Site Reliability Engineers (SREs) to help improve the reliability, performance and observability of our Azure and GCP environments. You'll work within a multidisciplinary engineering squad, supporting the delivery, operation and continuous improvement of our cloud-hosted services. … Support the reliability and performance of the cloud platforms your squad owns. Use observability tools, metrics, logs and traces to detect and prevent issues. Contribute to incident response, post-incident reviews and problem management activities. Build automation that removes toil and improves operational efficiency. Work collaboratively with engineers, Product Owners ...

Site Reliability Engineering Lead – Financial Services

Hiring Organisation
Alexander Ash Consulting
Location
London Area, United Kingdom
operations, and improvement of the SRE platforms, teams, and organisation. You will be responsible for leading and scaling the SRE function, driving intelligent automation, observability, and resilience, across the organisation, and leading on production incidents, from frameworks to resolution. You will work in a hybrid on-premise/AWS-based … related fields (platform engineering, DevOps etc.) Deep technical experience in cloud-native AWS and on-premise systems architecture Strong incident management and observability experience for large scale systems Intelligent automation/Agentic AI experience preferred Excellent AWS services, data platforms, software engineering, CI/CD, IaC, experience Degree educated ...

AI Engineer – Production LLM Systems

Hiring Organisation
Redimeer
Location
London Area, United Kingdom
orchestration. You will work on: Multi‐agent architectures Intelligent tool and API integrations RAG pipelines and vector‐based retrieval Evaluation frameworks and AI observability Production workflows that ensure reliability, consistency, and scale You’ll play a critical role in crafting the orchestration layer that makes LLM systems trustworthy—handling … improving robustness across diverse use cases. Key Responsibilities Build production AI systems using LLMs, RAG pipelines, vector databases, and agentic frameworks Design evaluation and observability frameworks to measure performance, accuracy, and reliability Develop clean, scalable applications with proper error handling, APIs, and data pipelines Implement and maintain retrieval systems (vector ...

Principal Developer Team Lead

Hiring Organisation
Cambridge University Press & Assessment
Location
Cambridge, Cambridgeshire, United Kingdom
Employment Type
Permanent
Salary
GBP 51,400 - 68,800 Annual
legacy applications to cloud-native AWS architectures Build DevOps automation to support SRE practices Establish AI/ML development standards and frameworks Set observability, monitoring, and incident response standards Promote best practices in web, event-driven, and cloud-native technologies Provide technical expertise and oversee code reviews People Leadership Manage … more modern programming languages Experience with AWS cloud and infrastructure DevOps skills: automation, CI/CD, infrastructure-as-code Understanding of SRE and observability Experience in web-apps and modern frameworks Strong communicator with technical and non-technical audiences Technical Expertise CI/CD pipelines, automation frameworks, and developer tooling ...

Senior Data Ops Engineer

Hiring Organisation
Specsavers
Location
Nottingham, Nottinghamshire, England, United Kingdom
Employment Type
Full-Time
Salary
£85,000 per annum
line” behind our data products. You’ll be focused on making our data platform faster, more reliable, and easier to work with using automation, observability, and modern engineering practices to improve quality, resilience, and speed to value. If you enjoy solving complex problems, reducing friction for engineering teams, and making … example and champion best practice across the data engineering community. What really sets you apart is your mindset. You care deeply about quality, observability, and operational excellence. You enjoy collaborating across teams, explaining complex technical concepts in simple terms, and helping others learn and improve. You’re curious, proactive ...

Senior Data Ops Engineer

Hiring Organisation
Specsavers
Location
St. Andrews, Fife, Scotland, United Kingdom
Employment Type
Full-Time
Salary
£85,000 per annum
line” behind our data products. You’ll be focused on making our data platform faster, more reliable, and easier to work with using automation, observability, and modern engineering practices to improve quality, resilience, and speed to value. If you enjoy solving complex problems, reducing friction for engineering teams, and making … example and champion best practice across the data engineering community. What really sets you apart is your mindset. You care deeply about quality, observability, and operational excellence. You enjoy collaborating across teams, explaining complex technical concepts in simple terms, and helping others learn and improve. You’re curious, proactive ...

Senior Data Ops Engineer

Hiring Organisation
Specsavers
Location
Whiteley, Fareham, Hampshire, England, United Kingdom
Employment Type
Full-Time
Salary
£85,000 per annum
line” behind our data products. You’ll be focused on making our data platform faster, more reliable, and easier to work with using automation, observability, and modern engineering practices to improve quality, resilience, and speed to value. If you enjoy solving complex problems, reducing friction for engineering teams, and making … example and champion best practice across the data engineering community. What really sets you apart is your mindset. You care deeply about quality, observability, and operational excellence. You enjoy collaborating across teams, explaining complex technical concepts in simple terms, and helping others learn and improve. You’re curious, proactive ...

Platform Engineer

Hiring Organisation
Connells Limited
Location
Milton Keynes, Buckinghamshire, South East, United Kingdom
Employment Type
Permanent
teams to understand their needs and deliver solutions. You will work across multiple technical domains including orchestration, automation, CI/CD pipelines, cloud services, observability, and security, developing deeper expertise in areas that align with platform priorities and your interests. Experience with Microsoft Azure is essential. Daily coding … concepts Understanding of cloud networking concepts (VNets, subnets, NSGs) Awareness of cloud security best practices and compliance requirements Basic knowledge of monitoring, logging, and observability tools Understanding of cloud cost management and resource optimisation principles Comfort with troubleshooting and supporting development teams Understanding of service reliability and incident response practices ...

Head of Infrastructure

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
platform and infrastructure strategy Design and evolve cloud architecture to support scale, resilience, and performance Set standards for infrastructure, CI/CD, environments, and observability Make architectural decisions and trade‐offs Developer Experience (DevEx) Provide infrastructure for the development team to code, test and deploy efficiently Advise during design sessions … growing company Ability to operate production systems under pressure Deep hands‐on experience with the AWS cloud platform Strong background in reliability, observability, and incident management Experience leading or mentoring engineers What we offer in return 💰 Competitive salary depending on experience 🏝️ 27 days of annual leave (including 3 days Christmas ...

UK-Principal Software Engineer-YP

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
Anaplan's platform and third-party integrations Optimize model inference pipelines for performance, cost, and scalability in production environments Implement monitoring, logging, and observability for GenAI systems to track usage, errors, and model behaviour Collaborate with data scientists to productionize ML models and forecasting algorithms Write comprehensive tests, including unit … Experience with A/B testing and experimentation frameworks for AI features Contributions to open-source ML projects or research publications Experience with model observability tools (LangSmith, W&B, MLflow) What Makes This Role Exciting Lead a greenfield team building transformative AI capabilities from the ground up Work on cutting ...

Technical Lead - Full Stack - AWS - Microservices - East Kilbride/Hybrid (4 DPW On-Site)

Hiring Organisation
Curo Services
Location
East Kilbride, Lanarkshire, United Kingdom
Employment Type
Permanent
Salary
GBP 50,000 - 60,000 Annual
Subject - Technical Lead - Full Stack - AWS - Microservices - East Kilbride/Hybrid (4 DPW On-Site) - £50-60K Per Annum Job Title: Engineering Technical Lead Location: East Kilbride Salary: £50-60K Per Annum Benefits ...

Software Development Engineer In Test

Hiring Organisation
Response Informatics
Location
London, United Kingdom
Employment Type
Permanent
Required Qualifications Bachelor's or master's degree in Computer Science, Engineering, or a related technical field. 8+ years of hands-on software development experience, including large-scale backend systems or platform engineering. Expert in Python with ...