176 to 200 of 218 Remote/Hybrid Observability Jobs

Senior Platform Engineer

Hiring Organisation
Connells Limited
Location
Milton Keynes, Buckinghamshire, South East, United Kingdom
Employment Type
Permanent, Work From Home
teams to understand their needs and deliver solutions. You will work across multiple technical domains including orchestration, automation, CI/CD pipelines, cloud services, observability, and security, developing deeper expertise in areas that align with platform priorities and your interests. Experience with Microsoft Azure is essential. You will play your … concepts Understanding of cloud networking concepts (VNets, subnets, NSGs) Awareness of cloud security best practices and compliance requirements Basic knowledge of monitoring, logging, and observability tools Understanding of cloud cost management and resource optimisation principles Comfort with troubleshooting and supporting development teams Understanding of service reliability and incident response practices ...

Head of Software Engineering - 2 Days London City/3 Remote

Hiring Organisation
ZENZO DIGITAL LTD
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£90,000
code (Bicep, Terraform), and container orchestration (AKS, Docker). Embed engineering best practice: implement CI/CD, code quality gates, automated testing, and observability from build to production. Guide platform evolution: re-engineer legacy .NET and SQL systems into modular, API-driven architectures. Build and empower teams: mentor full-stack …/CD automation, gated releases, and environment governance Infrastructure-as-Code (Bicep, ARM, Terraform) Containerisation (Docker, AKS) and serverless (Azure Functions) Monitoring and observability (Application Insights, Log Analytics) Secrets management and vulnerability scanning (Key Vault, SonarQube, OWASP) Architecture & Design Microservices and event-driven design (Service Bus, Event Grid, Kafka) Domain ...

Senior Python Engineer (£100k + benefits)

Hiring Organisation
Morson Edge
Location
Manchester, North West, United Kingdom
Employment Type
Permanent, Work From Home
Great opportunity for Senior Python Engineers to work remotely for a UK based AI scale-up. You'd join a large engineering department and would work within a cross functional product-based team responsible for ...

Solution Architect Data and Systems Inside IR35

Hiring Organisation
Interact Consulting Limited
Location
South West London, London, United Kingdom
Employment Type
Contract, Work From Home
dependencies. Document current-state architecture and define target-state options, balancing trade-offs in security, scalability, and operability. Establish NFR baselines (availability, performance, resilience, observability, RTO/RPO, and cost). Define and document security architecture posture (Entra ID/OIDC, encryption, key management, network zoning, data protection). Produce ...

Data Warehouse Architect

Hiring Organisation
TXP
Location
City of London, London, United Kingdom
Employment Type
Contract
Contract Rate
£700 - £750/day Outside IR35
Standardisation validation and data quality layers Master and reference data integration Analytical and dimensional modelling approaches Semantic layer and BI consumption patterns Metadata lineage observability and monitoring Architecture Leadership Act as architectural authority and design reviewer Produce architecture artefacts principles and standards Translate business strategy into scalable data architectures Work ...

AI Technical Lead

Hiring Organisation
Lorien
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
Salary negotiable
governs, and optimises AI systems across the bank. This is a hands-on, high-impact role at the intersection of AI governance, distributed systems, observability, and platform engineering. This role is based in Edinburgh OR London. This role will be Via Umbrella. Working in a Hybrid Model of 2 days … site. What You'll Do Lead the architecture, design, and engineering of the AI Control Tower platform. Build the core framework that enables AI observability, guardrails, performance monitoring, and lifecycle management. Shape the technical roadmap in partnership with product leaders, ensuring delivery against ambitious milestones. Establish engineering standards, patterns ...

Site Reliability Engineer

Hiring Organisation
Nigel Wright Group
Location
Newcastle Upon Tyne, Tyne and Wear, England, United Kingdom
Employment Type
Full-Time
Salary
£70,000 per annum
high-performing engineering function. This role blends elements of software engineering, DevOps, and modern SRE practices, offering the opportunity to work across cloud platforms, observability tools, and backend systems. You will sit within a collaborative engineering team and contribute to building reliable, scalable, API-driven services that support business-critical … e.g., serverless, containers, storage, and managed databases). Implementing SRE principles including SLIs/SLOs, error budgets, incident reduction, and resilience engineering. Enhancing system observability using monitoring and logging platforms (e.g., DataDog or equivalent). Supporting the creation and improvement of CI/CD pipelines and deployment processes. Ensuring customer ...

Senior DevOps Engineer

Hiring Organisation
Xact Placements Limited
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£90,000 per annum
doing Designing and evolving distributed, multi-region infrastructure Solving complex scaling, reliability and performance challenges Driving DevOps performance across deployment, availability and recovery Improving observability, fault tolerance and operational maturity Championing infrastructure-as-code, automation and secure-by-design practices Collaborating across engineering, product and security teams What they … looking for Significant experience building and operating distributed systems at scale Strong cloud background (AWS/Azure), Terraform and Kubernetes Experience with observability tooling (Prometheus, Grafana, EFK) and messaging systems (Kafka) Solid understanding of networking fundamentals and global architecture Comfortable operating at Principal level and influencing technical direction ...

Senior DevOps Engineer

Hiring Organisation
Opus Recruitment Solutions Ltd
Location
Leeds, West Yorkshire, England, United Kingdom
Employment Type
Contractor
Contract Rate
£450 per day
Opus is working with a client who is looking for a Senior DevOps Engineer to support the development and evolution of a large-scale observability platform. This is an opportunity for someone who brings strong technical foundations alongside the right mindset: intelligent, adaptable, confident, and able to work independently while … communicating effectively with the wider team. Role Responsibilities Contribute to the design, development and operation of a large-scale observability platform Build and maintain cloud infrastructure using Infrastructure as Code Work across distributed systems at scale Write, review and test code as part of DevOps engineering practices Manage deployments ...

DevOps Systems Engineer Hybrid Cloud

Hiring Organisation
Ernest Gordon Recruitment Limited
Location
Guildford, Surrey, England, United Kingdom
Employment Type
Full-Time
Salary
£60,000 - £65,000 per annum
cloud environments. You'll drive automation using Terraform and Ansible, build and improve CI/CD pipelines with GitHub Actions, and implement monitoring and observability solutions to improve reliability and performance across the business. You will also act as a technical escalation point, mentoring junior engineers and influencing best practice. … Role: Designing, deploying and supporting AWS and hybrid cloud infrastructure Building and maintaining CI/CD pipelines using GitHub Actions Implementing monitoring and observability tools (Grafana, Prometheus, CloudWatch) Participating in an on-call rota and incident response Monday to Friday (Hybrid working 1 day a month site based) The Person ...

Snowflake Data Architect - £550 Inside IR35- Hybrid

Hiring Organisation
Tenth Revolution Group
Location
Warwick, Warwickshire, England, United Kingdom
Employment Type
Contractor
Contract Rate
£400 - £550 per day
/ELT processes for structured and semi-structured data* Develop data engineering solutions using Python for data processing, automation, and orchestration* Implement monitoring and observability for data systems using Prometheus* Define data models, schemas, and standards to ensure data consistency and quality* Collaborate with data engineers, analysts, and business stakeholders … Hands-on experience with DBT for data transformation and modeling* Solid understanding of ET/ELT architecture and best practices* Experience with monitoring and observability tools such as Prometheus* Strong knowledge of data modeling, data warehousing concepts, and cloud architecture* Excellent problem-solving and communication skills Preferred Qualifications* Experience with ...

Senior Software Engineer - Amplify C.S

Hiring Organisation
Klaviyo Inc
Location
Dublin, Ireland
Employment Type
Permanent
Salary
EUR 125,000 - 150,000 Annual
measurably improve productivity, quality, and reliability. Take ownership of the technical and architectural evolution of parts of the Amplify platform, anticipating scalability, reliability, and observability needs as usage grows. Define and maintain operational standards for the systems you own, including SLOs, monitoring, incident response, and follow-up on root-cause … applying AI thoughtfully to improve engineering productivity and system capabilities. You take ownership of operational excellence for the systems you build, including performance, reliability, observability, and on-call participation where required. You enjoy questioning convention and continuously improving how things work, whether that's architecture, tooling, workflows, or team practices. ...

Full Stack Engineer

Hiring Organisation
EMW Staffing Solutions LLC
Location
Greater London, England, United Kingdom
life intuitive, AI-native user experiences tailored to litigation workflows, in partnership with the founders and early customers. • Improve system performance, stability and observability as we scale to more cases, larger datasets and additional firms. • Help shape our engineering culture and team – from best practices and code standards to interviewing … shape how the core litigation platform evolves over the coming years. Help define engineering culture from scratch – everything from code standards and observability to how we run interviews and onboard future engineers. Build deep relationships with elite UK and US law firms and see your work used on complex, cross ...

Principal Platform Engineer

Hiring Organisation
Xact Placements Limited
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£90,000 per annum
doing Designing and evolving distributed, multi-region infrastructure Solving complex scaling, reliability and performance challenges Driving DevOps performance across deployment, availability and recovery Improving observability, fault tolerance and operational maturity Championing infrastructure-as-code, automation and secure-by-design practices Collaborating across engineering, product and security teams What they … looking for Significant experience building and operating distributed systems at scale Strong cloud background (AWS/Azure), Terraform and Kubernetes Experience with observability tooling (Prometheus, Grafana, EFK) and messaging systems (Kafka) Solid understanding of networking fundamentals and global architecture Comfortable operating at Principal level and influencing technical direction ...

Storage Senior Automation Specialist - Contract - 11 months

Hiring Organisation
CBSbutler Holdings Limited trading as CBSbutler
Location
Sheffield, South Yorkshire, United Kingdom
Employment Type
Contract
Contract Rate
£400 - £430/day
Automation Storage Specialist to provide technical leadership across enterprise-scale storage automation. This role sits at the intersection of storage technology, automation, testing, and observability, with close alignment to OpenShift platform engineering. You will shape strategy, define standards, and build reusable automation patterns that are adopted across the bank, while … security, and compliance. * Act as a technical authority, reviewing designs and guiding engineering teams. * Drive adoption of GitOps, IaC, and modern automation practices. * Own observability KPIs, ensuring continuous improvement of automation SLAs. * Collaborate with platform, security, and infrastructure stakeholders across the organisation. Skills & Experience: * Expert-level experience with Ansible, Python ...

Digital Product Owner

Hiring Organisation
Kintec Global Recruitment
Location
Gothenburg, Sweden
Employment Type
Contract
Terraform, Bicep, and Ansible. - DevOps & CI/CD: Experience with CI/CD pipeline creation and integration, particularly GitHub Actions and Azure DevOps. - Monitoring & Observability: Experience with monitoring tools such as Zabbix, Azure Monitor, Grafana, and Prometheus. - Product Ownership: Strong backlog management, stakeholder engagement, prioritization, and roadmap definition capabilities. Experience … Azure. - Demonstrated ability to align cloud strategies with business objectives. - Successful track record delivering large-scale platform automation projects. - Experience designing and implementing observability frameworks. - Worked collaboratively with cross-functional teams (development, operations, business, security). Key Skills: - Strategic thinker with technical depth in cloud and infrastructure. - Influential leader ...

Active SC - Data Engineer - Remote

Hiring Organisation
Stealth IT Consulting
Location
United Kingdom
Employment Type
Contract
Contract Rate
GBP 500 Annual
Job Title: Data Engineer Rate: £500 (Inside IR35) Duration:6 months Location: Remote Clearance: Active SC Stages: 1 Stage Must Have Active SC What you'll do Engineer production-grade data pipelines on Microsoft Fabric ...

Integration Engineer

Hiring Organisation
POWWR
Location
Manchester Area, United Kingdom
POWWR are seeking a skilled Integration Engineer to join our UK engineering team to help deliver high-quality, customer focused integrations for POWWR’s SaaS energy marketplace platform. This is an excellent opportunity for someone ...

Site Reliability Engineer

Hiring Organisation
Searchability (UK) Ltd
Location
Wigan, Greater Manchester, North West, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£70,000
maintain high standards across the platform. SITE RELIABILITY ENGINEER ESSENTIAL SKILLS * Strong understanding of reliability engineering, scalable architectures and performance optimisation * Experience with observability, debugging and incident response * Proficiency in a programming language for automation and tooling (GO or .NET preferred) * Cloud experience, ideally AWS, and knowledge of container orchestration … Kubernetes) and Infrastructure as Code (Terraform) * Experience with monitoring and observability tools such as Grafana, Prometheus or OpenTelemetry * Strong understanding of networking fundamentals and distributed systems * Ability to collaborate effectively with engineering, operations and product teams TO BE CONSIDERED: Please either apply through this advert or email me directly ...

Tech Lead - Data Analytics & Data Engineering (AWS) - SC+NPPV3

Hiring Organisation
Sanderson Government and Defence
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£550 - £600 per day
architecture decisions, including data modelling, partitioning strategies, and performance optimisation Defining and enforcing engineering standards across code quality, testing, CI/CD, IaC, and observability Building and optimising ELT/ETL pipelines using Glue (PySpark) , Lambda, Step Functions, and event-driven patterns Championing security-by-design in collaboration with security … Agile delivery environments Nice to Have: Experience with AWS Lake Formation, Lambda, Step Functions, Athena, EMR, DataBrew Familiarity with data quality, lineage, or observability tooling Knowledge of GDS ways of working and public sector delivery frameworks Experience with privacy-by-design, data retention, and FOIA considerations Exposure to dbt , Redshift ...

Cloud Architect

Hiring Organisation
Ultima
Location
United Kingdom
Job Description: Cloud Architect – Azure, DevOps, Terraform (with Technical Account Management Focus) Position: Cloud Architect Location: Remote (UK-based) Type: Full-time We are seeking a skilled and client-focused Cloud Architect with deep expertise ...

Site Reliability Engineer

Hiring Organisation
NFU Mutual
Location
Stratford-Upon-Avon, Warwickshire, West Midlands, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£50,000
supportive team, enabling NFU Mutual to understand, operate, and continually improve its technology services through world-class observability. Drive the evolution of observability across the business, improving reliability, performance, and customer experience Deploy and mature modern observability tooling to support high-profile change and transformation initiatives Hybrid … homeworking available and 20% in Stratford-upon-Avon About the role We have an exciting permanent opportunity for a Site Reliability Engineer (Observability) to join our Monitoring Team within IT Infrastructure Products. This role is central to our ambition to move beyond traditional monitoring and establish end-to-end observability ...

Platform Architect- Procurement (m/f/d)

Hiring Organisation
METRO Digital GmbH
Location
Düsseldorf, Nordrhein-Westfalen, Germany
Employment Type
Permanent
Salary
EUR Annual
establish standards and guardrails, and guide design decisions across squads in collaboration with Enterprise Architecture and the Design Council; Ensure Operational Excellence - Drive automation, observability, resilience, and performance improvements. Monitor platform KPIs and recommend architecture changes to continuously improve reliability and efficiency; Maintain & Evolve Architecture Documentation - Keep architectural artifacts … Strong communication skills to explain complex technical concepts to both engineers and business leaders; Operational Excellence Mindset - Knowledge of DevOps/SRE practices, automation, observability, and monitoring. Ability to interpret platform KPIs and drive continuous improvements; Documentation & Knowledge Management - Disciplined in maintaining architecture artifacts in tools like LeanIX, Confluence ...

OpenShift Telemetry Engineer

Hiring Organisation
Stackstudio Digital Ltd
Location
London, United Kingdom
Employment Type
Contract, Work From Home
Contract Rate
From £450 to £500 per day
skilled OpenShift Telemetry Engineer to join our team. Your Responsibilities In this role, you will be: Primarily responsible for implementing, managing, and optimizing the observability stack within a Red Hat OpenShift Container Platform environment to ensure system health, performance, and security. Bridge the gap between application monitoring and infrastructure, leveraging … OpenShift telemetry via Kafka (producers, topics, schemas) and build resilient consumer services for transformation and enrichment. Engineer data models and routing for multi-tenant observability; ensure lineage, quality, and SLAs across the stream layer. Integrate processed telemetry into Splunk for visualization, dashboards, alerting, and analytics to achieve Observability Level ...

AI Ops Engineer - up to £85,000 Benefits - Hybrid - Derby

Hiring Organisation
Involved Solutions
Location
Derby, Derbyshire, England, United Kingdom
Employment Type
Full-Time
Salary
£75,000 - £85,000 per annum
deployment pipelines for AI models, prompts and supporting artefacts Own lifecycle management including versioning, promotion, rollback and retirement of AI solutions Implement monitoring and observability covering performance, usage, drift and data quality Ensure AI systems meet security, compliance and governance requirements Optimise inference performance, scalability and cost efficiency Manage infrastructure … experience with CI/CD pipelines for data or ML workloads Experience managing cloud-based infrastructure for AI workloads Solid understanding of monitoring, observability and operational resilience Strong collaboration skills with the ability to work across engineering and data teams Experience supporting secure, compliant and well-governed systems Desirable Skills ...