351 to 375 of 547 Remote/Hybrid Observability Jobs

Cloud Engineer

Hiring Organisation
BAE Systems
Location
United Kingdom
maintain compute workloads including VMs, VMSS, App Services, and containers. Manage Azure Storage, backup and recovery policies, and lifecycle configurations. Implement monitoring and observability using Azure Monitor, Log Analytics, Alerts, and Dashboards. Apply governance using Azure Policy, Management Groups, and RBAC models. Support CI/CD pipelines using Azure DevOps … load balancing. Familiarity with Azure landing zone concepts and the Cloud Adoption Framework. Understanding of container orchestration and AKS architecture. Knowledge of monitoring and observability practices within Azure (Monitor, Logs, Metrics). Familiarity with CI/CD solutions and DevOps pipelines. Understanding of identity concepts including Entra ID integration, managed ...

Cloud DevOps Engineer - Derby- £70K

Hiring Organisation
Akkodis
Location
Derbyshire, United Kingdom
Employment Type
Permanent
Salary
£50000 - £70000/annum
where there's genuinely a lot going on, in a good way. They're moving away from legacy infrastructure, modernising their cloud estate, improving observability, and continuing to build out their platform engineering capability. So if you enjoy being part of real change rather than just keeping the lights … collaboration too, you'll be working closely with Dev, QA and Product, helping teams release software reliably while also pushing forward things like monitoring, observability and overall platform resilience. Tech-wise? It's an Azure-first setup, but they're open to people who've worked across ...

Data Architect / Data Engineers

Hiring Organisation
Vaco LLC
Location
Cincinnati, Ohio, United States
Employment Type
Permanent
Salary
USD 100 Annual
data and analytics assets Data Engineering Design and implement metadata-driven batch ingestion frameworks Develop up to 10 production-grade data pipelines Implement monitoring, observability, and remediation processes Design dimensional data models optimized for analytics and reporting Modern Cloud Data Platforms Architect solutions using Azure data services (Microsoft Fabric, Synapse … Power BI report or dashboard to validate the solution Document reporting and analytics processes DataOps & MLOps Enablement Design CI/CD, testing, and observability frameworks across data pipelines Promote data quality, lineage, and reproducibility through modern DevOps practices Support AI-ready data architectures where applicable Enablement, Collaboration & Leadership Lead technical ...

Network Monitoring & Observability Engineer - Fully remote

Hiring Organisation
Akkodis
Location
Derby, Derbyshire, United Kingdom
Employment Type
Contract
Contract Rate
£70000 - £75000/annum
successful delivery of this initiative. This is a hands-on engineering role where you'll be responsible for designing, implementing, and commissioning monitoring and observability solutions across newly deployed fibre infrastructure and network equipment. Working closely with Network Operations and Core Network teams, you'll ensure full visibility of critical … services from day one through modern monitoring technologies, streaming telemetry, and AI-driven analytics. Key Responsibilities Monitoring & Observability Design and implement end-to-end monitoring solutions across new fibre infrastructure deployments. Build and maintain streaming telemetry pipelines to provide real-time network visibility. Configure, optimise, and manage VictoriaMetrics environments, including ...

SRE Managing Consultant - Cloud Operating Model

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
Budgets : Establish service measures and targets (SLIs/SLOs) and introduce Error Budgets to enable data‐driven trade‐offs between reliability and delivery velocity. Observability & Operational Insight: Shape observability approaches (metrics/logs/traces) and operational monitoring models that make reliability risks visible and actionable, improving operational decision‐making. … large‐scale delivery contexts; associate‐level certifications are desirable but not mandatory. Design, establish, and evolve SRE‐led centres of excellence (e.g. Reliability, Observability, or Operational Excellence), setting enterprise‐level standards for SLIs/SLOs, incident management, observability, and continuous improvement across cloud and hybrid platforms. Exposure to modern observability ...

Principal Engineer - Member Experience Platform

Hiring Organisation
Jobleads-UK
Location
Skipton, England, United Kingdom
Quality), and bar‐raising across squads: you shorten lead times, increase deployment frequency, hold change‐failure rate low, and improve MTTR through release‐linked observability - turning fast, safe flow into the default way of working. Operating at platform scale, you define cross‐cutting architecture and delivery standards (API/event contracts … resilience, observability, language/dependency baselines) and drive adoption through the Golden Path: policy‐as‐code CI/CD, progressive delivery (feature flags, canary/blue‐green), automated rollback/forward‐fix, ephemeral, data‐ready environments, and guardrails that make security and compliance by design. You partner with Platform Ownership ...

Senior DevOps Engineer

Hiring Organisation
Stealth IT Consulting Limited
Location
Telford, Shropshire, West Midlands, United Kingdom
Employment Type
Contract
Contract Rate
£580 per day Inside IR35
Observability Engineer (SC Eligible) Rate: £580/day Inside IR35 Duration: 6 months Location: Mostly remote (Telford occasional onsite - 2 days/month) Clearance: SC Eligible Role Overview We are seeking an experienced Observability Engineer to design, implement, and support enterprise-grade monitoring and observability solutions across complex technology environments. … role focuses on improving service visibility, performance insight, and proactive incident detection. Key Responsibilities Design and implement end-to-end observability solutions across enterprise platforms Translate NFRs and monitoring requirements into Dynatrace configurations Deliver APM, log analytics, synthetic monitoring, and infrastructure observability Build and maintain dashboards, alerts, and performance visualisations ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Engineering teams to drive high availability, reliability, and uptime. In this role, you will use a code‐first approach to reduce toil, advance our observability platforms using SLIs/SLOs, and ensure high compliance. You will actively manage operational success by serving as a key Incident Commander, participating … Doing: Integrating tightly with our Product Engineering teams Following SRE practices and maintaining high standards of compliance Implementing a new standard of observability utilising SLI/SLO/Error Budgets Continually evolving our observability platforms for greater coverage Using a code‐first approach to build and changes to reduce TOIL ...

Data Platform Engineer

Hiring Organisation
PRISM DIGITAL LIMITED
Location
Milton Keynes, Buckinghamshire, South East, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£75,000
availability Own incident resolution, root cause analysis, and continuous improvement Collaborate with engineers and third-party providers to mature the platform Contribute to monitoring, observability, and cost optimisation strategies Support projects and business initiatives through robust platform delivery What They're Looking For: Microsoft Fabric experience Terraform experience Cloud platform engineering … delivery environments What You'll Work With: Microsoft Fabric Terraform (Infrastructure as Code) Azure cloud technologies SQL Server GitHub/CI/CD tooling Monitoring & observability tools Platform design patterns (scalability, resilience, cost control) Nice to Haves: GitHub Actions/CI/CD pipelines Zero Trust architecture Cloud cost monitoring & reporting ...

Site Reliability Engineers

Hiring Organisation
F5 consultants
Location
Reading, Berkshire, South East, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£70,000
support, shared ownership, and continuous improvement. You'll work hands-on in a modern cloud-native environment leveraging Kubernetes, OpenShift, GitOps, service mesh, and observability tooling. There is genuine investment in your development through training, certifications, and the expertise of those around you. You'll also be part … Ability to work within complex multi-cloud or hybrid environments with a solid foundation in distributed systems Expertise in observability tooling such as Prometheus, Grafana, Loki, and Tempo Proficiency in IaC tools such as Kustomize and Helm, with scripting skills in Bash/Python Experience managing GitOps pipelines using Tekton ...

Infrastructure / DevOps Engineer

Hiring Organisation
rmg digital
Location
England, United Kingdom
Managing and optimising AWS services, including ECS, Lambda, VPC, and Aurora Postgres Building and maintaining CI/CD pipelines using GitHub Actions Implementing monitoring, observability, and alerting using Datadog Supporting development teams with deployment, automation, and operational best practices Improving infrastructure security, scalability, reliability, and cost-efficiency Monitoring system performance … Infrastructure as Code tools such as Terraform and/or CDK Understanding of CI/CD pipelines and GitHub Actions Familiarity with monitoring and observability tooling, such as Datadog Knowledge of containerisation concepts and infrastructure best practices Some experience with TypeScript or JavaScript for scripting and CDK purposes Strong troubleshooting ...

Principal Full Stack Engineer & Architecture Lead

Hiring Organisation
Command Recruitment
Location
London, United Kingdom
Employment Type
Permanent
Salary
£100000 - £110000/annum
technical design decisions Define scalable, secure, and maintainable engineering standards Provide technical leadership across frontend, backend, APIs, infrastructure, and integrations Drive platform scalability, resilience, observability, and performance Partner with leadership teams to align technical strategy with business goals Act as the senior technical authority for complex engineering decisions Hands … Gateway, EventBridge, SQS, Step Functions, S3, CloudWatch, RDS) Backend Node.js, TypeScript Frontend React, Next.js, Tailwind CSS Data & Architecture PostgreSQL, Serverless, Event-Driven Microservices DevOps & Observability Terraform/AWS CDK, CI/CD, Monitoring & Logging About You We are looking for a technically strong and commercially minded engineering leader with: 8+ ...

Principal Developer & Team Lead

Hiring Organisation
Cambridge University Press & Assessment
Location
Cambridge, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£65,000
programme your own delivery to design, build and own. Around that, you'll lead the wider migration to AWS, build the DevOps automation and observability that lets SRE practices take hold, and establish the standards for how we use AI responsibly in education products. You'll set the technical … worked with AWS (or an equivalent cloud) in anger, not just touched it You understand CI/CD, infrastructure as code, and what observability actually means in production You can hold a conversation about event-driven architecture, microservices and security in cloud environments at a level beyond the textbook ...

Senior Software Engineer - DevOps

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Implementing infrastructure as code and improving automation across environments Troubleshooting and resolving complex build, deployment and production issues across application and infrastructure layers Improving observability, reliability and performance of internal platforms and production systems Partnering with engineering teams to define best practices for deployment, release management and cloud architecture Contributing … principles Experience working with GitHub, including workflow automation and repository management Experience with infrastructure as code and automated environment management Strong understanding of reliability, observability and operational best practices Ability to debug complex systems and work effectively across multiple engineering teams Why Deliveroo Our mission is to transform ...

Senior Full Stack Java Developer (Legacy Modernization & Cloud Migration)

Hiring Organisation
Vaco LLC
Location
Charlotte, North Carolina, United States
Employment Type
Permanent
Salary
USD Annual
database solutions (Oracle, SQL Server, PostgreSQL, MongoDB). Collaboration: Work closely with cross-functional teams to ensure seamless integration, data integrity, and system observability (Sumo Logic). Technical Leadership: Mentor junior developers, review code, and contribute to best practices for Java, Angular, and DevOps. Required Skills & Experience 5+ years … e.g., VB to Java, on-prem to cloud). Languages: Fluent in English and Spanish. Nice-to-Have Experience with Sumo Logic or similar observability tools. Familiarity with Windows Server to Linux migrations. Basic Python scripting for automation. What We Offer Remote-first work environment. Competitive salary and benefits. Opportunities ...

Principal Platform Engineer (Edge)

Hiring Organisation
Jobleads-UK
Location
Bristol, England, United Kingdom
peer coordination, and systems that can operate independently during disconnection. You are the sort of engineer who thinks carefully about failure modes, deployment risk, observability, workload lifecycle, service discovery, and operational simplicity. You care about building robust abstractions that allow application teams to securely deploy workloads without needing to understand … limited connectivity. Working on security baselines for edge nodes, including secure boot, hardware‐rooted identity, attestation, and the runtime isolation of workloads. Building observability, logging, and telemetry capabilities that work when bandwidth is scarce and devices are intermittently reachable. Designing zero‐touch onboarding and provisioning flows so devices come online ...

Senior Onboarding Engineering | 6 month Contract

Hiring Organisation
Novatus
Location
London Area, United Kingdom
Novatus is a Series B scale-up RegTech SaaS provider and boutique advisory practice, enabling financial services firms to solve complex challenges and redefine what’s possible through expert-led technology and consulting. Across both ...

Senior SRE Lead

Hiring Organisation
Albany Beck
Location
London Area, United Kingdom
about capability build, technical excellence, and delivering meaningful change within complex enterprise environments. Role Overview Albany Beck is seeking a Senior SRE Lead/Observability SME to lead the establishment of a new enterprise Site Reliability Engineering (SRE) capability, with a primary focus on designing and implementing a modern observability … suite and operational resilience framework. This is a foundational build role, responsible for defining how reliability engineering and observability are structured, measured, and embedded across a complex global technology estate. The successful candidate will play a key role in shifting the organisation from reactive operational support to a metrics-driven ...

Field CTO EMEA

Hiring Organisation
Jobleads-UK
Location
Maidenhead, England, United Kingdom
Engineering, platform teams, and business stakeholders. Translate customer business goals into compelling transformation strategies powered by Dynatrace. Lead high-impact technical discovery and executive conversations around observability, cloud modernization, AI adoption, security, automation, and business outcomes. Shape account strategy with Sales and Solution Engineering teams for complex, multi-stakeholder deals. Develop board-level … executive-level narratives that connect platform capabilities to risk reduction, operational excellence, digital experience, and growth. Guide customers on modern observability and security operating models, including platform engineering, SRE, DevSecOps, and AI-assisted operations. Support large opportunities by validating architecture direction, differentiation, value realization, and long-term platform vision. Influence go-to-market ...

Agentic AI Data Architect

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
ModelOps - Azure AI Foundry (model hosting, versioning, monitoring); Evaluation frameworks (LLM-as-judge, test datasets); Prompt/version control, cost/latency monitoring DevOps & Observability - CI/CD pipelines (Azure DevOps/GitHub Actions); Logging, monitoring, observability (App Insights, etc.); Performance tuning and scalability As part of a leading global ...

Data Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
work from home 2 days per week. This is a high-impact role focused on improving data quality, reducing incidents, and building scalable observability across a modern enterprise data platform. You’ll help ensure data across the organisation is accurate, reliable, and trusted for critical business decision-making. … style roles, with strong SQL and Python skills and experience working in modern cloud-based data environments. Hands‐on experience with data observability tools such as Grafana, Monte Carlo, or Acceldata, and data governance/quality platforms like Informatica, Collibra or Microsoft Purview is highly desirable. Experience within the Azure ...

Principal Full Stack Engineer & Architecture Lead

Hiring Organisation
BCT Resourcing
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £90,000 per annum
technical design decisions * Define scalable, secure, and maintainable engineering standards * Provide technical leadership across frontend, backend, APIs, infrastructure, and integrations * Drive platform scalability, resilience, observability, and performance * Partner with leadership teams to align technical strategy with business goals * Act as the senior technical authority for complex engineering decisions Hands-On Engineering … Lambda, API Gateway, EventBridge, SQS, Step Functions, S3, CloudWatch, RDS) Backend Node.js, TypeScript Frontend React, Next.js, Tailwind CSS Data & Architecture PostgreSQL, Serverless, Event-Driven Microservices DevOps & Observability Terraform/AWS CDK, CI/CD, Monitoring & Logging About You We are looking for a technically strong and commercially minded engineering leader with: * 10+ years of software ...

Senior Software Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
flexibility, simplicity and delivery speed Build and maintain backend services and integrations that support our insurance journeys Work with infrastructure, CI/CD and observability to help the team ship safely and often Partner with product, design and data to turn ambiguous opportunities into concrete, measurable improvements Raise the technical … similar Testing: integration and end-to-end testing, component story testing, and visual regression testing CI/CD: Automated testing and deployment pipelines Observability: Analytics platforms, error monitoring and performance tracking Cloudflare experience, including Workers, CDN or load balancing Builder.io or other visual/content tooling experience ...

Director of Software Engineering

Hiring Organisation
Spire
Location
Glasgow, Scotland, United Kingdom
hands-on: review code, prototype solutions, and get into the details when it matters Establish engineering standards across code quality, system design, testing, and observability, and hold the team to them Be the person engineers come to when the problem is genuinely hard Team Building & Culture Recruit, develop, and retain … Experience writing performance software in Rust Background in space systems, aerospace, or highly constrained real-time environments Experience building data lakes, telemetry platforms, or observability infrastructure at scale A history of leading teams through technical transformations and not just maintaining the status quo Spire operates a hybrid work model ...

Cloud Architect

Hiring Organisation
Tata Consultancy Services
Location
Luton, England, United Kingdom
least privilege, KMS encryption, secrets management, data classification, PII redaction, prompt/response filtering, and model governance. Drive non-functional requirements: reliability, scalability, latency, observability, DR, and cost controls (FinOps) for GenAI workloads. Guide build teams through solution design, reviews, and implementation; produce architecture artefacts (HLD/LLD), patterns … more languages (Python/Node.js preferred) and infrastructure-as-code (CDK/CloudFormation/Terraform) for repeatable deployments. Experience setting up observability for GenAI: tracing, logging, metrics, and model/application performance dashboards. Excellent communication skills for architecture storytelling, stakeholder management, and client-facing workshops. Rewards & Benefits TCS is consistently ...