201 to 225 of 228 Remote/Hybrid Observability Jobs

Sr. AI Engineer, AI.x

Hiring Organisation
Charles Schwab
Location
Daly City, California, United States
Employment Type
Permanent
Salary
USD Annual
bring curiosity, creativity, and technical depth to help shape the next era of AI at Schwab, with a special emphasis on site reliability, monitoring, observability, and operations. You'll ensure that the systems you build are robust, reliable, and well-monitored, implementing best practices for observability and operational excellence … uptime for mission-critical AI applications. Design, build, and deliver GenAI applications that elevate client experience and generate business impact. Champion reliability, monitoring, observability, and operational best practices for AI systems and data pipelines. Collaborate with cross-functional teams to align solutions with enterprise strategy and technical standards. Mentor ...

Cloud Security and Platform Engineer

Hiring Organisation
RealityMine
Location
Trafford Park, Greater Manchester, UK
mainly focused on AWS, with growing involvement in other cloud and SaaS platforms. You’ll improve existing environments—managing identity and access, governance, security, observability, and lifecycle—by reducing risks, eliminating unsafe configurations, validating ownership, and ensuring the cloud estate is clearly governed and auditable. You will take an active … role in improving RealityMine’s security posture by improving and operating security scanning, improving monitoring and observability, and ensuring risks, vulnerabilities, and end of life components are identified and addressed in a timely and pragmatic way. You will also develop automation used to support security and operational hygiene, reducing manual ...

Cloud Security and Platform Engineer

Hiring Organisation
RealityMine
Location
Trafford Park, England, United Kingdom
mainly focused on AWS, with growing involvement in other cloud and SaaS platforms. You’ll improve existing environments—managing identity and access, governance, security, observability, and lifecycle—by reducing risks, eliminating unsafe configurations, validating ownership, and ensuring the cloud estate is clearly governed and auditable. You will take an active … role in improving RealityMine’s security posture by improving and operating security scanning, improving monitoring and observability, and ensuring risks, vulnerabilities, and end of life components are identified and addressed in a timely and pragmatic way. You will also develop automation used to support security and operational hygiene, reducing manual ...

Senior System Reliability Engineer - REMOTE FROM IRELAND

Hiring Organisation
Caspian One Ltd
Location
Ireland, Bedfordshire, United Kingdom
Employment Type
Permanent
Salary
EUR 125,000 - 175,000 Annual
responsible for the reliability, performance, and operational excellence of a large-scale, bare-metal trading platform. This is a hybrid role combining systems engineering, observability, automation, and real-time operational support. You'll work across the full stack (Linux, networking, applications, hardware) and play a key role in building … resolve issues across OS, network, hardware, and application layers Build and improve automation, tooling, and configuration management (Ansible or similar) Develop and maintain observability dashboards, alerts, and telemetry pipelines Participate in deployments, start-up/shutdown procedures, and change management Contribute to engineering projects such as OS tuning, kernel-level ...

AI Architect

Hiring Organisation
Stackstudio Digital Ltd
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
From £450 to £500 per day
into high value solutions Enforce IAM least privilege with IAM Conditions, organisation policies, and scoped service accounts; integrate BeyondCorp for zero trust access Operationalise observability using Cloud Logging, Cloud Monitoring, Error Reporting, Trace, and Profiler; build model/LLM telemetry dashboards and alerts Identify the right AI/ML frameworks … patterns, vector databases, embeddings, and prompt/guardrail engineering Desirable Skills/Knowledge/Experience Knowledge of MLOps/AgentOps, CI/CD, and observability Strong understanding of regulated financial services environments Proven experience implementing AI risk controls, model governance, and auditability Ensure alignment with FCA, PRA, data privacy, model ...

SRE Manager / Ops Manager

Hiring Organisation
Infoplus Technologies UK Ltd
Location
Wokingham, Berkshire, South East, United Kingdom
Employment Type
Contract, Work From Home
Contract Rate
From £400 to £450 per day
Incidents, service risks, and operational failures. Service Reliability & Operations (SRE Focus) Define, own, and govern SLOs and service health metrics. Ensure proactive monitoring, alerting, and observability across the estate. Lead blameless post-incident reviews, root cause analysis, and preventative actions. BAU Team Leadership Lead and manage multiple BAU teams, potentially covering … Service Management teams in complex environments. Proven accountability for 24x7 BAU services at scale. Deep understanding of: Incident & problem management Monitoring & observability Change & release control Experience working across cloud, applications, data, and integrations. Strong stakeholder and escalation management skills. Desirable Background in Site Reliability Engineering or DevOps-led operations. Knowledge ...

AI Engineer

Hiring Organisation
MarkIT Placements
Location
Didcot, Oxfordshire, South East, United Kingdom
Employment Type
Permanent, Work From Home
execution Deploy AI systems into cloud, on-premises, and air-gapped environments Build production-ready pipelines from data ingestion through to inference Experience with observability for AI systems, including agent behaviour, model performance, and failure modes Collaborate with engineers, product leads, and customers to translate requirements into working systems Contribute … with edge or offline AI deployments Familiarity with Kubernetes (EKS/OpenShift) for monitoring and managing deployed applications MLOps experience - model evaluation, monitoring, reproducibility Observability tooling for agentic systems (model drift, agent behaviour, performance monitoring) Experience with agent orchestration patterns and inter-agent communication protocols (e.g. A2A) Familiarity with MCPs ...

Lead AI Engineer

Hiring Organisation
Capco
Location
Manchester, North West England, United Kingdom
LLMs and multi-modal models at scale Strong engineering background in Python with proven backend and API development skills Solid understanding of scalable MLOps, observability, and cloud-native AI deployment Excellent communication, problem-solving, and project management skills in agile environments Bonus Points For Experience with agentic frameworks (e.g., LangChain … LlamaIndex) Experience in deep learning frameworks and front-end development Familiarity with Langfuse, Langsmith, or other LLM observability tools Understanding of Model Context Protocol and bias/hallucination mitigation techniques Previous success in integrating GenAI solutions into enterprise-scale systems Why Join Capco Deliver high-impact technology solutions for Tier ...

Solution Architect Digital Client Self Service Enterprise API Journeys

Hiring Organisation
Capgemini
Location
London, South East England, United Kingdom
wide standards around API modelling, documentation, versioning, and governance Drive adoption of cloud-native approaches, ensuring solutions leverage appropriate cloud services, resilience patterns, and observability frameworks Your Skills: Strong experience designing digital client self-service platforms, preferably in financial services or corporate banking Proven delivery of E2E cross-functional journeys … experience with microservices, Java Spring Boot, and cloud-native deployment patterns Strong understanding of OAuth2, OIDC, FAPI, and enterprise security controls Knowledge of observability and monitoring tools such as Splunk or ELK Experience with public cloud platforms, ideally AWS Understanding of integration and event-driven patterns, e.g. Kafka ...

Platform Engineer

Hiring Organisation
Wave Talent
Location
London Area, United Kingdom
scale Owning foundational services - SSO, authentication, notifications, and core shared services Integrating AI tooling into the platform infrastructure - greenfield, high-ownership work Driving observability and DevX improvements - internal tooling, developer workflows, reducing friction across the org Setting the patterns and guardrails that enable engineering autonomy across the business ✅ Must-haves … Bonus points for: Experience with Pulumi or CDK Previous work on internal developer platforms or DevX tooling Exposure to AI/LLM infrastructure integration Observability tooling (Datadog, OpenTelemetry, etc. ...

Head of Software Engineering

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
with Product and Design as part of a leadership trio, shaping vision and outcomes. Establish modern engineering standards (cloud-first, CI/CD, automation, observability, secure SDLC). Drive operational excellence across performance, resilience, and security. Build and scale a multi-site engineering organisation, embedding a culture of ownership … architectures, and distributed systems. Strong knowledge of Web, Mobile, FE technologies such as JavaScript, React, Kotlin, .NET, Azure. Experience implementing CI/CD pipelines, observability, and secure engineering practices. Track record of scaling teams and delivering in fast-paced, evolving environments. Experience working in or with startup/scale ...

Senior Platform Owner

Hiring Organisation
Skipton Building Society
Location
Skipton, North Yorkshire, Yorkshire, United Kingdom
Employment Type
Permanent, Work From Home
with autonomy and purpose across App, Portal and Identity capabilities, supported by the Society's modern engineering ecosystem, including automation-first CI/CD pipelines, observability by default, decoupled architecture, cloud platforms, API and event-streaming foundations. Working in partnership with Product Owners, Platform Leads, Engineers, Security, Architecture and Data teams … Working closely with Engineering, Architecture, Security, Data and Product, you'll guide technology adoption, steward platform budgets, optimise resource allocation, and champion automation-first and observability-by-default practices. You'll mentor cross-functional teams, elevate craft and capability, and embed healthy platform governance, risk management, continuous improvement and incident/problem ...

Quant Strategist

Hiring Organisation
Deutsche Bank
Location
London, South East England, United Kingdom
group, which is part of Risk Strats and delivers applications that solve quantitative problems for businesses across Deutsche Bank covering Independent Price Verification (IPV), Observability and Levelling, Prudent Valuation (PruVal) and Fair Value reserves. The design, specification and implementation are the responsibilities of Strats in close partnership with Trading, Sales … development of efficient front-to-back functionality to deliver to business requirements in C++ and Python, with a focus on valuation controls including IPV, observability and levelling, fair value reserves, and prudential valuation Responsible for designing and re-engineering platform analytics and tooling to improve efficiency of valuation control processes ...

Senior Software Engineer

Hiring Organisation
In Product
Location
United Kingdom
Senior Software Engineer (Backend) – London, Hybrid (2 days/week) – £80,000-£90,000 plus Benefits – High Growth Startup We’re partnering with a fast-growing healthtech company on a mission to transform primary care. ...

Principal AI Engineer - UK (Remote)

Hiring Organisation
NST Recruitment Ltd
Location
United Kingdom
Employment Type
Permanent, Work From Home
Principal AI Engineer AI, LLM, Python, CI/CD, SaaS/PaaS, Prompt Engineering, Agentic Workflows, Platform Systems, Remote (UK) Excellent Salary + Benefits This is a fantastic Principal AI Engineer opportunity to work with ...

SRE Operations Transformation Lead

Hiring Organisation
Adecco
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£900 - £1000/day
including major incident triage, root cause analysis, and post-incident reviews. Strengthen problem management processes to reduce repeat incidents and mitigate technical debt risks. Observability and Tooling Enablement Set observability standards across logs, metrics, traces, and dashboards to enhance service detection and restoration times. Partner with platform and tooling teams … influence and communicate effectively at senior levels. Preferred Qualifications: Experience in payments, cash management, or other high-availability banking platforms. Familiarity with designing observability approaches and enhancing alert quality across diverse portfolios. Experience in building SRE communities of practice and structured training programmes. Knowledge of enterprise service management tools ...

Senior Frontend Developer

Hiring Organisation
Pentasia
Location
London Area, United Kingdom
platforms. - Lead or contribute to technical initiatives spanning multiple systems. - Mentor team members and support knowledge sharing and hiring activities. - Implement testing, monitoring, and observability best practices. - Contribute to secure development practices and compliance requirements where applicable. Desired experience: - Proven experience in a senior engineering role with ownership of complex … cloud environments (AWS preferred). - Familiarity with CI/CD pipelines, automated testing, and modern delivery practices. - Experience with performance optimisation, debugging, and observability tools. - Exposure to legacy systems and modernisation projects is advantageous. - Knowledge of secure coding practices and common security standards. - Experience mentoring engineers or influencing technical decisions ...

Senior Frontend Developer

Hiring Organisation
Pentasia
Location
Newcastle Upon Tyne, England, United Kingdom
platforms. - Lead or contribute to technical initiatives spanning multiple systems. - Mentor team members and support knowledge sharing and hiring activities. - Implement testing, monitoring, and observability best practices. - Contribute to secure development practices and compliance requirements where applicable. Desired experience: - Proven experience in a senior engineering role with ownership of complex … cloud environments (AWS preferred). - Familiarity with CI/CD pipelines, automated testing, and modern delivery practices. - Experience with performance optimisation, debugging, and observability tools. - Exposure to legacy systems and modernisation projects is advantageous. - Knowledge of secure coding practices and common security standards. - Experience mentoring engineers or influencing technical decisions ...

Principal Engineer

Hiring Organisation
Skipton Building Society
Location
Skipton, North Yorkshire, Yorkshire, United Kingdom
Employment Type
Permanent, Work From Home
with autonomy and purpose across App, Portal and Identity capabilities, supported by the Society's modern engineering ecosystem, including automation-first CI/CD pipelines, observability by default, decoupled architecture, cloud platforms, API and event-streaming foundations. Working in partnership with Product Owners, Platform Leads, Engineers, Security, Architecture and Data teams … Working closely with Engineering, Architecture, Security, Data and Product, you'll guide technology adoption, steward platform budgets, optimise resource allocation, and champion automation-first and observability-by-default practices. You'll mentor cross-functional teams, elevate craft and capability, and embed healthy platform governance, risk management, continuous improvement and incident/problem ...

Domain Consolidation Analyst

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
ITRS, we make society's critical technology work. Our mission is to deliver automated and holistic IT observability solutions that safeguard critical applications and enable innovation. We are the only monitoring and observability platform designed for the most demanding and regulated industries — trusted by 90% of Tier 1 capital markets ...

React Developer

Hiring Organisation
Robert Half
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
Salary negotiable
retries, resilience, caching and invalidation Performance engineering: Drive render efficiency, bundle size control, code splitting, virtualisation for large datasets, network efficiency, and instrumentation/observability API contract collaboration: Work with backend engineers on predictable API contracts via BFF (pagination, aggregation shapes, response models, error semantics) Security-by-design delivery: Apply … feature boundaries, shared UI patterns, conventions) Comfortable establishing foundations early (linting/formatting, project structure, testing strategy, patterns for API access, error handling, and observability hook) Experience shipping complex interface: tables/grids, dashboards, rich filters, multi-step workflows, and interaction-heavy UIs Familiarity with performance constraints caused by large ...

Data Engineering Manager

Hiring Organisation
Skyscanner
Location
London (City of London), South East England, United Kingdom
search, social and programmatic. In other words, not just dashboards... but decisions. Along the way, you'll help evolve our data platform, improving scalability, observability and governance within a modern cloud environment. You'll partner across Marketing, Product, Analytics and Data Science to turn complex data into clear, actionable direction. … Partnering cross-functionally: You'll work closely with Marketing, Product, Analytics and Marketing Technology to shape and deliver the data roadmap. Improving data quality & observability: You'll champion reliable, trustworthy datasets with strong SLAs and clear monitoring. Balancing speed and sustainability: You'll navigate the trade-offs between rapid delivery ...

Flight Software Engineer

Hiring Organisation
IO Associates
Location
Reading, Berkshire, South East, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£75,000
Exciting Opportunity: Flight Software Engineer (Perm/Fixed Term) Our Client, a pioneering leader in the space industry, is shaping the future of space exploration and satellite technology. Known for their cutting-edge approach to ...

Global Marketing Manager - Digital Experience Monitoring

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Global Marketing Manager - Digital Experience Monitoring At ITRS, we make society's critical technology work. Our mission is to deliver automated and holistic IT observability solutions that safeguard critical applications and enable innovation. We are the only monitoring and observability platform designed for the most demanding and regulated industries — trusted … optimise marketing mix balancing efficiency with brand building Requirements 5+ years of B2B technology marketing experience with focus on SaaS or monitoring/observability solutions Experience with product-led growth strategies and conversion optimization Understanding of digital experience monitoring, synthetic monitoring, or related technologies Proven track record of demand generation ...

AI Engineer - Outside IR35 - Remote

Hiring Organisation
Tenth Revolution Group
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£350 - £400 per day
REST APIs Ship production-ready agents with reliability, monitoring, and performance in mind Integrate agents with external tools, APIs, and MCP servers Establish observability for AI systems, including tracing, cost monitoring, and failure detection Develop evaluation and testing frameworks for agent performance and reliability Collaborate with product and engineering teams … track record of shipping AI agents to production environments Preferred Qualifications Experience with MCP server development or agent-to-tool integration frameworks Familiarity with observability tools for AI workloads (e.g., tracing, cost tracking, failure analysis) Exposure to e-commerce or direct-to-consumer (DTC) product environments Experience building agent evaluation ...