1,076 to 1,100 of 1,270 Observability Jobs

Senior Software Engineer, Banking Connectivity London, UK

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
that high-integrity financial data is correctly distributed across internal systems. Your work will focus on scaling integrations while improving the system’s resilience, observability, and overall structure. You will play a key role in evolving the platform to support new banking partners, products, and regulatory requirements while addressing technical … real‐world banking constraints Collaborate with product, operations, and external partners to unblock integrations and accelerate delivery Improve system quality through pragmatic enhancements in observability, testing, and resilience. This is a high‐impact role. What You'll Bring Experience building and supporting reliable backend systems with external integrations (APIs, webhooks ...

Senior Software Engineer / Reliability Engineering - Real-time Data

Hiring Organisation
Jobleads-UK
Location
City Of London, England, United Kingdom
Build and maintain production-grade software supporting Bloomberg’s global distribution infrastructure Design and implement scalable, fault-tolerant systems with a focus on observability, performance, and automation Analyse system behaviour under real-world and failure scenarios to validate capacity, failover, and recovery meet resilience objectives Identify bottlenecks, scaling limits … Work With Configuration systems serving thousands of servers across the global network Service discovery and clustering systems for distributed infrastructure Monitoring and observability frameworks for large-scale server estates Tooling for diagnosing data quality and distribution issues Ownership of systems may evolve over time as the team focuses on areas ...

Group Head of Engineering

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
club platforms, aligned to transformation priorities Set clear architectural direction and embed modern engineering standards (cloud-first, CI/CD, automated testing, observability, secure SDLC) Own end‐to‐end delivery outcomes, ensuring valuable increments are shipped frequently, safely, and predictably Drive operational excellence across reliability, resilience, performance, and security Establish … continuous improvement Experience Senior engineering leader with strong hands‐on technical credentials. Deep experience across cloud-first architectures, distributed systems, CI/CD, observability, and secure SDLC. Experience delivering AI-enabled capabilities into production environments. Proven track record of improving reliability and leading incident response and prevention. Experience scaling engineering ...

Senior Director, Master Data Management

Hiring Organisation
Jobleads-UK
Location
Northampton, England, United Kingdom
manage the MDM product/platform team (product, engineering, data quality, metadata/lineage). Implement DataOps for MDM (CI/CD, automated testing, observability, change control, incident/problem management). Deliver golden record services (match/merge/survivorship, hierarchy management) and reference data services. Define integration architecture … merge/survivorship, hierarchy & reference data management, quality management, metadata & lineage. Hands‐on familiarity with DataOps (CI/CD for data, automated data testing, observability), microservices, and event streaming patterns (e.g., CDC, pub/sub). Experience with enterprise data catalogs, lineage tooling, and at least one MDM platform (commercial ...

Automation Engineer

Hiring Organisation
RealityMine
Location
Trafford Park, Greater Manchester, UK
test automation frameworks (including our AI-assisted tools), scripting (e.g. Python and JavaScript), CI/CD tooling and our internal observability tools to design and execute automated test suites, manage device infrastructure, and provide fast, reliable feedback to product and engineering teams. Our offices are in Trafford Park, Manchester … managing or using a device farm solution (e.g. AWS Device Farm, Firebase Test Lab, BrowserStack, Sauce Labs, or an internal farm). · Familiarity with observability and monitoring for test and device infrastructure (logs, metrics, dashboards, alerts). · Knowledge of mobile platform internals (Android/iOS), SDK integration testing, or backend ...

Automation Engineer

Hiring Organisation
RealityMine
Location
Trafford Park, England, United Kingdom
test automation frameworks (including our AI-assisted tools), scripting (e.g. Python and JavaScript), CI/CD tooling and our internal observability tools to design and execute automated test suites, manage device infrastructure, and provide fast, reliable feedback to product and engineering teams. Our offices are in Trafford Park, Manchester … managing or using a device farm solution (e.g. AWS Device Farm, Firebase Test Lab, BrowserStack, Sauce Labs, or an internal farm). · Familiarity with observability and monitoring for test and device infrastructure (logs, metrics, dashboards, alerts). · Knowledge of mobile platform internals (Android/iOS), SDK integration testing, or backend ...

Senior Software Engineer / Reliability Engineering - Real-time Data

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Build and maintain production-grade software supporting Bloomberg’s global distribution infrastructure Design and implement scalable, fault-tolerant systems with a focus on observability, performance, and automation Analyse system behaviour under real-world and failure scenarios to validate capacity, failover, and recovery meet resilience objectives Identify bottlenecks, scaling limits … Work With Configuration systems serving thousands of servers across the global network Service discovery and clustering systems for distributed infrastructure Monitoring and observability frameworks for large-scale server estates Tooling for diagnosing data quality and distribution issues Ownership of systems may evolve over time as the team focuses on areas ...

Principal Machine Learning Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Anaplan\'s platform and third-party integrations Optimise model inference pipelines for performance, cost, and scalability in production environments Implement monitoring, logging, and observability for GenAI systems to track usage, errors, and model behaviour Collaborate with data scientists to productionise ML models and forecasting algorithms Your Skills Extensive hands … Experience with A/B testing and experimentation frameworks for AI features Contributions to open-source ML projects or research publications Experience with model observability tools (LangSmith, W&B;, MLflow) DEIB Our Commitment to Diversity, Equity, Inclusionand Belonging (DEIB) We believe attracting and retaining the best talent and fostering ...

EMEA VP of AI-Observability Sales

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Snowflake is seeking a Sales Leader for the EMEA region to build and lead a high-performing sales team focused on AI-driven observability solutions. The ideal candidate will have over 10 years of experience in cloud and enterprise software sales, with a track record of managing successful sales teams. … This role offers a unique opportunity to shape the future of data observability in a fast-growing environment. A BA/BS degree is required, alongside strong leadership and coaching skills. #J-18808-Ljbffr ...

Data Engineer

Hiring Organisation
HCLTech
Location
London Area, United Kingdom
HCLTech is a global technology company, home to more than 220,000 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by a broad portfolio of technology services ...

Principal AI Engineer - UK (Remote)

Hiring Organisation
NST Recruitment Ltd
Location
United Kingdom
Employment Type
Permanent, Work From Home
Principal AI Engineer Generative AI, LLMs, Python, CI/CD, SaaS/PaaS, Prompt Engineering, Agentic Workflows, Platform Systems, Remote (UK) Up to £200,000 + Equity + Benefits This is a fantastic Principal AI ...

Senior Software Engineer

Hiring Organisation
In Product
Location
City of London, London, United Kingdom
Tech Lead – London, Hybrid (2 days/week) – £100,000-£120,000 plus Benefits – High Growth Startup We’re partnering with a fast-growing healthtech company on a mission to transform primary care. Their platform ...

SRE Lead: Automation, Observability & Reliability

Hiring Organisation
Jobleads-UK
Location
Bromley, England, United Kingdom
Huxley is seeking an experienced SRE Lead to oversee SRE strategy within an investment banking environment. The role focuses on driving automation, improving observability, and enhancing reliability by design. Ideal candidates will possess over 8 years of SRE experience, particularly in resilience engineering, and demonstrable skills in scaling operations. This ...

Site Reliability Engineering Manager

Hiring Organisation
Gravitas Recruitment Group (Global) Ltd
Location
City of London, London, United Kingdom
Engineering Manager to lead their Site Reliability function. The Role This is a blended people leadership and technical role, responsible for operational excellence, observability, and reliability at scale across a platform that serves millions of users. You'll own incident management processes, drive reliability engineering standards, and ensure the business … maintains its exceptionally high availability targets. Key Responsibilities Own monitoring, alerting and observability strategy, ensuring product teams have high reliability confidence and fast incident detection and resolution Lead and standardise incident management processes, maintaining a culture of accountability, transparency and continuous learning Define reliability patterns and standards to reduce cascading ...

Software Engineer - Python/Go

Hiring Organisation
Atarus
Location
United Kingdom
Doing ⚙️ • Building and scaling backend platform services powering real-time AI products • Breaking down monolithic systems into high-performance distributed services • Improving observability, deployment processes, and platform reliability • Working across LLM, speech, and telephony infrastructure • Optimising systems for low latency and high throughput workloads • Improving internal tooling and overall developer … system design, scalability, and reliability Tech 🧱 • Python/Go • Distributed systems & microservices • LLMs, speech systems, and real-time AI infrastructure • Modern cloud, deployment, and observability tooling Why It’s Interesting 💡 • High-impact engineering role within a rapidly scaling AI company • Opportunity to work on genuinely difficult real-time systems problems ...

Senior Engineering Manager

Hiring Organisation
Myn
Location
Manchester, England, United Kingdom
closely with product and senior stakeholders to maintain clarity on priorities Help shape engineering practices across areas such as CI/CD, automation and observability Drive improvements in delivery flow, release processes and overall engineering effectiveness Contribute to improving reliability, scalability and overall system health Manage cross-team dependencies, risks … distributed systems, production scale and operational complexity Track record of improving delivery across multiple teams Familiarity with modern engineering practices (CI/CD, automation, observability) Comfortable working across teams, stakeholders and organisational boundaries Strong leadership and communication skills Nice to have Experience in financial services, fintech or other complex, regulated ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
North London, London, United Kingdom
Employment Type
Permanent
Salary
£50,000
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, modern technology estate. Responsibilities … improve speed, accuracy and consistency Supporting major changes, deployments and post-incident reviews with data-driven evidence Qualifications Strong experience with monitoring and observability tools (LogicMonitor, Azure Monitor, App Insights, Log Analytics, Defender for Cloud) Excellent understanding of cloud performance, IaaS/PaaS, networking fundamentals, API performance and capacity modelling ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
Birchanger, Hertfordshire, United Kingdom
Employment Type
Permanent
Salary
GBP 40,000 - 50,000 Annual
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, modern technology estate. Responsibilities … improve speed, accuracy and consistency Supporting major changes, deployments and post-incident reviews with data-driven evidence Qualifications Strong experience with monitoring and observability tools (LogicMonitor, Azure Monitor, App Insights, Log Analytics, Defender for Cloud) Excellent understanding of cloud performance, IaaS/PaaS, networking fundamentals, API performance and capacity modelling ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
Stansted, Birchanger, Essex, United Kingdom
Employment Type
Permanent
Salary
£40000 - £50000/annum
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, modern technology estate. Responsibilities … improve speed, accuracy and consistency Supporting major changes, deployments and post-incident reviews with data-driven evidence Qualifications Strong experience with monitoring and observability tools (LogicMonitor, Azure Monitor, App Insights, Log Analytics, Defender for Cloud) Excellent understanding of cloud performance, IaaS/PaaS, networking fundamentals, API performance and capacity modelling ...

Head of Technology (Germany Onsite)

Hiring Organisation
Questhiring
Location
England, United Kingdom
execution. Drive implementation of LLM orchestration frameworks such as LangGraph, CrewAI, AutoGen, Semantic Kernel, or equivalent platforms. Establish best practices for AI evaluations, guardrails, observability, governance, safety, and reliability. Own the complete lifecycle of AI systems including experimentation, validation, production deployment, monitoring, and continuous optimization. Engineering & Platform Leadership Lead engineering … enabled experiences. Drive modernization of legacy platforms into cloud-native, API-first, AI-ready architectures. Ensure engineering excellence through scalable system design, automation, observability, and operational rigor. Guide teams in building highly available, secure, and resilient distributed systems. AI Platform & Infrastructure Build enterprise-grade AI platforms supporting model serving, vector ...

Engineering Manager

Hiring Organisation
Myn
Location
Manchester, England, United Kingdom
delivery progress Work closely with product and stakeholders on priorities, scope and trade-offs Support adoption of modern engineering practices (CI/CD, automation, observability) Drive improvements in flow, release processes and overall delivery effectiveness Contribute to system reliability, scalability and operational performance Collaborate with senior engineers and architects … environments with scale, distributed systems or complexity Track record of improving team delivery and performance Familiarity with modern engineering practices (CI/CD, automation, observability) Experience working in agile environments and collaborating with product stakeholders Strong leadership, communication and organisational skills Nice to have Experience in complex or regulated environments ...

Senior Frontend Developer

Hiring Organisation
Pentasia
Location
City of London, London, United Kingdom
platforms. - Lead or contribute to technical initiatives spanning multiple systems. - Mentor team members and support knowledge sharing and hiring activities. - Implement testing, monitoring, and observability best practices. - Contribute to secure development practices and compliance requirements where applicable. Desired experience: - Proven experience in a senior engineering role with ownership of complex … cloud environments (AWS preferred). - Familiarity with CI/CD pipelines, automated testing, and modern delivery practices. - Experience with performance optimisation, debugging, and observability tools. - Exposure to legacy systems and modernisation projects is advantageous. - Knowledge of secure coding practices and common security standards. - Experience mentoring engineers or influencing technical decisions ...

Staff AI Security Engineer

Hiring Organisation
Spring Health
Location
Seattle, Washington, United States
Employment Type
Permanent
Salary
USD Annual
security guardrails and tooling that enable safe experimentation across engineering and business teams Establish AI-specific governance frameworks covering identity, access control, auditability, and observability Take ownership of and lead our AI Red Team to proactively identify vulnerabilities Design and implement AI observability pipelines to detect anomalous model behavior ...

Staff AI Security Engineer

Hiring Organisation
Spring Health
Location
San Francisco, California, United States
Employment Type
Permanent
Salary
USD Annual
security guardrails and tooling that enable safe experimentation across engineering and business teams Establish AI-specific governance frameworks covering identity, access control, auditability, and observability Take ownership of and lead our AI Red Team to proactively identify vulnerabilities Design and implement AI observability pipelines to detect anomalous model behavior ...

Senior Frontend Developer

Hiring Organisation
Pentasia
Location
Newcastle Upon Tyne, England, United Kingdom
platforms. - Lead or contribute to technical initiatives spanning multiple systems. - Mentor team members and support knowledge sharing and hiring activities. - Implement testing, monitoring, and observability best practices. - Contribute to secure development practices and compliance requirements where applicable. Desired experience: - Proven experience in a senior engineering role with ownership of complex … cloud environments (AWS preferred). - Familiarity with CI/CD pipelines, automated testing, and modern delivery practices. - Experience with performance optimisation, debugging, and observability tools. - Exposure to legacy systems and modernisation projects is advantageous. - Knowledge of secure coding practices and common security standards. - Experience mentoring engineers or influencing technical decisions ...