451 to 475 of 496 Observability Jobs in England

RVP, EMEA Sales - Observability

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
just to execute a function, but to help redefine the future of how work gets done. Observe by Snowflake brings AI-native observability to the Snowflake AI Data Cloud, helping engineering and data teams debug, optimize, and understand systems operating at massive scale. Traditional observability tools were not built … strong judgment, and the ability to align people, strategy, and execution across functions. WHAT WE LOOK FOR 10+ years of experience selling cloud, infrastructure, observability, data platforms, or enterprise software. 2+ years of experience managing high-performing enterprise sales teams. Experience selling to senior technical and business stakeholders, including CIOs ...

Software Engineer (Prometheus / Grafana)

Hiring Organisation
SRT Marine Systems PLC
Location
Bristol, United Kingdom
Employment Type
Permanent
Salary
£50000 - £75000/annum
Software Engineer (Prometheus/Grafana) here at SRT, you will be part of a small team tasked with implementing an end-user observability visualisation. Currently, we have observability dashboards in place for our engineers, utilising Prometheus for metrics collection and Grafana for visualisation. This initiative aims to deliver a more … across multiple sites. We are fortunate to have a team of highly experienced engineers, including UX designers, who can provide support and guidance. Ourlead observability engineer will oversee and assist with your work throughout the project in the role of Software Engineer (Prometheus/Grafana). Key Responsibilities - Software Engineer ...

Software Engineer (Prometheus / Grafana)

Hiring Organisation
SRT Marine Systems PLC
Location
Birmingham, West Midlands (County), United Kingdom
Employment Type
Permanent
Salary
£50000 - £75000/annum
Software Engineer (Prometheus/Grafana) here at SRT, you will be part of a small team tasked with implementing an end-user observability visualisation. Currently, we have observability dashboards in place for our engineers, utilising Prometheus for metrics collection and Grafana for visualisation. This initiative aims to deliver a more … across multiple sites. We are fortunate to have a team of highly experienced engineers, including UX designers, who can provide support and guidance.Our lead observability engineer will oversee and assist with your work throughout the project in the role of Software Engineer (Prometheus/Grafana). Key Responsibilities - Software Engineer ...

Vice President - Full Stack Engineer (Java/Angular/AI)

Hiring Organisation
Robert Walters
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£85,000 - £100,000 per annum
APIs using modern engineering practices Contribute to system design, architecture discussions, and technical decision-making Build resilient, automated systems with strong focus on reliability, observability, and performance Work closely with engineering and product teams to deliver production-grade solutions Contribute to CI/CD, testing, monitoring, and operational improvements across … working with high-volume, scalable systems Familiarity with event-driven architecture and messaging systems Experience with cloud-native technologies, CI/CD pipelines, and observability tooling Strong hands-on engineering mindset and interest in modern development tooling, including AI-assisted workflows Robert Walters Operations Limited is an employment business ...

Site Reliability Engineer (SRE)

Hiring Organisation
UA Consulting
Location
City of London, London, United Kingdom
Employment Type
Contract
Contract Rate
From £300 to £400 per day
platform. Key Responsibilities Partner with development teams to define and manage SLOs/SLIs, and use error budgets to guide engineering decisions. Enhance observability ensuring metrics, logs, and tracing are in place to detect and fix issues proactively. Lead cost optimisation initiatives: monitor spend, rightsize workloads, tune autoscaling, and drive … with Kubernetes (on-prem and AWS EKS). Proven track record defining and working with SLOs/SLIs in production environments. Deep understanding of observability (metrics, logging, tracing, telemetry ...

Principal Cloud Engineer - Azure - Hybrid - Manchester

Hiring Organisation
Experis
Location
Manchester, United Kingdom
Employment Type
Permanent
Salary
£78000/annum + Excellent Bens
Implement governance, policy, and identity standards (Entra ID) Develop core platform capabilities, including: API Management (APIM) and Web Application Firewall (WAF) Logging, monitoring, and observability Introduce and scale Infrastructure as Code (Terraform) across the environment Contribute to the design and implementation of business continuity and disaster recovery strategies Support … working with: Azure landing zones and governance frameworks Infrastructure as Code (Terraform preferred) Identity and access management (Entra ID/Azure AD) Monitoring and observability tooling Experience working in environments undergoing cloud transformation Ability to operate across engineering and architecture, with a focus on practical implementation Strong communication skills ...

AI Engineering Director

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Build APIs, integrations, MCP Servers, and reusable platform capabilities to connect AI systems with enterprise platforms, tools, and workflows. Establish evaluation, experimentation, regression, and observability frameworks to continuously improve AI system quality, reliability, and agent behavior. Mentor senior engineers and influence engineering direction through code reviews, architecture discussions, technical standards … makers with compelling technical arguments. Preferred Qualifications, Capabilities, and Skills Experience with enterprise-scale AI platform development. Knowledge of industry-standard AI evaluation and observability frameworks. Expertise in cloud-native architectures and container orchestration. Proven track record of cross-functional collaboration and leadership. Familiarity with MCP protocols and enterprise integration ...

Senior Software Engineer, Banking Connectivity London, UK

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
that high-integrity financial data is correctly distributed across internal systems. Your work will focus on scaling integrations while improving the system’s resilience, observability, and overall structure. You will play a key role in evolving the platform to support new banking partners, products, and regulatory requirements while addressing technical … real‐world banking constraints Collaborate with product, operations, and external partners to unblock integrations and accelerate delivery Improve system quality through pragmatic enhancements in observability, testing, and resilience. This is a high‐impact role. What You'll Bring Experience building and supporting reliable backend systems with external integrations (APIs, webhooks ...

Senior Software Engineer / Reliability Engineering - Real-time Data

Hiring Organisation
Jobleads-UK
Location
City Of London, England, United Kingdom
Build and maintain production-grade software supporting Bloomberg’s global distribution infrastructure Design and implement scalable, fault-tolerant systems with a focus on observability, performance, and automation Analyse system behaviour under real-world and failure scenarios to validate capacity, failover, and recovery meet resilience objectives Identify bottlenecks, scaling limits … Work With Configuration systems serving thousands of servers across the global network Service discovery and clustering systems for distributed infrastructure Monitoring and observability frameworks for large-scale server estates Tooling for diagnosing data quality and distribution issues Ownership of systems may evolve over time as the team focuses on areas ...

Group Head of Engineering

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
club platforms, aligned to transformation priorities Set clear architectural direction and embed modern engineering standards (cloud-first, CI/CD, automated testing, observability, secure SDLC) Own end‐to‐end delivery outcomes, ensuring valuable increments are shipped frequently, safely, and predictably Drive operational excellence across reliability, resilience, performance, and security Establish … continuous improvement Experience Senior engineering leader with strong hands‐on technical credentials. Deep experience across cloud-first architectures, distributed systems, CI/CD, observability, and secure SDLC. Experience delivering AI-enabled capabilities into production environments. Proven track record of improving reliability and leading incident response and prevention. Experience scaling engineering ...

Lead Software Engineer (Golang)

Hiring Organisation
Sky
Location
TW75QD, Syon, Greater London, United Kingdom
Employment Type
Permanent
improving system design, and reducing long-term maintenance and operational cost Drive engineering quality across the lifecycle, including testing strategy, security, CI/CD, observability, release safety, and reliability of live systems Resolve complex technical risks, cross-team dependencies, and production issues, and lead improvements based on incidents, operational gaps … framework or library Experience leading architecture and delivery for complex distributed systems and customer-facing platforms Strong experience with testing strategy, CI/CD, observability, release engineering, incident management and reliability improvement in production environments Experience driving technical standards, mentoring engineers and influencing engineering decisions across teams Strong judgement, communication ...

Senior Director, Master Data Management

Hiring Organisation
Jobleads-UK
Location
Northampton, England, United Kingdom
manage the MDM product/platform team (product, engineering, data quality, metadata/lineage). Implement DataOps for MDM (CI/CD, automated testing, observability, change control, incident/problem management). Deliver golden record services (match/merge/survivorship, hierarchy management) and reference data services. Define integration architecture … merge/survivorship, hierarchy & reference data management, quality management, metadata & lineage. Hands‐on familiarity with DataOps (CI/CD for data, automated data testing, observability), microservices, and event streaming patterns (e.g., CDC, pub/sub). Experience with enterprise data catalogs, lineage tooling, and at least one MDM platform (commercial ...

Senior Software Engineer / Reliability Engineering - Real-time Data

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Build and maintain production-grade software supporting Bloomberg’s global distribution infrastructure Design and implement scalable, fault-tolerant systems with a focus on observability, performance, and automation Analyse system behaviour under real-world and failure scenarios to validate capacity, failover, and recovery meet resilience objectives Identify bottlenecks, scaling limits … Work With Configuration systems serving thousands of servers across the global network Service discovery and clustering systems for distributed infrastructure Monitoring and observability frameworks for large-scale server estates Tooling for diagnosing data quality and distribution issues Ownership of systems may evolve over time as the team focuses on areas ...

Principal Machine Learning Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Anaplan's platform and third-party integrations Optimise model inference pipelines for performance, cost, and scalability in production environments Implement monitoring, logging, and observability for GenAI systems to track usage, errors, and model behaviour Collaborate with data scientists to productionise ML models and forecasting algorithms Your Skills Extensive hands … Experience with A/B testing and experimentation frameworks for AI features Contributions to open-source ML projects or research publications Experience with model observability tools (LangSmith, W&B;, MLflow) Our Commitment to Diversity, Equity, Inclusionand Belonging (DEIB) We believe attracting and retaining the best talent and fostering an inclusive ...

Senior Software Engineer (AI)

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
clean, well‐tested, well‐documented code that your peers can build on and maintain. Debugging, improving, and taking ownership of live systems – reliability and observability included. Contributing to technical design and architecture discussions within the team. Collaborating with the Product & Prototyping Lead to take validated concepts through to production quality. … real data and services. Prompt design, model evaluation, and the practical trade‐offs of LLM systems in production. Strong fundamentals: clean code, testing, documentation, observability, and operational reliability. Collaborative and comfortable working from well‐defined problems alongside product and engineering peers. Experience in financial services, B2B SaaS, or other regulated ...

Senior AI Consultant

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Well-formed opinions on and experience of AI-enhanced SDLCs Set teams up for sustainable success through: Strong engineering practices (testing, CI/CD, observability, quality) Clear system boundaries and maintainable architecture Safe and secure deployment patterns for AI components at scale AI performance, accuracy, and reliability monitoring through evals … testing strategies (unit/integration/e2e where appropriate) good CI/CD hygiene and code review practices clean boundaries and readable, extensible code observability and operational readiness (logging, monitoring, failure modes) Confidence working with AI uncertainty and risk You understand that AI systems behave differently to deterministic software ...

EMEA VP of AI-Observability Sales

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Snowflake is seeking a Sales Leader for the EMEA region to build and lead a high-performing sales team focused on AI-driven observability solutions. The ideal candidate will have over 10 years of experience in cloud and enterprise software sales, with a track record of managing successful sales teams. … This role offers a unique opportunity to shape the future of data observability in a fast-growing environment. A BA/BS degree is required, alongside strong leadership and coaching skills. #J-18808-Ljbffr ...

SRE Lead: Automation, Observability & Reliability

Hiring Organisation
Jobleads-UK
Location
Bromley, England, United Kingdom
Huxley is seeking an experienced SRE Lead to oversee SRE strategy within an investment banking environment. The role focuses on driving automation, improving observability, and enhancing reliability by design. Ideal candidates will possess over 8 years of SRE experience, particularly in resilience engineering, and demonstrable skills in scaling operations. This ...

Senior Product Manager

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
About ITRS At ITRS, we make society's critical technology work. Our mission is to deliver automated and holistic IT observability solutions that safeguard critical applications and enable innovation. We are the only monitoring and observability platform designed for the most demanding and regulated industries — trusted by 90% of Tier … trading resilience and Market Data Observability. These workstreams sit at the heart of the Geneos and ITRS Analytics (IAX) product line, a monitoring and observability platform used by 90% of Tier 1 capital markets firms tp ensure resilience of low-latency trading, core banking, payments, and market data infrastructure. This ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
North London, London, United Kingdom
Employment Type
Permanent
Salary
£50,000
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, modern technology estate. Responsibilities … improve speed, accuracy and consistency Supporting major changes, deployments and post-incident reviews with data-driven evidence Qualifications Strong experience with monitoring and observability tools (LogicMonitor, Azure Monitor, App Insights, Log Analytics, Defender for Cloud) Excellent understanding of cloud performance, IaaS/PaaS, networking fundamentals, API performance and capacity modelling ...

Full Stack Developer

Hiring Organisation
Medicines Evaluation Unit
Location
Manchester, North West, United Kingdom
Employment Type
Permanent
compliance requirements Contribute to secure software design, development, testing, and deployment Maintain and improve CI/CD pipelines and deployment processes, including Jenkins Support observability and monitoring practices using Open Telemetry, Loki and Grafana Write efficient, maintainable code and support code reviews and testing activities Work closely with cross-functional … Entity Framework Strong SQL Server development skills Experience with API development and integration Working knowledge of CI/CD practices and Jenkins Experience with observability tools and practices, including Open Telemetry, Loki and Grafana Understanding of software security principles and secure coding practices Proficiency in HTML, CSS, and JavaScript Experience ...

Senior Platform Engineer Telephony / VoIP / Linux

Hiring Organisation
PiPcall
Location
North London, London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£95,000
issues across application, operating system, and network layers Analyse logs, metrics, traces, and packet captures to identify root causes and prevent recurrence Improve monitoring, observability, automation, and operational tooling Collaborate closely with engineers working across backend services, APIs, and platform integrations What Were Looking For Essential: Strong experience running … Essential: Comfortable taking ownership, making sound technical decisions, and working effectively in a small engineering team Desirable: Experience with cloud infrastructure, automation, observability tooling, or relational databases Why Senior Engineers Join Us Flexible hours and regular remote working Competitive salary aligned to experience and technical depth End-to-end ownership ...

Head of Platforms - Technology, Infrastructure and Operations

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
testing, and automation. Drive adoption of AI-enabled engineering practices. Ensure secure and efficient-by-default platform services through automation. Operational Excellence Ensure reliability, observability, and cost efficiency of platform services. Define resilience, incident management, and operational models. Track and report on platform maturity and performance. Collaboration & Influence Partner with … developer experience. Demonstrated stakeholder influence across complex organizations. Experience leading distributed engineering teams. Familiarity with AI-enabled engineering practices. Strong grounding in SRE, observability, and secure-by-design. Excellent communication and leadership skills. Success Measures Increased developer productivity and satisfaction. Adoption of platform capabilities across engineering teams. Reduction in toil ...

Machine Learning Ops Engineer

Hiring Organisation
CMC Markets UK Plc
Location
City of London, London, United Kingdom
Employment Type
Permanent
meeting availability, latency, and freshness targets for ML services Debugging production issues across data, infrastructure, and model layers Improving system robustness through automation and observability Collaborating with platform and security teams on access, secrets, and compliance Engineering rigor Writing production-grade Python used in long-running services and pipelines Establishing … frameworks, experiment tracking, structured datasets Pipelines & Orchestration: Workflow schedulers for batch and near-real-time processing Deployment: Containers, model serving frameworks, infrastructure-as-code Observability: Metrics, logging, and alerting across data and model layers Cloud: Managed compute, storage, and networking (provider-agnostic mindset) The stack will evolve. We value engineers ...

Head of Platforms - Technology, Infrastructure and Operations

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
testing, and automation.• Drive adoption of AI-enabled engineering practices.• Ensure secure and efficient-by-default platform services through automation.## Operational Excellence• Ensure reliability, observability, and cost efficiency of platform services.• Define resilience, incident management, and operational models.• Track and report on platform maturity and performance.## Collaboration & Influence• Partner with … developer experience.• Demonstrated stakeholder influence across complex organizations.• Experience leading distributed engineering teams.• Familiarity with AI-enabled engineering practices.• Strong grounding in SRE, observability, and secure-by-design.• Excellent communication and leadership skills.# Success Measures• Increased developer productivity and satisfaction.• Adoption of platform capabilities across engineering teams.• Reduction in toil ...