1,101 to 1,125 of 1,270 Observability Jobs

Full Stack Developer

Hiring Organisation
Medicines Evaluation Unit
Location
Manchester, North West, United Kingdom
Employment Type
Permanent
compliance requirements Contribute to secure software design, development, testing, and deployment Maintain and improve CI/CD pipelines and deployment processes, including Jenkins Support observability and monitoring practices using Open Telemetry, Loki and Grafana Write efficient, maintainable code and support code reviews and testing activities Work closely with cross-functional … Entity Framework Strong SQL Server development skills Experience with API development and integration Working knowledge of CI/CD practices and Jenkins Experience with observability tools and practices, including Open Telemetry, Loki and Grafana Understanding of software security principles and secure coding practices Proficiency in HTML, CSS, and JavaScript Experience ...

Full Stack Developer

Hiring Organisation
Medicines Evaluation Unit Ltd, an IQVIA business
Location
Wythenshawe, England, United Kingdom
compliance requirements · Contribute to secure software design, development, testing, and deployment · Maintain and improve CI/CD pipelines and deployment processes, including Jenkins · Support observability and monitoring practices using Open Telemetry, Loki and Grafana · Write efficient, maintainable code and support code reviews and testing activities · Work closely with cross-functional … Entity Framework · Strong SQL Server development skills · Experience with API development and integration · Working knowledge of CI/CD practices and Jenkins · Experience with observability tools and practices, including Open Telemetry, Loki and Grafana · Understanding of software security principles and secure coding practices · Proficiency in HTML, CSS, and JavaScript · Experience ...

Senior Software Engineer - Quantum Computing

Hiring Organisation
DeepRec.ai
Location
City of London, London, United Kingdom
research teams to integrate new technologies from prototype through to production deployment. Build and improve CI/CD pipelines, deployment tooling, monitoring, and observability capabilities. Profile and optimise system performance, identifying bottlenecks and implementing measurable improvements. Contribute to software architecture decisions, technical design reviews, and engineering best practices. Improve code … adjacent systems within on-premise or data-centre environments. Knowledge of data acquisition, digital signal processing, timing synchronisation, telecommunications, or RF systems. Experience with observability and monitoring platforms. Exposure to scientific instrumentation, advanced computing platforms, or quantum technologies. Benefits & Opportunity Work on highly innovative next-generation computing technologies. Collaborate with ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
United Kingdom
Operations Center (NOC) responsibilities and engineering‐driven reliability practices. This role focuses on 24/7 service reliability, incident response, operational automation, and observability, while actively reducing operational toil through software and automation. Unlike a traditional NOC analyst, an SRE‐NOC is expected to engineer problems away, not just … Security, and Product teams Execute and improve runbooks, playbooks, and escalation paths Drive blameless post‐incident reviews (PIRs) and track corrective actions Monitoring, Alerting & Observability Own service health monitoring across infrastructure, applications, and dependencies Design and maintain alerting strategies that align with SLIs/SLOs Build dashboards using tools such ...

Senior Platform Engineer Telephony / VoIP / Linux

Hiring Organisation
PiPcall
Location
North London, London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£95,000
issues across application, operating system, and network layers Analyse logs, metrics, traces, and packet captures to identify root causes and prevent recurrence Improve monitoring, observability, automation, and operational tooling Collaborate closely with engineers working across backend services, APIs, and platform integrations What We're Looking For Essential: Strong experience running … Essential: Comfortable taking ownership, making sound technical decisions, and working effectively in a small engineering team Desirable: Experience with cloud infrastructure, automation, observability tooling, or relational databases Why Senior Engineers Join Us Flexible hours and regular remote working Competitive salary aligned to experience and technical depth End-to-end ownership ...

Machine Learning Ops Engineer

Hiring Organisation
CMC Markets UK Plc
Location
City of London, London, United Kingdom
Employment Type
Permanent
meeting availability, latency, and freshness targets for ML services Debugging production issues across data, infrastructure, and model layers Improving system robustness through automation and observability Collaborating with platform and security teams on access, secrets, and compliance Engineering rigor Writing production-grade Python used in long-running services and pipelines Establishing … frameworks, experiment tracking, structured datasets Pipelines & Orchestration: Workflow schedulers for batch and near-real-time processing Deployment: Containers, model serving frameworks, infrastructure-as-code Observability: Metrics, logging, and alerting across data and model layers Cloud: Managed compute, storage, and networking (provider-agnostic mindset) The stack will evolve. We value engineers ...

Staff Software Engineer, AI Reliability Engineering

Hiring Organisation
Jobleads-UK
Location
England, United Kingdom
Responsibilities Develop appropriate Service Level Objectives for large language model serving systems, balancing availability and latency with development velocity. Design and implement monitoring and observability systems across the token path. Assist in the design and implementation of high-availability serving infrastructure across multiple regions and cloud providers Lead incident response … more ML hardware accelerators (GPUs, TPUs, Trainium). Understand ML-specific networking optimizations like RDMA and InfiniBand. Have expertise in AI-specific observability tools and frameworks. Have experience with chaos engineering and systematic resilience testing. Have contributed to open-source infrastructure or ML tooling. The annual compensation range for this ...

AI Software Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
clean, well‐tested, well‐documented code that your peers can build on and maintain. Debugging, improving, and taking ownership of live systems – reliability and observability included. Contributing to technical design and architecture discussions within the team. Collaborating with the Product & Prototyping Lead to take validated concepts through to production quality. … real data and services. Prompt design, model evaluation, and the practical trade‐offs of LLM systems in production. Strong fundamentals: clean code, testing, documentation, observability, and operational reliability. Collaborative and comfortable working from well‐defined problems alongside product and engineering peers. Experience in financial services, B2B SaaS, or other regulated ...

Lead AI Consultant

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Well‐formed opinions on and experience of AI‐enhanced SDLCs Set teams up for sustainable success through: Strong engineering practices (testing, CI/CD, observability, quality) Clear system boundaries and maintainable architecture Safe and secure deployment patterns for AI components at scale AI performance, accuracy, and reliability monitoring through evals … automated testing strategies (unit/integration/e2e where appropriate) good CI/CD hygiene and code review practices clean boundaries and readable, extensible code observability and operational readiness (logging, monitoring, failure modes) Confidence working with AI uncertainty and risk You understand that AI systems behave differently to deterministic software ...

Principal AI Architect for Agentic Platforms

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
A deep-tech company in Greater London seeks a Principal AI Engineer dedicated to shaping AI-driven engineering workflows. You will build a platform for advanced simulation tools and lead innovative efforts in agent architecture ...

Senior Consultant - Applied AI Engineer, TC

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Senior Consultant - Applied AI Engineer, TC Location: London Other locations: Primary Location Only Date: 9 Apr 2026 Requisition ID: 1700053 At EY, we’re all in to shape your future with confidence. We’ll help ...

Senior Software Engineer, Core Platform — Secure AI & RBAC

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
A leading deep-tech company is seeking a Senior Software Engineer to join their Core Services team in London. The ideal candidate will be responsible for building foundational systems for an AI-driven simulation software ...

Head of Brand & Content: Observability Thought Leader

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
ITRS Insights is seeking a Head of Brand & Content to lead brand evolution and establish thought leadership in the observability space. This role involves creating a cohesive brand narrative, managing content strategy, and positioning ITRS executives as industry authorities. The ideal candidate will have over 10 years of B2B marketing ...

Monitoring & Observability Engineer

Hiring Organisation
17918
Location
London, United Kingdom
across the UK, Germany, France, and India, delivering complex, enterprise-grade IT solutions and consultancy across infrastructure, cloud, and modern operations. As a Monitoring & Observability Engineer, you'll work in ...

Monitoring & Observability Engineer

Hiring Organisation
COMPUTACENTER (UK) LIMITED
Location
London, UK
Germany, France, and India, delivering complex, enterprise-grade IT solutions and consultancy across infrastructure, cloud, and modern operations. As a Monitoring & Observability Engineer, you'll work in ...

Head of Support & Service Reliability Engineering

Hiring Organisation
Jobleads-UK
Location
Guildford, England, United Kingdom
transition from a single-tenant architecture to a multi-tenant SaaS platform, requiring a fundamental shift from reactive ticket handling to systemic reliability, observability, and customer experience management at scale. You will own the end-to-end operational integrity of the platform, ensuring availability, performance, and customer trust, while partnering …/P2) Drive improvements in: Mean Time to Detect (MTTD) Mean Time to Resolve (MTTR) Ensure clear, consistent internal and external communication during incidents Observability & Monitoring Define and implement a comprehensive observability strategy, including technical telemetry (infrastructure, application, APIs) Business telemetry (transactions, payment success rates, usage) End-to-end customer ...

Product Owner - Operational Resilience

Hiring Organisation
TEKsystems
Location
Sheffield, Yorkshire, United Kingdom
Employment Type
Contract
Contract Rate
GBP Annual
functional requirements, release readiness, operational acceptance). - Champion practices such as chaos engineering, game days, fault injection, capacity and performance testing, and DR readiness. Observability & insights - Partner with monitoring/observability teams to improve telemetry, alert quality, and actionable dashboards. - Use data to identify systemic risks, recurring failure modes ...

Senior DevOps Engineer

Hiring Organisation
Stealth IT Consulting Limited
Location
Telford, Shropshire, United Kingdom
Employment Type
Contract
Contract Rate
GBP 580 Daily
Observability Engineer (SC Eligible) Rate: £580/day Inside IR35 Duration: 6 months Location: Mostly remote (Telford occasional onsite - 2 days/month) Clearance: SC Eligible Role Overview We are seeking an experienced Observability Engineer to design, implement, and support enterprise-grade monitoring and observability solutions across complex technology environments ...

Senior DevOps Engineer

Hiring Organisation
17918
Location
Telford, Shropshire, United Kingdom
Observability Engineer (SC Eligible) Rate: £580/day Inside IR35 Duration: 6 months Location: Mostly remote (Telford occasional onsite - 2 days/month) Clearance: SC Eligible Role Overview We are seeking an experienced Observability Engineer to design, implement, and support enterprise-grade monitoring and observability solutions across complex technology environments. ...

Senior DevOps Engineer

Hiring Organisation
Stealth IT Consulting Limited
Location
Telford, Shropshire, UK
Observability Engineer (SC Eligible) Rate: £580/day Inside IR35 Duration: 6 months Location: Mostly remote (Telford occasional onsite - 2 days/month) Clearance: SC Eligible Role Overview … seeking an experienced Observability Engineer to design, implement, and support enterprise-grade monitoring and observability solutions across complex technology environments. The role focuses on improving servic ...

AI/ML Engineer

Hiring Organisation
PRACYVA
Location
Edinburgh, Scotland, United Kingdom
Summary Responsible for automating model deployment, ensuring version control, monitoring inference systems, and safely integrating AI agents into production with strong observability and rollback capabilities. Responsibilities • Automate ML model deployment workflows • Manage model versioning and release processes • Monitor inference cost, latency, and drift • Integrate AI agents safely into production systems … Implement observability, alerting, and rollback mechanisms Required Experience 7+ years in ML Engineering/AI Engineering Required Skills • Model deployment automation • Model version control • Monitoring inference cost, latency & drift • Production integration of AI/LLM agents • Observability & rollback systems Preferred Skills • Experience with containerized deployments • Familiarity with CI/ ...

MongoDB Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Knutsford, England, United Kingdom
clusters, sharding, replica sets, backups). Troubleshoot and resolve complex production issues across L1-L3. Build automation using Python, Ansible, TDD, Agile. Improve observability with better monitoring, alerting, and performance insights. Reduce toil by engineering tools and automation that transform the platform. Required Skills Deep MongoDB administration expertise. Strong … Manager and backup tooling. Solid troubleshooting and production support capability. SRE fundamentals and an automation‐first mindset. Hands‐on Python and Ansible experience. Observability experience (monitoring, alerting, dashboards). Why Apply Perfect for Senior DBAs wanting to transition into SRE/Engineering. ~25% of your time spent coding ...

Performance Analyst - Active SC - 12 Months

Hiring Organisation
Stealth IT Consulting
Location
United Kingdom
Employment Type
Contract
Contract Rate
GBP Annual
Produce performance reports and present findings to technical and non-technical stakeholders. Define and track KPIs, SLAs, and operational metrics. Recommend improvements to monitoring, observability, and reporting processes. Support major releases and change activities by assessing performance impacts. Essential Skills & Experience Strong experience working as a Performance Analyst, Monitoring Analyst … queries Dashboards Alerts Reports Data visualisations Experience analysing large datasets to identify trends and performance issues. Understanding of application performance monitoring and observability principles. Experience supporting complex enterprise environments. Strong stakeholder management and communication skills. Experience producing technical and management-level reporting ...

Lead Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
ownership, quality and continuous improvement Define and evolve engineering standards, frameworks and best practices across the entire engineering organisation Drive improvements in software quality, testing strategy, observability and release confidence Partner with engineering, platform, product and security to deliver large-scale, cross-functional improvements Shape and deliver internal tooling and AI-assisted engineering workflows … with an engineering-first mindset and influencing people with a broader organisational impact Previous experience on building enterprise level, highly scalable projects, focused on performance, observability and security best practices. Experience in a technology-driven organisation with strong engineering standards You’ll also bring: Strong experience improving engineering quality, reliability and operational maturity ...

Service Architect

Hiring Organisation
NineTech
Location
City of London, London, United Kingdom
operations and the integration of service management tools and processes. Proven experience applying service architecture principles to technical initiatives, including cloud migrations and observability platform integrations. Solid understanding of modern technology environments, including cloud platforms, AI infrastructure, and traditional IT estates. Subject matter expertise in at least one technical domain … such as networking, security, observability, or applications is highly desirable. Key Skills & Attributes Excellent communication skills with the ability to explain complex service and operating models to both technical and non-technical audiences. Comfortable engaging with stakeholders ranging from executive leadership to operational teams. Highly adaptable, with the ability ...