1,026 to 1,050 of 1,200 Permanent Observability Jobs

Software Engineer - Python/Go

Hiring Organisation
Atarus
Location
United Kingdom
Doing ⚙️ • Building and scaling backend platform services powering real-time AI products • Breaking down monolithic systems into high-performance distributed services • Improving observability, deployment processes, and platform reliability • Working across LLM, speech, and telephony infrastructure • Optimising systems for low latency and high throughput workloads • Improving internal tooling and overall developer … system design, scalability, and reliability Tech 🧱 • Python/Go • Distributed systems & microservices • LLMs, speech systems, and real-time AI infrastructure • Modern cloud, deployment, and observability tooling Why It’s Interesting 💡 • High-impact engineering role within a rapidly scaling AI company • Opportunity to work on genuinely difficult real-time systems problems ...

Senior Engineering Manager

Hiring Organisation
Myn
Location
Manchester, England, United Kingdom
closely with product and senior stakeholders to maintain clarity on priorities Help shape engineering practices across areas such as CI/CD, automation and observability Drive improvements in delivery flow, release processes and overall engineering effectiveness Contribute to improving reliability, scalability and overall system health Manage cross-team dependencies, risks … distributed systems, production scale and operational complexity Track record of improving delivery across multiple teams Familiarity with modern engineering practices (CI/CD, automation, observability) Comfortable working across teams, stakeholders and organisational boundaries Strong leadership and communication skills Nice to have Experience in financial services, fintech or other complex, regulated ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
North London, London, United Kingdom
Employment Type
Permanent
Salary
£50,000
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, modern technology estate. Responsibilities … improve speed, accuracy and consistency Supporting major changes, deployments and post-incident reviews with data-driven evidence Qualifications Strong experience with monitoring and observability tools (LogicMonitor, Azure Monitor, App Insights, Log Analytics, Defender for Cloud) Excellent understanding of cloud performance, IaaS/PaaS, networking fundamentals, API performance and capacity modelling ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
Birchanger, Hertfordshire, United Kingdom
Employment Type
Permanent
Salary
GBP 40,000 - 50,000 Annual
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, modern technology estate. Responsibilities … improve speed, accuracy and consistency Supporting major changes, deployments and post-incident reviews with data-driven evidence Qualifications Strong experience with monitoring and observability tools (LogicMonitor, Azure Monitor, App Insights, Log Analytics, Defender for Cloud) Excellent understanding of cloud performance, IaaS/PaaS, networking fundamentals, API performance and capacity modelling ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
Stansted, Birchanger, Essex, United Kingdom
Employment Type
Permanent
Salary
£40000 - £50000/annum
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, modern technology estate. Responsibilities … improve speed, accuracy and consistency Supporting major changes, deployments and post-incident reviews with data-driven evidence Qualifications Strong experience with monitoring and observability tools (LogicMonitor, Azure Monitor, App Insights, Log Analytics, Defender for Cloud) Excellent understanding of cloud performance, IaaS/PaaS, networking fundamentals, API performance and capacity modelling ...

Head of Technology (Germany Onsite)

Hiring Organisation
Questhiring
Location
England, United Kingdom
execution. Drive implementation of LLM orchestration frameworks such as LangGraph, CrewAI, AutoGen, Semantic Kernel, or equivalent platforms. Establish best practices for AI evaluations, guardrails, observability, governance, safety, and reliability. Own the complete lifecycle of AI systems including experimentation, validation, production deployment, monitoring, and continuous optimization. Engineering & Platform Leadership Lead engineering … enabled experiences. Drive modernization of legacy platforms into cloud-native, API-first, AI-ready architectures. Ensure engineering excellence through scalable system design, automation, observability, and operational rigor. Guide teams in building highly available, secure, and resilient distributed systems. AI Platform & Infrastructure Build enterprise-grade AI platforms supporting model serving, vector ...

Engineering Manager

Hiring Organisation
Myn
Location
Manchester, England, United Kingdom
delivery progress Work closely with product and stakeholders on priorities, scope and trade-offs Support adoption of modern engineering practices (CI/CD, automation, observability) Drive improvements in flow, release processes and overall delivery effectiveness Contribute to system reliability, scalability and operational performance Collaborate with senior engineers and architects … environments with scale, distributed systems or complexity Track record of improving team delivery and performance Familiarity with modern engineering practices (CI/CD, automation, observability) Experience working in agile environments and collaborating with product stakeholders Strong leadership, communication and organisational skills Nice to have Experience in complex or regulated environments ...

Senior Frontend Developer

Hiring Organisation
Pentasia
Location
City of London, London, United Kingdom
platforms. - Lead or contribute to technical initiatives spanning multiple systems. - Mentor team members and support knowledge sharing and hiring activities. - Implement testing, monitoring, and observability best practices. - Contribute to secure development practices and compliance requirements where applicable. Desired experience: - Proven experience in a senior engineering role with ownership of complex … cloud environments (AWS preferred). - Familiarity with CI/CD pipelines, automated testing, and modern delivery practices. - Experience with performance optimisation, debugging, and observability tools. - Exposure to legacy systems and modernisation projects is advantageous. - Knowledge of secure coding practices and common security standards. - Experience mentoring engineers or influencing technical decisions ...

Staff AI Security Engineer

Hiring Organisation
Spring Health
Location
Seattle, Washington, United States
Employment Type
Permanent
Salary
USD Annual
security guardrails and tooling that enable safe experimentation across engineering and business teams Establish AI-specific governance frameworks covering identity, access control, auditability, and observability Take ownership of and lead our AI Red Team to proactively identify vulnerabilities Design and implement AI observability pipelines to detect anomalous model behavior ...

Staff AI Security Engineer

Hiring Organisation
Spring Health
Location
San Francisco, California, United States
Employment Type
Permanent
Salary
USD Annual
security guardrails and tooling that enable safe experimentation across engineering and business teams Establish AI-specific governance frameworks covering identity, access control, auditability, and observability Take ownership of and lead our AI Red Team to proactively identify vulnerabilities Design and implement AI observability pipelines to detect anomalous model behavior ...

Senior Frontend Developer

Hiring Organisation
Pentasia
Location
Newcastle Upon Tyne, England, United Kingdom
platforms. - Lead or contribute to technical initiatives spanning multiple systems. - Mentor team members and support knowledge sharing and hiring activities. - Implement testing, monitoring, and observability best practices. - Contribute to secure development practices and compliance requirements where applicable. Desired experience: - Proven experience in a senior engineering role with ownership of complex … cloud environments (AWS preferred). - Familiarity with CI/CD pipelines, automated testing, and modern delivery practices. - Experience with performance optimisation, debugging, and observability tools. - Exposure to legacy systems and modernisation projects is advantageous. - Knowledge of secure coding practices and common security standards. - Experience mentoring engineers or influencing technical decisions ...

Full Stack Developer

Hiring Organisation
Medicines Evaluation Unit
Location
Manchester, North West, United Kingdom
Employment Type
Permanent
compliance requirements Contribute to secure software design, development, testing, and deployment Maintain and improve CI/CD pipelines and deployment processes, including Jenkins Support observability and monitoring practices using Open Telemetry, Loki and Grafana Write efficient, maintainable code and support code reviews and testing activities Work closely with cross-functional … Entity Framework Strong SQL Server development skills Experience with API development and integration Working knowledge of CI/CD practices and Jenkins Experience with observability tools and practices, including Open Telemetry, Loki and Grafana Understanding of software security principles and secure coding practices Proficiency in HTML, CSS, and JavaScript Experience ...

Full Stack Developer

Hiring Organisation
Medicines Evaluation Unit Ltd, an IQVIA business
Location
Wythenshawe, England, United Kingdom
compliance requirements · Contribute to secure software design, development, testing, and deployment · Maintain and improve CI/CD pipelines and deployment processes, including Jenkins · Support observability and monitoring practices using Open Telemetry, Loki and Grafana · Write efficient, maintainable code and support code reviews and testing activities · Work closely with cross-functional … Entity Framework · Strong SQL Server development skills · Experience with API development and integration · Working knowledge of CI/CD practices and Jenkins · Experience with observability tools and practices, including Open Telemetry, Loki and Grafana · Understanding of software security principles and secure coding practices · Proficiency in HTML, CSS, and JavaScript · Experience ...

Senior Software Engineer - Quantum Computing

Hiring Organisation
DeepRec.ai
Location
City of London, London, United Kingdom
research teams to integrate new technologies from prototype through to production deployment. Build and improve CI/CD pipelines, deployment tooling, monitoring, and observability capabilities. Profile and optimise system performance, identifying bottlenecks and implementing measurable improvements. Contribute to software architecture decisions, technical design reviews, and engineering best practices. Improve code … adjacent systems within on-premise or data-centre environments. Knowledge of data acquisition, digital signal processing, timing synchronisation, telecommunications, or RF systems. Experience with observability and monitoring platforms. Exposure to scientific instrumentation, advanced computing platforms, or quantum technologies. Benefits & Opportunity Work on highly innovative next-generation computing technologies. Collaborate with ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
United Kingdom
Operations Center (NOC) responsibilities and engineering‐driven reliability practices . This role focuses on 24/7 service reliability, incident response, operational automation, and observability , while actively reducing operational toil through software and automation. Unlike a traditional NOC analyst, an SRE‐NOC is expected to engineer problems away , not just … Security, and Product teams Execute and improve runbooks, playbooks, and escalation paths Drive blameless post‐incident reviews (PIRs) and track corrective actions Monitoring, Alerting & Observability Own service health monitoring across infrastructure, applications, and dependencies Design and maintain alerting strategies that align with SLIs/SLOs Build dashboards using tools such ...

Senior Platform Engineer Telephony / VoIP / Linux

Hiring Organisation
PiPcall
Location
North London, London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£95,000
issues across application, operating system, and network layers Analyse logs, metrics, traces, and packet captures to identify root causes and prevent recurrence Improve monitoring, observability, automation, and operational tooling Collaborate closely with engineers working across backend services, APIs, and platform integrations What Were Looking For Essential: Strong experience running … Essential: Comfortable taking ownership, making sound technical decisions, and working effectively in a small engineering team Desirable: Experience with cloud infrastructure, automation, observability tooling, or relational databases Why Senior Engineers Join Us Flexible hours and regular remote working Competitive salary aligned to experience and technical depth End-to-end ownership ...

Machine Learning Ops Engineer

Hiring Organisation
CMC Markets UK Plc
Location
City of London, London, United Kingdom
Employment Type
Permanent
meeting availability, latency, and freshness targets for ML services Debugging production issues across data, infrastructure, and model layers Improving system robustness through automation and observability Collaborating with platform and security teams on access, secrets, and compliance Engineering rigor Writing production-grade Python used in long-running services and pipelines Establishing … frameworks, experiment tracking, structured datasets Pipelines & Orchestration: Workflow schedulers for batch and near-real-time processing Deployment: Containers, model serving frameworks, infrastructure-as-code Observability: Metrics, logging, and alerting across data and model layers Cloud: Managed compute, storage, and networking (provider-agnostic mindset) The stack will evolve. We value engineers ...

Staff Software Engineer, AI Reliability Engineering

Hiring Organisation
Jobleads-UK
Location
England, United Kingdom
Responsibilities Develop appropriate Service Level Objectives for large language model serving systems, balancing availability and latency with development velocity. Design and implement monitoring and observability systems across the token path. Assist in the design and implementation of high-availability serving infrastructure across multiple regions and cloud providers Lead incident response … more ML hardware accelerators (GPUs, TPUs, Trainium). Understand ML-specific networking optimizations like RDMA and InfiniBand. Have expertise in AI-specific observability tools and frameworks. Have experience with chaos engineering and systematic resilience testing. Have contributed to open-source infrastructure or ML tooling. The annual compensation range for this ...

AI Software Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
clean, well‐tested, well‐documented code that your peers can build on and maintain. Debugging, improving, and taking ownership of live systems – reliability and observability included. Contributing to technical design and architecture discussions within the team. Collaborating with the Product & Prototyping Lead to take validated concepts through to production quality. … real data and services. Prompt design, model evaluation, and the practical trade‐offs of LLM systems in production. Strong fundamentals: clean code, testing, documentation, observability, and operational reliability. Collaborative and comfortable working from well‐defined problems alongside product and engineering peers. Experience in financial services, B2B SaaS, or other regulated ...

Lead AI Consultant

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Well‐formed opinions on and experience of AI‐enhanced SDLCs Set teams up for sustainable success through: Strong engineering practices (testing, CI/CD, observability, quality) Clear system boundaries and maintainable architecture Safe and secure deployment patterns for AI components at scale AI performance, accuracy, and reliability monitoring through evals … automated testing strategies (unit/integration/e2e where appropriate) good CI/CD hygiene and code review practicesclean boundaries and readable, extensible code observability and operational readiness (logging, monitoring, failure modes) Confidence working with AI uncertainty and risk You understand that AI systems behave differently to deterministic software ...

Senior Consultant - Applied AI Engineer, TC

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Senior Consultant - Applied AI Engineer, TC Location: London Other locations: Primary Location Only Date: 9 Apr 2026 Requisition ID: 1700053 At EY, we’re all in to shape your future with confidence. We’ll help ...

Principal AI Architect for Agentic Platforms

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
A deep-tech company in Greater London seeks a Principal AI Engineer dedicated to shaping AI-driven engineering workflows. You will build a platform for advanced simulation tools and lead innovative efforts in agent architecture ...

Senior Software Engineer, Core Platform — Secure AI & RBAC

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
A leading deep-tech company is seeking a Senior Software Engineer to join their Core Services team in London. The ideal candidate will be responsible for building foundational systems for an AI-driven simulation software ...

Head of Brand & Content: Observability Thought Leader

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Itrs Insights is seeking a Head of Brand & Content to lead brand evolution and establish thought leadership in the observability space. This role involves creating a cohesive brand narrative, managing content strategy, and positioning ITRS executives as industry authorities. The ideal candidate will have over 10 years of B2B marketing ...

Monitoring & Observability Engineer

Hiring Organisation
17918
Location
London, United Kingdom
across the UK, Germany, France, and India, delivering complex, enterprise-grade IT solutions and consultancy across infrastructure, cloud, and modern operations. As a Monitoring & Observability Engineer, you'll work in... CRWG1_UKTJ ...