126 to 150 of 244 Observability Jobs in the UK excluding London

MuleSoft & Salesforce Agentic Engineer

Hiring Organisation
Arbuthnot Latham & Co., Limited
Location
Wolverhampton, West Midlands, United Kingdom
Employment Type
Permanent
Gateway (LLM/MCP/A2A) to integrate agents into existing flows, data models and processes. Agent Control, Monitoring & Governance Implement control, monitoring and observability for Salesforce agents, including usage, decisioning outcomes, errors and exceptions. Ensure agent behaviour aligns with internal policies, regulatory expectations and audit requirements appropriate to asset ...

Founding Engineer

Hiring Organisation
RedTech Recruitment Ltd
Location
Cambridge, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent
Salary
£95,000
develop high-quality frontend interfaces that make complex AI outputs intuitive and actionable for users Build and maintain deployment pipelines, testing frameworks, monitoring, and observability systems Design and implement secure data pipelines with appropriate access controls and auditability Ensure the platform meets enterprise-grade security and compliance requirements ...

Staff Site Reliability Engineer - Cloud

Hiring Organisation
Jobleads-UK
Location
Newcastle upon Tyne, England, United Kingdom
Newcastle: UK - London: UK - Leedstime type: Full timeposted on: Posted Todayjob requisition id: R55272**Elevate Global Operations as our Next Cloud Site Reliability Engineer (Observability Expert)!**Trimble is an industrial technology company transforming the way the world works by delivering solutions that enable our customers to thrive. We create technologies … progress with connected hardware and software solutions.**What Makes This Role Great:**In this role, you will be the primary architect of our Observability Centre of Excellence, directly influencing the reliability and uptime of global platforms that keep world industries moving.**Key Exciting Responsibilities:*** Lead a global "OTel First" strategy ...

DevOps Engineer

Hiring Organisation
Twinstream Limited
Location
Bristol, United Kingdom
Employment Type
Contract
Contract Rate
£500 - £600/day
container services and AMQP messaging. Working closely with feature delivery teams, you’ll help drive reliable production releases, maintain CI/CD pipelines, improve observability and ensure systems continue to meet demanding SLA/SLO targets. This is an excellent opportunity for a seasoned engineer who enjoys solving complex operational … promote releases into production efficiently and safely Maintaining highly available services using real-time monitoring and system metrics Building and improving monitoring, alerting and observability capabilities Investigating alerts and incidents, implementing preventative and remedial actions Working with customer stakeholders to coordinate releases and evolving service requirements Driving automation to reduce ...

Platform Engineer

Hiring Organisation
Digital Waffle
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£70,000 - £75,000 per annum
data movement Support and evolve data platforms (Databricks ideal) Build and maintain data pipelines (batch + streaming/ETL/ELT) Improve platform reliability, observability, and performance Collaborate with engineering teams to improve developer experience Requirements Strong Azure cloud experience Background in Platform Engineering, DevOps, or SRE Strong experience with … Strong understanding of data pipelines and distributed systems Focus on automation, scalability, and reliability Nice to Have Lakehouse or large-scale data platform experience Observability tooling (Datadog, Grafana, Prometheus) SaaS/high-growth product experience Strong developer experience mindset ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Birmingham, England, United Kingdom
pipelines to facilitate smooth deployments and automate workflows. Collaborate with development teams to establish best practices in system architecture, deployment, and monitoring. Implement observability solutions to gain insights into system performance and user experience. Participate in on-call rotations to respond to system alerts, perform root cause analysis, and implement … code tools (Terraform, Ansible, etc.) for automating deployments. Proficiency in scripting and programming languages such as Python, Go, or Bash. Familiarity with monitoring and observability tools (Prometheus, Grafana, ELK stack). Excellent problem-solving skills and the ability to work effectively in high-pressure situations. Health Care Plan (Medical, Dental ...

Site Reliability Engineer (Kubernetes / Multi-Cloud) UK Based

Hiring Organisation
Jobleads-UK
Location
Hereford, England, United Kingdom
Cluster Autoscaler, KEDA, Karpenter) Help improve workload reliability and performance Support networking, identity, compute, and storage services Assist with maintaining secure and scalable environments Observability & Monitoring Work with Prometheus, Grafana, OpenTelemetry, Azure Monitor, and CloudWatch Build dashboards, alerts, and logging/tracing pipelines Support monitoring aligned to SLIs/SLOs … networking, and scaling Cloud Experience with Azure and/or AWS Familiarity with networking, IAM, and core services Infrastructure as Code Experience with Terraform Observability Familiarity with monitoring/logging tools (Prometheus, Grafana, loki) Other Technical Skills Helm Charts/Kustomize creation and maintenance Containers (Docker) Exposure to both Azure ...

Principal Java Architect

Hiring Organisation
Jobleads-UK
Location
Nottingham, England, United Kingdom
LSEG (London Stock Exchange Group) is more than a diversified global financial markets infrastructure and data business. We are dedicated, open-access partners with a dedication to excellence in delivering the services our customers expect ...

Senior Software Engineer (£60k + benefits)

Hiring Organisation
Jobleads-UK
Location
Wigan, England, United Kingdom
production. As a Senior Software Engineer you’d play a key role in system design, helping to modernise their existing microservices and improve observability and testability by using modern approaches like hexagonal architecture and agentic behaviour driven design. The money is good too – up to £60k plus benefits including ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
Engineer (SRE) to join a high-performing team supporting multiple data product and platform groups. This role is focused on improving the reliability, scalability, observability, deployment, and operational support of critical data-driven platforms and services operating within complex production environments. Responsibilities Work closely with engineering, platform, and operational support ...

Senior Software Engineer (Python)

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£90,000 - £100,000 per annum
cross-service features Make pragmatic architecture and design decisions Own services end-to-end, including performance, reliability, and incident response Set standards for testing, observability, and security Mentor engineers and contribute to strong team practices Collaborate closely with Product, Design, and Data teams What we're looking for: Strong experience ...

AI Platform Engineer

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£90,000 - £130,000 per annum
environment Work closely with product and design to ensure AI features are user focused and production ready Help establish best practice around observability, governance and responsible AI development Your Skills and Experience You will be a product minded engineer who enjoys building platforms from the ground up. Strong commercial experience ...

Data Quality Lead

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
with business stakeholders to define critical data elements, data definitions, and “fit for use” requirements. Familiarity with data quality tooling and modern orchestration/observability practices. Comfortable building processes from scratch in a newly formed team. Resourceful, motivated self-starter with the ability to collaborate across business and technology Strong ...

Voice AI Technical Lead

Hiring Organisation
Vallum Associates Limited
Location
Windsor, Berkshire, South East, United Kingdom
Employment Type
Contract
Contract Rate
Up to £600 per day
understood units of work. Raise the engineering bar across the team introduce and enforce best practice for prompt engineering, evals, regression testing, latency budgeting, observability, CI/CD and release management for LLM-driven systems through code review, pairing, internal guilds, brown-bags and written playbooks. Build the evaluation ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Knutsford, England, United Kingdom
challenges at scale. As part of the Database SRE team, you will be data-driven and work to eliminate TOIL through simplification, automation, and observability, thereby enhancing the reliability of our platforms. With a focus on database scalability, availability, security, and performance, you will work closely with the Engineering team ...

Principal Network Engineer

Hiring Organisation
Ageas Insurance Limited
Location
Chandler's Ford, Eastleigh, Hampshire, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
architecture decisions, and transformation initiatives Ensure network resilience, performance, availability, and security across environments Drive platform modernisation, lifecycle planning, and technical debt reduction Embed observability, automation, and reliability engineering into operations Skills and experience you need as Principal Network Engineer: Recognised technical authority with strong stakeholder influence Calm and methodical ...

Lead Engineer (AI Native)

Hiring Organisation
Jobleads-UK
Location
Leeds, England, United Kingdom
ensuring decisions and changes are traceable and explainable. Build and coach quality from the start by applying AI to strong foundational techniques such as observability, verification and build automation. Help clients make deliberate AI‐focused technology and tooling choices that avoid unnecessary lock‐in and allow delivery approaches to evolve ...

Lead Engineer (AI Native)

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
ensuring decisions and changes are traceable and explainable. Build and coach quality from the start by applying AI to strong foundational techniques such as observability, verification and build automation. Help clients make deliberate AI‐focused technology and tooling choices that avoid unnecessary lock‐in and allow delivery approaches to evolve ...

Lead Engineer (AI Native)

Hiring Organisation
Jobleads-UK
Location
City of Edinburgh, Scotland, United Kingdom
ensuring decisions and changes are traceable and explainable. Build and coach quality from the start by applying AI to strong foundational techniques such as observability, verification and build automation. Help clients make deliberate AI‐focused technology and tooling choices that avoid unnecessary lock‐in and allow delivery approaches to evolve ...

Senior Azure Platform Engineer

Hiring Organisation
Rebel Recruitment
Location
Salford, Greater Manchester, North West, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£90,000
about being part of that journey. You'll be working in an environment where tools like GitHub Copilot, OpenAI models, Claude, Gemini, AI-powered observability platforms, intelligent deployment workflows, and internal AI tooling are actively being explored and introduced to improve how engineering teams work day to day. This … designing and improving Azure infrastructure, evolving Kubernetes platforms within AKS, building reusable Infrastructure-as-Code patterns using Terraform and Crossplane, and helping improve reliability, observability, and security across the wider platform estate. You'll also spend time improving developer tooling and CI/CD processes, helping engineering teams deploy faster ...

Ai Engineer

Hiring Organisation
Morgan McKinley
Location
Yorkshire and Humberside, England, United Kingdom
Employment Type
Full-Time
Salary
Salary negotiable
with Generative and Agentic AI patterns, including LLM integration, RAG architectures, prompt-driven workflows, and AI service orchestration. Integrate AI capabilities with enterprise systems, observability tooling, and security frameworks. Design and maintain CI/CD pipelines within cloud-native engineering environments. Support benchmarking, evaluation, experimentation, and cost optimisation … Skills Experience with Kong API Gateway, Kong Mesh, and Flux CD. RESTful API and microservices development. Terraform and GitOps workflows. Exposure to prompt evaluation, observability, or AI red-teaming tools. SQL and NoSQL database experience. Understanding of vector search technologies and Retrieval-Augmented Generation (RAG) patterns. About You A proactive ...

Lead AI Engineer

Hiring Organisation
Morgan McKinley
Location
Yorkshire and Humberside, England, United Kingdom
Employment Type
Full-Time
Salary
Salary negotiable
with Generative and Agentic AI patterns, including LLM integration, RAG architectures, prompt-driven workflows, and AI service orchestration. Integrate AI capabilities with enterprise systems, observability tooling, and security frameworks. Design and maintain CI/CD pipelines within cloud-native engineering environments. Support benchmarking, evaluation, experimentation, and cost optimisation … Skills Experience with Kong API Gateway, Kong Mesh, and Flux CD. RESTful API and microservices development. Terraform and GitOps workflows. Exposure to prompt evaluation, observability, or AI red-teaming tools. SQL and NoSQL database experience. Understanding of vector search technologies and Retrieval-Augmented Generation (RAG) patterns. About You A proactive ...

Senior Developer

Hiring Organisation
Addition
Location
Watford, Hertfordshire, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 per annum
Doing: Designing, deploying and managing automation and monitoring platforms that support large-scale applications and services Building and maintaining monitoring, alerting and observability tooling across the platform Creating dashboards that translate complex technical data into meaningful insights for stakeholders Developing automation to integrate new systems using existing frameworks Managing … Docker) Strong Python development skills , including scripting and Lambda functions Experience building and managing CI/CD pipelines , ideally with GitHub Actions Monitoring and observability tooling such as AppDynamics, Grafana, InfluxDB, Graphite, Sensu or similar Experience working with serverless architectures (Lambda, API Gateway, DynamoDB, EventBridge) Solid understanding of Linux/ ...

Site Reliability Engineer Newcastle upon Tyne, England, GB Posted 13 hours ago

Hiring Organisation
Jobleads-UK
Location
Newcastle upon Tyne, England, United Kingdom
service and infrastructure.### ****Key Responsibilities:***** Develop and maintain infrastructure as code (IaC) using Terraform to ensure reliable and scalable cloud environments;* Implement and enhance observability solutions using tools like New Relic, DataDog, Sumologic and Splunk for monitoring, logging, and alerting;* Perform code deployments and manage CI/CD pipelines using … like;* Prometheus, Grafana, New Relic, DataDog, Splunk, Cloudwatch, Sumologic etc.* Strong understanding of networking and security concepts;### ### ****Additional experience preferred in:***** SRE observability experience with NewRelic or Datadog;* OpenTelemetry;* AIOps/MLOps;* SecOps.****How to Apply:**** Please submit an online application for this position by clicking ...

Principal Software Development Engineer

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
/CD pipelines, Infrastructure as Code, automation frameworks, and database-as-code practices using Redgate Flyway.Take ownership of critical customer systems, ensuring operational resilience, observability, performance optimisation, and rapid incident response.Collaborate closely with Product, Delivery, Operations, and Commercial teams to shape technical solutions, delivery plans, and strategic outcomes.Promote secure … Connect or Genesys Cloud.Proven ability to design and deliver secure, scalable, and resilient cloud-native solutions within complex enterprise environments.Strong understanding of observability, operational support, reliability engineering, and end-to-end ownership practices.Knowledge of regulated financial services environments, including UK GDPR and FCA Consumer Duty requirements.Excellent communication and stakeholder management ...