676 to 700 of 709 Observability Jobs in England

AI Engineering Manager

Hiring Organisation
Gravitas Recruitment Group (Global) Ltd
Location
London Area, United Kingdom
business needs into technical deliverables. Drive agentic workflows and AI tooling adoption across the product development lifecycle to deliver tangible value. Establish robust evaluation, observability, and quality practices for AI systems, balancing speed with reliability. Guide teams through ambiguity and rapid change, making pragmatic decisions and removing blockers. Measure success … development. Hands-on experience with AI models, tools, and frameworks, including agent orchestration, prompt engineering, RAG pipelines, evaluation frameworks, LangChain, Codex, Claude, Gemini, and observability tools and best practices. Strong technical problem-solving skills and the ability to guide teams through ambiguous, fast-changing environments. Excellent communication skills across technical ...

Gen AI Architect - London, UK

Hiring Organisation
Capgemini
Location
Greater London, United Kingdom
Employment Type
Full Time
production-grade AI systems using Amazon Bedrock, retrieval-augmented generation (RAG), agentic workflows, and cloud-native AWS services. Drive architecture standards, model orchestration, governance, observability, and operational excellence across the GenAI lifecycle while collaborating with engineering, security, compliance, and business stakeholders Hybrid working: The places that you work from … customization, prompt orchestration, retrieval pipelines, and agentic workflows Design agentic AI systems incorporating tool use, workflow orchestration, memory management, and autonomous decision flows Implement observability for prompts, model responses, vector retrieval quality, and agent execution workflows Integrate GenAI capabilities into enterprise applications, APIs, workflow platforms, and data ecosystems Work with ...

Full Stack Developer

Hiring Organisation
Medicines Evaluation Unit Ltd, an IQVIA business
Location
Wythenshawe, Greater Manchester, UK
compliance requirements · Contribute to secure software design, development, testing, and deployment · Maintain and improve CI/CD pipelines and deployment processes, including Jenkins · Support observability and monitoring practices using OpenTelemetry, Loki and Grafana · Write efficient, maintainable code and support code reviews and testing activities · Work closely with cross-functional … Entity Framework · Strong SQL Server development skills · Experience with API development and integration · Working knowledge of CI/CD practices and Jenkins · Experience with observability tools and practices, including OpenTelemetry, Loki and Grafana · Understanding of software security principles and secure coding practices · Proficiency in HTML, CSS, and JavaScript · Experience ...

Full Stack Developer

Hiring Organisation
Medicines Evaluation Unit Ltd, an IQVIA business
Location
Wythenshawe, England, United Kingdom
compliance requirements · Contribute to secure software design, development, testing, and deployment · Maintain and improve CI/CD pipelines and deployment processes, including Jenkins · Support observability and monitoring practices using OpenTelemetry, Loki and Grafana · Write efficient, maintainable code and support code reviews and testing activities · Work closely with cross-functional … Entity Framework · Strong SQL Server development skills · Experience with API development and integration · Working knowledge of CI/CD practices and Jenkins · Experience with observability tools and practices, including OpenTelemetry, Loki and Grafana · Understanding of software security principles and secure coding practices · Proficiency in HTML, CSS, and JavaScript · Experience ...

Senior Software Engineer - Python/AWS

Hiring Organisation
Lunio
Location
City of London, London, United Kingdom
break down ambiguity, coordinate across Product/Design/Data to land outcomes. Raise the quality bar: define practical standards for testing, security, and observability, act as approver on critical PRs, model excellent reviews and pairing. Operate and improve production: own service performance targets for your area, lead incident response … simple. Technical Execution & Delivery: Leads execution across multiple stories/engineers, breaks down ambiguous problems, and delivers predictably with sensible trade-offs. Testing, Reliability & Observability: Bakes in testability, defines/uses service performance targets, alerts, logs, and traces, advocates for reliability alongside features. Security & Privacy: Applies secure-by-default patterns ...

Head of Platforms - Technology, Infrastructure and Operations

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
code generation, testing, and automation. Drive adoption of AI‐enabled engineering practices. Ensure secure and efficient‐by‐default platform services through automation. Ensure reliability, observability, and cost efficiency of platform services. Define resilience, incident management, and operational models. Track and report on platform maturity and performance. Partner with Business Unit … developer experience. Demonstrated stakeholder influence across complex organizations. Experience leading distributed engineering teams. Familiarity with AI‐enabled engineering practices. Strong grounding in SRE, observability, and secure‐by‐design. Excellent communication and leadership skills. Success Measures Increased developer productivity and satisfaction. Adoption of platform capabilities across engineering teams. Reduction in toil ...

AI Architect

Hiring Organisation
Tata Consultancy Services
Location
City of London, London, United Kingdom
operated safely over time. Key responsibilities: Architect and govern multi-agent and agent-swarm systems at enterprise scale. Define agent safety, governance, observability, and testing standards. Establish AI guardrails, frameworks, governance models, and safety controls. Design human-in-the-loop optimisation to balance autonomy, reliability, and performance. Own patterns … native and agent-based design principles. Design and govern enterprise-scale distributed systems with embedded AI capabilities. Architect and evolve agent orchestration platforms. Own observability, reliability, security, scalability, performance, and cost management (FinOps). Ensure platforms are production-ready, secure, auditable, and compliant. Partner with CTOs and senior leadership ...

Staff Software Engineer, AI Reliability Engineering

Hiring Organisation
Jobleads-UK
Location
England, United Kingdom
Responsibilities Develop appropriate Service Level Objectives for large language model serving systems, balancing availability and latency with development velocity. Design and implement monitoring and observability systems across the token path. Assist in the design and implementation of high-availability serving infrastructure across multiple regions and cloud providers Lead incident response … more ML hardware accelerators (GPUs, TPUs, Trainium). Understand ML-specific networking optimizations like RDMA and InfiniBand. Have expertise in AI-specific observability tools and frameworks. Have experience with chaos engineering and systematic resilience testing. Have contributed to open-source infrastructure or ML tooling. The annual compensation range for this ...

Observability & Monitoring Engineer (Dynatrace)

Hiring Organisation
COMPUTACENTER (UK) LIMITED
Location
London, UK
Employment Type
Full-time
across the UK, Germany, France, and India, delivering complex, enterprise-grade IT solutions and consultancy across infrastructure, cloud, and modern operations. As a Monitoring & Observability Engineer, you'll work in ...

Monitoring & Observability Engineer

Hiring Organisation
COMPUTACENTER (UK) LIMITED
Location
London, UK
Employment Type
Full-time
Germany, France, and India, delivering complex, enterprise-grade IT solutions and consultancy across infrastructure, cloud, and modern operations. As a Monitoring & Observability Engineer, you'll work in ...

Solutions Architect (New Relic/Snowflake)

Hiring Organisation
FDM Group
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£80,000 per annum, Negotiable, Pro-rata, Inc benefits
Hybrid role based in London. Our client, a major UK retailer, is undergoing a significant integration transformation — building out a centralised platform and observability capability as part of a wider initiative to modernise how integration works across the business. The successful candidate will sit within the IAA team and play … role in supporting the design and build-out of the platform, working closely with the observability team to shape the technical direction and underpin the delivery backlog. This is a hands-on architecture role that combines design ownership with close collaboration across engineering and delivery teams. Responsibilities Support the design ...

Dynatrace Expert

Hiring Organisation
Norton Blake
Location
Basildon, England, United Kingdom
Dynatrace Expert/Observability CoE Lead Location: Basildon, UK (Fully Onsite) Daily Rate: £350 – £360 per day (Maximum) Contract Duration: Initial contract until July 2027 Start Date: Immediate Agency: Norton Blake About the Role Norton Blake is partnering with a leading global technology consultancy to recruit a contract Dynatrace Expert. … This position is for a major, long-term initiative establishing a unified, EMEA-wide Observability Centre of Excellence (CoE) for a world-renowned consultancy. This unique contract role blends strategic leadership with hands-on technical execution. You will act as a core member of the newly formed CoE—defining enterprise ...

Network Reliability Specialist

Hiring Organisation
Ncounter
Location
East London, London, England, United Kingdom
Employment Type
Full-Time
Salary
£160,000 - £180,000 per annum
processes, and preventing incidents before they occur. Working across data centre, enterprise, and cloud environments, you will take ownership of the tooling, automation, and observability capabilities that allow the wider business to operate with confidence. Key Responsibilities • Build and enhance network observability, monitoring, and alerting frameworks across critical infrastructure • Develop … production networks where uptime and reliability are critical • Hands-on experience with network automation using Python, Ansible, or similar technologies • Strong knowledge of monitoring, observability, and alerting platforms • Experience building operational tooling, automation frameworks, or reliability-focused engineering solutions • Understanding of network security principles and secure infrastructure practices • Experience with ...

C++ Engineer (Real-Time Full Tick Re-platform)

Hiring Organisation
Hays
Location
London, United Kingdom
Employment Type
Contract
Your new role You'll step in as a Senior C++ Engineer, leading the design and rollout of observability across real-time, latency-sensitive platforms. This is a hands-on role where you'll shape how customer experience, system reliability, and operational insight are measured end-to-end. … engineering teams to embed metrics, tracing, logging, profiling, and telemetry into high-performance C++ systems. What you'll need to succeed Deep experience in observability engineering - metrics, tracing, logging, profiling, telemetry pipelines. Strong C++ engineering background in real-time or high-performance environments. Understanding of customer-experience measurement ...

Software Engineer, Product & Platform

Hiring Organisation
Bikebook
Location
Hove, England, United Kingdom
Next.js on the frontend, with some mobile work in React Native. The product also involves relational data, APIs, background jobs, third-party integrations, observability, CI/CD and production systems. You do not need to know every part of this stack already, but you should be comfortable learning across it. … tools to make integrations easier to support. Performance and reliability You might improve busy parts of the app, including React rendering, data loading, caching, observability and production debugging. Example projects could include making a slow bookings screen faster, improving error visibility, or making background jobs easier to monitor and retry ...

Lead AI Architect

Hiring Organisation
MicroTECH Global Ltd
Location
Berkshire, England, United Kingdom
Employment Type
Full-Time
Salary
£119,000 - £120,000 per annum
neuro-symbolic AI concepts or hybrid reasoning architectures. Experience designing transparent, inspectable, or explainable AI methods. Practical experience with agentic reasoning evaluation, testing, benchmarking, observability, or failure analysis. Full-stack web development experience, including backend APIs and frontend application development. Technical Skills Strong Python engineering skills. Experience with modern … knowledge management, decision support, research automation, legal/financial/technical analysis, or complex operational workflows. Experience with production-grade AI system design, including observability, monitoring, testing, security, latency, cost control, and reliability. Familiarity with human-in-the-loop systems, provenance tracking, workflow auditability, or regulated environments. Experience integrating LLMs ...

Programme Director - Connectivity

Hiring Organisation
GSK
Location
Greater London, United Kingdom
Employment Type
Full Time
accountable for end‐to‐end programme delivery across multiple integrated workstreams (WAN/SD‐WAN, campus LAN/Wi‐Fi, cloud connectivity, security, DDI, observability and the managed services operating model). The role drives structured governance, disciplined execution and measurable benefits realisation, delivering change with minimal disruption across … dependency and change control) with clear decision points and escalation paths Drive cross‐workstream integration and sequencing across WAN, campus, security/segmentation, DDI, observability and MSP transition Own benefits tracking and value realisation, including cost reduction, performance/SLA improvements, resilience metrics and adoption of target standards Lead ...

VP, Marketing Operations and Agentic AI Systems

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
aligned to revenue targets and approved at exec level. Set the standard for what constitutes a production-grade agent at Sidetrade: evaluation suites, guardrails, observability, and retirement criteria. Advise the CMO and CEO on emerging AI capabilities and their commercial implications for pipeline and revenue. Lead delivery of production agents … quality parity or better. Own prompt libraries, tool definitions, and evaluation suites that govern agent quality across the team. Stand up and own the observability layer: cost per task, accuracy, drift, human escalation rate. Report the performance of the agentic layer monthly to the exec team and quarterly ...

Splunk Lead Engineer

Hiring Organisation
VIQU IT Recruitment
Location
London, UK
client, a leading finance house, is looking for a Lead Splunk Engineer to take the lead in the design and implementation of monitoring and observability patterns and standards within the Observability Team. This role will act as a technical authority, ensuring best practices are followed, automation-first approach ...

Principal SRE Engineer / Grafana Specialist - (Outside IR35)

Hiring Organisation
17918
Location
Bristol, Gloucestershire, United Kingdom
Lead SRE/Observability Engineering Lead - (Outside IR35 Contract/Remote) Location: Bristol/London HQ - Largely Remote (Occasional Travel) Day Rate: Outside IR35 - £700 p/d Duration: 3-6 Months Initial - with intention to extend Payment Terms: Monthly Our client is a FTSE100 Wealth/Asset Management firm … seeking to engage a Lead SRE Engineer (Observability SME) to support the implementation and ins...

AI Platform Engineer

Hiring Organisation
Harnham
Location
City of London, London, United Kingdom
reliability and production readiness Develop agent workflows interacting with structured data and knowledge systems Build scalable retrieval systems across complex, versioned datasets Implement guardrails, observability, and evaluation frameworks for AI systems Design APIs and integrations with external tools and platforms Collaborate closely with product and design to deliver … frameworks, or AI-powered workflows Solid programming experience including JavaScript and modern frontend frameworks Experience designing and building APIs and distributed systems Understanding of observability, scalability, and reliable architecture Comfortable working in fast-paced, product-focused environments What They Offer Competitive salary between £100,000 and £130,000 plus equity ...

Technical Integration & Reporting Lead

Hiring Organisation
eTeam Workforce Limited
Location
Sheffield, Yorkshire, United Kingdom
Employment Type
Contract
Contract Rate
£537 per day
platforms. Required Skills & Experience: 10+ years in engineering, infrastructure, or technical architecture roles in complex technology environments. Familiarity with SRE disciplines such as observability, service-level indicators/objectives (SLIs/SLOs), and automation of operational tasks. Demonstrated ability to interpret and apply control requirements in technical design contexts. Experience … external regulatory requirements (e.g. DORA, EBA, PRA). Background in service reliability, system diagnostics, or incident response. Experience of testing tools. Experience of observability systems such as Splunk, AppDynamics, etc. If you are interested in this position and would like to learn more, please send through your ...

Senior Software Engineer - Systematic Trading Infrastructure

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
execution systems and support live trading workflows Build tooling for real-time monitoring, diagnostics, performance attribution, and post-trade analysis Improve system design, scalability, observability, operational resilience, and maintainability Preferred Qualifications 5+ years of hands-on software engineering experience designing, building and operating systems with high complexity. Prior experience … data, research, back testing, execution, monitoring, and post-trade analysis. Strong system design skills and sound engineering judgment. High standards for correctness, reliability, testing, observability, and maintainability For more information about DRW's processing activities and our use of job applicants' data, please view our Privacy Notice at https:/ ...

Data Engineering Manager

Hiring Organisation
Skyscanner
Location
Greater London, United Kingdom
Employment Type
Full Time
search, social and programmatic. In other words, not just dashboards... but decisions. Along the way, you'll help evolve our data platform, improving scalability, observability and governance within a modern cloud environment. You'll partner across Marketing, Product, Analytics and Data Science to turn complex data into clear, actionable direction. … Partnering cross-functionally: You'll work closely with Marketing, Product, Analytics and Marketing Technology to shape and deliver the data roadmap. Improving data quality & observability: You'll champion reliable, trustworthy datasets with strong SLAs and clear monitoring. Balancing speed and sustainability: You'll navigate the trade-offs between rapid delivery ...

Senior Software Engineer I - Personal & Business Pricing Team

Hiring Organisation
Wise
Location
Greater London, United Kingdom
Employment Type
Full Time
Salary
£87,000 - £111,000 per annum
This job is with Wise, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly. Wise is a global technology company ...