351 to 375 of 470 Observability Jobs in England

Oracle Middleware Engineer

Hiring Organisation
KBC Technologies Group
Location
England, United Kingdom
Operating Systems Linux, UNIX (Solaris exposure desirable) Common Mandatory Skills SQL Oracle databases, performance and troubleshooting queries Linux Scripting, process analysis, log review Monitoring & Observability Dynatrace, Splunk, Elastic stack ITIL Incident, Change, Problem, Release Management Service Management tools Remedy or equivalent Jira, Confluence, Knowledge Base authoring ...

Operational Resilience Consultant

Hiring Organisation
Xcede
Location
South Yorkshire, England, United Kingdom
challenge assumptions and drive practical outcomes Desirable Financial services experience Knowledge of UK Operational Resilience and/or DORA Exposure to ServiceNow, CMDBs, or observability tooling Infrastructure, cloud, or enterprise architecture background Candidate Profile We are looking for delivery-focused consultants who can combine operational resilience understanding with practical execution ...

Senior Network Architect, GPU Fabric and AI Infrastructure

Hiring Organisation
We Love Alfa
Location
London, United Kingdom
Employment Type
Permanent
Salary
GBP 180,000 - 240,000 Annual
directly impact customer training workloads. This person will own network architecture across GPU fabric, InfiniBand, RoCE v2, Ethernet leaf spine, edge connectivity, peering, observability, deployment standards and operational handover. We are looking for someone who has: Deep GPU cluster or HPC deployment experience Strong InfiniBand production experience RoCE v2 experience ...

Senior Infrastructure Architect

Hiring Organisation
ALFA TECHNOLOGY RECRUITMENT LTD
Location
City of London, London, United Kingdom
Employment Type
Temporary
directly impact customer training workloads. This person will own network architecture across GPU fabric, InfiniBand, RoCE v2, Ethernet leaf spine, edge connectivity, peering, observability, deployment standards and operational handover. We are looking for someone who has: Deep GPU cluster or HPC deployment experience Strong InfiniBand production experience RoCE v2 experience ...

Back End Developer

Hiring Organisation
NearTech Search
Location
London Area, United Kingdom
backend initiatives end-to-end, from architecture to rollout • Strengthen testing strategy across unit and integration layers • Improve data and integration workflows with observability and resilience • Optimise Postgres (RDS) and MongoDB performance, modelling and migrations The role requires... • Strong commercial experience with Node.js and TypeScript • Deep API design expertise, including ...

BDR Language Speaker

Hiring Organisation
Pareto
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£30,000 - £35,000 per annum
must speak Filipino fluently to qualify for this role* Our client is a global data platform that helps turn data into action for Observability, IT, Security and more. Leaders in their field, our client is growing at an exciting rate and as such are now looking for new bi-lingual ...

Senior Frontend Developer

Hiring Organisation
SEEKR
Location
City of London, London, United Kingdom
bridges so builders can wire their products into hundreds of third‐party tools without hand‐rolling every integration. It handles managed auth, real‐time observability and connector sprawl so product teams can focus on great agent experiences instead of glue code. Your job is to make the surface they ...

Modern Workplace Solution Architect (M365)

Hiring Organisation
Pontoon
Location
Leeds, West Yorkshire, United Kingdom
Employment Type
Contract
migrations Driving improvements in automation and infrastructure (Terraform, CI/CD pipelines) Exploring AI and modern workplace innovations (e.g., Copilot) Enhancing data visibility and observability across platforms Nice-to-Have Automation experience (PowerShell, Terraform, CI/CD) Exposure to Power Platform (Power Apps, Power Automate) Experience with data, monitoring … observability tools Background in regulated environments (e.g., financial services) Why Join Us? Work on high-impact, enterprise-scale projects that make a difference! Gain exposure to a wide variety of technologies and initiatives. Have the opportunity to influence modern workplace strategy and design. Thrive in a collaborative, fast-moving ...

Senior Software Engineer

Hiring Organisation
Harrington Starr
Location
City of London, London, United Kingdom
business-critical trading platform. The role combines software engineering with reliability engineering. You’ll be involved in designing and building internal tooling, improving observability, automating operations, supporting development teams, and helping ensure trading systems remain stable, scalable, and high performing. It would suit someone who enjoys solving technical problems … speed, resilience, and continuous improvement matter. What you will do Build tools, automation, and internal services that improve platform reliability Implement monitoring, telemetry, and observability standards across distributed systems Analyse performance across application, OS, and network layers to identify bottlenecks Help define and improve SLOs/SLAs for critical services ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
South East London, London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£65,000
performant infrastructure that underpins critical public-sector services. You'll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. You'll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Engineering Manager (.NET) - Contract

Hiring Organisation
La Fosse
Location
City of London, London, United Kingdom
resource/capacity management and delivery ownership. - Experience writing executive updates and technical summaries for senior stakeholders. - Strong knowledge of CI/CD, automation, observability, and DevOps maturity models. - Evidence of driving adoption of new tools, frameworks, or processes across multiple teams. Technical Skills & Tools - Languages & Frameworks: C#/.NET … Framework and Core), React - Platforms & Infrastructure: Azure, AKS, Docker, on-prem Windows Server, SQL Server. - IAM and App Gateways: Okta, APIM, Apigee - Monitoring & Observability: Dynatrace, Application Insights - CI/CD & DevOps: Azure DevOps pipelines, SonarCloud, Github - Architecture & Patterns: Microservices, event-driven architecture, domain-driven design, modern scalable design principles ...

Site Reliability Engineer (SRE)

Hiring Organisation
Reading Industrial Pertemps
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£50,000 per annum
platform automation, CI/CD, and developer tooling. This is a hands-on role split between supporting engineers and building scalable infrastructure, automation, and observability solutions. You’ll work closely with the Head of Technology and engineering teams to improve reliability, developer experience, and platform performance. What You’ll Be Doing … Build reusable Terraform modules and manage infrastructure-as-code standards Develop internal tooling, automation scripts, self-service tooling, and platform improvements Own and improve observability across monitoring, dashboards, alerting, and runbooks Identify opportunities to automate manual processes and improve platform reliability Contribute to scalable, maintainable, and secure infrastructure practices What ...

Artificial Intelligence Engineer

Hiring Organisation
Soho Square Solutions
Location
London Area, United Kingdom
implement AI agents, including: ◦ Retrieval (RAG) ◦ Orchestration workflows ◦ Tool/function invocation ◦ Policy-based routing • Build evaluation frameworks for accuracy, latency, and reliability • Implement observability and monitoring for agent lifecycle AI Platform Integration • Integrate with AI providers (e.g., OpenAI, Anthropic, Google Vertex, open-source models) • Build abstraction layers to support … production (agents, RAG, orchestration) • Proficiency in Python, Java, or similar backend languages • Experience with: ◦ CI/CD pipelines ◦ Infrastructure as code ◦ Monitoring and observability tools • Hands-on experience with AI platforms (OpenAI, Claude, Vertex AI, or similar) Preferred Experience • Experience with agent frameworks (e.g., LangGraph, AutoGen, CrewAI) • Experience designing multi ...

Forward Deployed Engineers

Hiring Organisation
Randstad Technologies
Location
London, UK
Employment Type
Full-time
implement AI agents, including: Retrieval (RAG) Orchestration workflows Tool/function invocation Policy-based routing Build evaluation frameworks for accuracy, latency, and reliability Implement observability and monitoring for agent lifecycle AI Platform Integration Integrate with AI providers (e.g., OpenAI, Anthropic, Google Vertex, open-source models) Build abstraction layers to support … production (agents, RAG, orchestration) Proficiency in Python, Java, or similar backend languages Experience with: CI/CD pipelines Infrastructure as code Monitoring and observability tools Hands-on experience with AI platforms (OpenAI, Claude, Vertex AI, or similar) Preferred Experience Experience with agent frameworks (e.g., LangGraph, AutoGen, CrewAI) Experience designing multi ...

Forward Deployed Engineers

Hiring Organisation
Randstad Digital
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£450 - £500 per day + Inside IR35
implement AI agents, including: Retrieval (RAG) Orchestration workflows Tool/function invocation Policy-based routing Build evaluation frameworks for accuracy, latency, and reliability Implement observability and monitoring for agent lifecycle AI Platform Integration Integrate with AI providers (e.g., OpenAI, Anthropic, Google Vertex, open-source models) Build abstraction layers to support … production (agents, RAG, orchestration) Proficiency in Python, Java, or similar backend languages Experience with: CI/CD pipelines Infrastructure as code Monitoring and observability tools Hands-on experience with AI platforms (OpenAI, Claude, Vertex AI, or similar) Preferred Experience Experience with agent frameworks (e.g., LangGraph, AutoGen, CrewAI) Experience designing multi ...

Cloud Engineer

Hiring Organisation
Spectrum It Recruitment Limited
Location
Southampton, Hampshire, South East, United Kingdom
Employment Type
Permanent
Salary
£65,000
Terraform to automate and standardise infrastructure delivery. You'll support the migration and modernisation of traditional infrastructure into cloud services. You'll improve monitoring, observability, security and resilience across cloud platforms. You'll work with engineering, infrastructure and business teams to turn requirements into practical cloud solutions. You'll contribute … teams and wider stakeholders Useful: Cloud migration experience Azure DevOps and YAML pipelines PowerShell, Python or Bash scripting Docker or containerised environments Monitoring and observability tooling Experience in regulated or customer-critical environments Why apply? This is a good opportunity for a Cloud Engineer who wants to work on meaningful ...

Platform Engineer

Hiring Organisation
Albert Bow
Location
City of London, London, United Kingdom
preparation, turning compliance into a competitive advantage Build and maintain robust CI/CD pipelines across backend, frontend, and data services Establish company-wide observability — logging, metrics, tracing, alerting, and on-call culture Take ownership of cloud cost management, optimising spend without compromising performance Champion operational excellence across the engineering … What You'll Bring Technical Cloud & IaC: Azure (AWS a bonus), Terraform, AKS/Kubernetes, Docker, GitHub Actions Observability: Hands-on experience with logging, metrics, and distributed tracing frameworks Security: Secrets management, security scanning, and infrastructure hardening best practices Networking: VPCs, DNS, load balancers, VPNs, firewalls — you know your ...

Performance and Monitoring Engineer

Hiring Organisation
Solus Accident Repair Centres
Location
North London, UK
Employment Type
Full-time
talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, ...

DevOps Engineer (Azure)

Hiring Organisation
Experis
Location
Wembley, England, United Kingdom
Responsibilities Observability & Monitoring Platform Design, implement, and own an Azure observability playbook, delivering comprehensive dashboards, alerting rules, and operational runbooks using Application Insights, Log Analytics, and Kusto Query Language (KQL). AIOps & Intelligent Automation Develop AI‐driven alerting and detection mechanisms to surface early‐warning signals, including IP reputation degradation … scale. Infrastructure as Code Expertise Deep proficiency in Terraform, including module design, remote state management, workspace strategies, and multi‐environment deployment patterns. Monitoring & Observability Expertise Advanced experience with KQL for Azure Log Analytics, with the ability to design and build custom Azure Monitor Workbooks for operational insight and reporting. Security ...

AI Deployment & Platform Engineer

Hiring Organisation
LEC AI
Location
London, England, United Kingdom
engineering team to deploy AI systems into live environments, manage runtime infrastructure, scale orchestration systems, optimise inference performance, and build the deployment pipelines and observability that keep everything running. This is a deeply hands-on engineering role for someone who enjoys building production infrastructure, solving operational problems, and making … inference infrastructure and deployment automation • Design scalable runtime environments for multi-agent systems Reliability and Scaling • Monitor system performance, latency, throughput, and uptime • Build observability, logging, and alerting systems • Manage autoscaling and infrastructure optimisation • Debug production failures and runtime bottlenecks Infrastructure Operations • Monitor model drift, data drift, and runtime quality ...

Splunk Developer

Hiring Organisation
Infoplus Technologies UK Ltd
Location
North East, Glasgow, UK
application teams to deliver scalable monitoring, service health, and analytics solutions. Key Responsibilities Technical Leadership Act as Technical Lead for Splunk implementations across monitoring, observability, and service intelligence use cases. Own end to end Splunk solution design including data onboarding, data models, dashboards, alerts, and ITSI objects. Review and govern Splunk development … Studio/Classic dashboards. Design meaningful alerts using: correlation searches; risk based alerting principles. Translate operational and business requirements into actionable insights. Observability & Production Support Integrate Splunk with enterprise observability tools (APM, infrastructure monitoring, cloud platforms). Support production incidents using Splunk, driving root cause analysis and post incident reviews. Improve alert quality by reducing noise ...

DevOps Engineer

Hiring Organisation
Reed Technology
Location
Durham, County Durham, North East, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£60,000
pipelines using Azure DevOps Supporting monitoring, reliability, and operational readiness Working alongside engineers to embed better DevOps and platform practices Contributing to security, observability, and continuity planning What they're looking for Proven experience in an Azure-focused DevOps or platform engineering role Hands-on Terraform experience used in live … essential) DevSecOps exposure Cloud cost management/FinOps awareness Understanding of .NET/C# based platforms Scripting with PowerShell, Bash or Python Experience with observability and monitoring tools Interest in using AI tools to improve engineering productivity Working setup & culture Hybrid working with a flexible, trust-based approach Supportive, inclusive ...

Java Software Engineer

Hiring Organisation
Addition
Location
Cheltenham, England, United Kingdom
secure coding principles to meet strict performance and security standards Contribute to architecture decisions, code quality, testing and continuous improvement Implement monitoring, logging and observability to support live environments Main Skills Needed: Strong Java development experience (Java 11+) using frameworks like Spring Boot Solid understanding of software architecture … building microservices and distributed systems Proven ability to deliver scalable, high-performance backend applications Familiarity with DevSecOps tools (Docker, Kubernetes, CI/CD, testing, observability) Confident working with stakeholders to translate requirements into technical solutions Active enhanced DV clearance What’s in It for You: Work on meaningful projects that ...

Cloud DevOps Engineer - Derby - £70K

Hiring Organisation
Akkodis
Location
Derbyshire, United Kingdom
Employment Type
Permanent
Salary
£50000 - £70000/annum
where there's genuinely a lot going on, in a good way. They're moving away from legacy infrastructure, modernising their cloud estate, improving observability, and continuing to build out their platform engineering capability. So if you enjoy being part of real change rather than just keeping the lights … collaboration too, you'll be working closely with Dev, QA and Product, helping teams release software reliably while also pushing forward things like monitoring, observability and overall platform resilience. Tech-wise? It's an Azure-first setup, but they're open to people who've worked across ...

Software Engineering Manager Forecasting & Rostering, Capacity Management, Platform Enablement and APIM

Hiring Organisation
Centrica - CHP
Location
Windsor, Berkshire, South East, United Kingdom
Employment Type
Permanent
best practice, reduce duplication, and promote maintainable, secure and performant systems. Enhance delivery capability through platform reliability and DevOps maturity - Continuously improve deployment pipelines, observability, alerting, incident handling, recovery procedures and operational readiness across Field Ops engineering teams. Manage stakeholders and ensure transparent communications - Build strong relationships across product, operations … decisions Funding for technical enablers Field Ops workflow design and data requirements Use of Data/Insight/Automation Uses engineering metrics, performance insights, observability data and AI-assisted diagnostics to guide decisions. Ensures human judgement remains central. Constraints Centrica architectural principles, engineering guardrails, data privacy/security policies ...