326 to 350 of 1,080 Observability Jobs in the UK

Associate Director, Data Science/Gen AI Lead - ER&I

Hiring Organisation
Deloitte LLP
Location
Birmingham, England, United Kingdom
/GenAI governance & ethics (bias detection, explainability). GenAI Platform & Infrastructure Architecture (Cloud, Lakehouse). GenAI ModelOps & Performance Monitoring. AI-driven business intelligence & reporting. Observability & FinOps for AI/GenAI. Cloud Infrastructure, Networking, & Security for AI. Aligning GenAI Architectures Across Organizations: Experience aligning GenAI architecture blueprints across business units ...

Associate Director, Data Science/Gen AI Lead - ER&I

Hiring Organisation
Deloitte LLP
Location
Belfast, Northern Ireland, United Kingdom
/GenAI governance & ethics (bias detection, explainability). GenAI Platform & Infrastructure Architecture (Cloud, Lakehouse). GenAI ModelOps & Performance Monitoring. AI-driven business intelligence & reporting. Observability & FinOps for AI/GenAI. Cloud Infrastructure, Networking, & Security for AI. Aligning GenAI Architectures Across Organizations: Experience aligning GenAI architecture blueprints across business units ...

Head of Solutions Architecture - EMEA

Hiring Organisation
ClickHouse
Location
London, England, United Kingdom
customers and ARR that has more than quadrupled over the past year, ClickHouse leads the market in real-time analytics, data warehousing, observability, and AI workloads. ClickHouse’s incredible momentum was confirmed in its recent $350M Series C financing that included new, tier one investors, Khosla Ventures, BOND, IVP, Battery ...

Manager, Software Engineering (Data)

Hiring Organisation
Firstup
Location
London, England, United Kingdom
call rotations, incident response, and post-incident reviews in a “you build it, you run it” environment. Lead operational excellence initiatives to improve observability, resiliency, automation, and alignment with defined SLOs and enterprise SLAs. Minimum Qualifications Bachelor’s Degree in Computer Science, Information Technology or a related field of study ...

Software Engineer in Test (Mobile app)

Hiring Organisation
Motability Operations
Location
Edinburgh, Midlothian, Scotland, United Kingdom
Employment Type
Permanent, Part Time, Work From Home
scripts for mobile using industry-standard testing tools and frameworks to support delivery. These scripts will validate software functionality, performance, and security. Ensuring the observability and reliability of the product is another key responsibility. You will utilize tools like Dynatrace, Splunk, and OpsGenie to achieve this. This role requires ...

Machine Learning Engineer - Python, SQL, NoSQL, and Vector databases

Hiring Organisation
FactSet
Location
London, England, United Kingdom
deployment of applications. Act as a mentor to team members, promoting a culture of innovation and continuous learning within the team. Implement robust observability and tracing across data enrichment pipelines to ensure end-to-end transparency to product and engineering stakeholders, enabling proactive monitoring and alerting, and rapid identification ...

Senior Software Engineer - Croydon, England, United Kingdom; Manchester, England, United Kingdom

Hiring Organisation
Jane's Group
Location
Croydon, England, United Kingdom
coaching skillsStrong problem solving and communication skillsStrong understanding of SDLCExpertise with cloud technologies especially AWSGood experience delivering solutions and impact in agile environmentsGood with Observability, Monitoring and Serverless technologyExperience providing data for consumption via APIExperience and strong understanding of API First principlesOur Mission:Creating trusted open-source intelligence has always ...

Senior Software Engineer - Croydon, England, United Kingdom; Manchester, England, United Kingdom

Hiring Organisation
Jane's Group
Location
Manchester, England, United Kingdom
coaching skillsStrong problem solving and communication skillsStrong understanding of SDLCExpertise with cloud technologies especially AWSGood experience delivering solutions and impact in agile environmentsGood with Observability, Monitoring and Serverless technologyExperience providing data for consumption via APIExperience and strong understanding of API First principlesOur Mission:Creating trusted open-source intelligence has always ...

Site Reliability Engineer / SRE / Systems Engineer

Hiring Organisation
AWD Online
Location
Manchester, North West, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£70,000
effective incident management across live environments. This Site Reliability Engineer/Systems Engineer role offers the chance to work with modern cloud technologies, containerisation, observability tools and automation practices, while influencing long-term reliability improvements across business-critical systems. APPLY TODAY Ready to make your next career move? Apply … live production issues through to resolution or handover System Monitoring and Availability: Maintaining high availability, performance and scalability of production platforms and services Observability Implementation: Managing logging, monitoring, alerting and metrics to proactively identify and resolve issues Reliability Improvements: Collaborating with development teams to translate operational insights into long-term ...

Senior Product Manager for AI Observability

Hiring Organisation
London Stock Exchange Group
Location
London, England, United Kingdom
**Role Profile** This role is responsible for defining how we collect, structure, analyse and act on AI-specific telemetry signals—including prompt patterns, model performance, MCP call usage, latency, error conditions, cost metrics, user behaviour ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
Edinburgh, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
Leeds, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
Belfast, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
Cardiff, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
Leicester, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
Sheffield, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
Swindon, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
Nottingham, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
Dartford, Kent, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
South London, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
Watford, Hertfordshire, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
Ipswich, Suffolk, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
Derby, Derbyshire, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
Reading, Berkshire, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...

Senior Platform Engineer

Hiring Organisation
Akixi
Location
Lincoln, Lincolnshire, UK
Employment Type
Full-time
inventory structuring, and role-based automation. Manage secrets securely using services such as AWS Secrets Manager or HashiCorp Vault. Implement robust monitoring, alerting, and observability tooling (e.g., CloudWatch, Prometheus, Grafana, Datadog). Participate in incident response, root cause analysis, and resilience improvements. Maintain and evolve CI/CD pipelines using … container orchestration and deployment (Docker, ECS, or Kubernetes). Proficient with GitOps or IaC-based workflows. Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence. Understanding of systems reliability metrics and associated tooling Soft Skills & Behaviours Self-driven with a bias toward action and ownership. Excellent communicator ...