51 to 75 of 76 Observability Jobs in the North West

Founding AI Engineer

Hiring Organisation
Adria Solutions
Location
Manchester, North West, United Kingdom
Employment Type
Permanent
world environments Experience with several of: LLM applications Agentic AI frameworks Multi-agent systems RAG architectures Vector databases AI orchestration platforms AI evaluation and observability tools Model optimisation and fine-tuning Mindset We're particularly interested in individuals who: Thrive in startup environments Enjoy building from zero to one Think ...

Software Engineer, Full-Stack Applications - Ratings Technology

Hiring Organisation
Fitch Group
Location
Manchester, United Kingdom
Employment Type
Full Time
Apache Airflow for workflow management, or Streamlit for building interactive data applications • Advanced Data Management – Strong SQL design, query optimization, and database architecture expertise • Observability – Experience with observability patterns and tools like Datadog, distributed tracing, monitoring, and logging best practices • DevOps and Infrastructure – Familiarity with ArgoCD for GitOps and Security ...

DevOps Engineer

Hiring Organisation
Eligo Recruitment Ltd
Location
Manchester, Stockport, United Kingdom
Employment Type
Permanent
Salary
£70,000 - £80,000/annum
Building and maintaining Infrastructure as Code using Terraform Automating infrastructure provisioning and deployment pipelines Managing Kubernetes and containerised workloads Implementing monitoring, logging and observability solutions Driving platform reliability, security and best practices Collaborating with engineering teams to improve developer experience Skills & Experience Essential: Strong commercial experience with Google Cloud Platform … container technologies Experience with Linux and scripting (Bash, Python or Go) Understanding of networking, IAM and cloud security principles Experience with monitoring and observability tooling Desirable: Experience with GitOps practices Knowledge of Prometheus, Grafana or similar tools Experience in a platform engineering or SRE environment Certifications in GCP are advantageous ...

Site Reliability Engineer (DV Security Clearance)

Hiring Organisation
CGI
Location
Manchester, United Kingdom
Employment Type
Full Time
Engineer (SRE) to join a high-performing team supporting multiple data product and platform groups. This role is focused on improving the reliability, scalability, observability, deployment, and operational support of critical data-driven platforms and services operating within complex production environments. The successful candidate will work closely with engineering, platform … services across cloud and containerised environments. - Manage and support Kubernetes clusters and Helm-based deployments across multiple environments. - Enhance monitoring, alerting, logging, and observability solutions to improve operational visibility and system reliability. - Investigate incidents, analyse logs, identify root causes, and drive timely resolution of production issues. - Participate in incident response ...

Software Engineer

Hiring Organisation
Moorepay
Location
Manchester, Lancashire, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
documentation for services, features, and reusable components. Cloud-Native Engineering & DevOps Practices: Deploy and maintain services using CI/CD pipelines. Instrument code for observability, logging, and performance insights. Participate in incident resolution and root-cause analysis for issues within the squad’s domain. Follow best practices for cloud development … Agents. Experience with cloud services such as AWS, Azure, or serverless platforms. Interest in distributed systems, event-driven architectures, or DDD concepts. Familiarity with observability tooling and debugging complex systems. ...

Head of ML and AI

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
powered and generative experiences Setting technical direction across modelling, evaluation and ML lifecycle management Partnering closely with Product, Engineering and MLOps teams Improving scalability, observability and speed-to-production across ML systems Leading and developing a high-performing team of ML specialists and leaders Acting as the senior ...

Senior Network Engineer

Hiring Organisation
Employment International
Location
Cheshire East, Cheshire, UK
segmentation. Experience with network security assessments. Experience with Cisco, Fortinet, Palo Alto, Aruba, Meraki or similar network/security platforms. Experience with monitoring and observability tooling such as Datadog, SolarWinds, PRTG, SNMP, syslog or NetFlow/IPFIX. Experience supporting office moves, network migrations or infrastructure transformation projects. Key Skills Network ...

Senior Network Engineer

Hiring Organisation
Employment International
Location
Cheshire East, England, United Kingdom
segmentation. Experience with network security assessments. Experience with Cisco, Fortinet, Palo Alto, Aruba, Meraki or similar network/security platforms. Experience with monitoring and observability tooling such as Datadog, SolarWinds, PRTG, SNMP, syslog or NetFlow/IPFIX. Experience supporting office moves, network migrations or infrastructure transformation projects. Key Skills Network ...

Senior DBA

Hiring Organisation
Morson Edge
Location
Manchester, North West, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£75,000
deployment and ongoing maintenance Support Infrastructure as Code and configuration management using tools such as Ansible and Terraform Collaborate with engineering teams to improve observability, resilience and operational efficiency Provide technical guidance and mentorship to junior team members Participate in incident management, root cause analysis and continuous improvement activities Contribute … Code tools, including Ansible or Terraform Strong troubleshooting and problem-solving skills across database and system layers Familiarity with CI/CD pipelines and observability tooling Desirables Experience with cloud-managed database services Knowledge of schema migration tools Understanding of disaster recovery, retention and data protection strategies Experience with monitoring ...

Observability and Automation Architect

Hiring Organisation
Capgemini
Location
Manchester, United Kingdom
Employment Type
Full Time
recruiter directly. About the job you're considering To support the continued growth of our Cloud and Infrastructure Services business, Capgemini is expanding its observability and automation capability. We are seeking a Technical Architect to design, govern, and assure best‐practice client solutions for infrastructure observability and automation across complex … will be unable to work at home 100% of the time. Your role This role includes the following responsibilities: Design end‐to‐end observability architectures (metrics, logs, traces, alerting, dashboards) using platforms like Splunk Architect infrastructure automation and configuration management solutions aligned to IaC and DevOps (e.g. Ansible) Define ...

Vice President, Full-Stack Engineer

Hiring Organisation
BNY
Location
Manchester, North West, United Kingdom
Employment Type
Permanent
engineering teams; set clear objectives, coach talent, and foster succession planning. Own end-to-end delivery for critical software: requirements, architecture, implementation, testing, deployment, observability, and reliability. Raise engineering excellence and resilience: best practices and automation across code, testing, microservices/APIs, performance, and infrastructure; secure-by-design with threat … scalable, observable, testable systems; strong API design. Strong DevOps practices: CI/CD (e.g., GitLab), automated testing (JUnit/Spock), code reviews, telemetry/observability (Splunk, AppDynamics), containers (Docker), and cloud. Hands-on AI development using modern tools and IDEs (e.g., Windsurf) and experience integrating AI into product workflows. Excellent ...

Vice President, Full-Stack Engineer

Hiring Organisation
BNY
Location
Manchester, Lancashire, United Kingdom
engineering teams; set clear objectives, coach talent, and foster succession planning. Own end-to-end delivery for critical software: requirements, architecture, implementation, testing, deployment, observability, and reliability. Raise engineering excellence and resilience: best practices and automation across code, testing, microservices/APIs, performance, and infrastructure; secure-by-design with threat … scalable, observable, testable systems; strong API design. Strong DevOps practices: CI/CD (e.g., GitLab), automated testing (JUnit/Spock), code reviews, telemetry/observability (Splunk, AppDynamics), containers (Docker), and cloud. Hands-on AI development using modern tools and IDEs (e.g., Windsurf) and experience integrating AI into product workflows. Excellent ...

Senior Vice President, DevOps Production Services

Hiring Organisation
BNY
Location
Manchester, Lancashire, United Kingdom
enterprise applications and ensure platform stability, resiliency, and availability. Monitor application health, system performance, batch jobs, interfaces, and alerts using enterprise monitoring and observability tools. Investigate, troubleshoot, and resolve production incidents within defined SLAs. Perform root cause analysis (RCA) for recurring issues and drive permanent fixes. Analyze production logs, identify … Cloud experience preferred. Knowledge of automation/scripting using Python, Shell, or PowerShell. Exposure to DevOps/SRE practices, CI/CD pipelines, and observability tooling. Strong communication skills with the ability to provide concise incident and executive status updates. At BNY, our culture allows us to run our company ...

Senior / Principal DevOps Engineer

Hiring Organisation
Hays Specialist Recruitment Limited
Location
Bury, Lancashire, England, United Kingdom
Employment Type
Contractor
Contract Rate
£700 - £800 per day
best practices across engineering teams and onboard products onto shared platforms. Build and maintain secure, scalable, and high-performing cloud infrastructure in AWS. Implement observability, monitoring, and operational insights across multiple environments. Improve deployment processes, reduce friction, and enable self-service capabilities for development teams. Support cloud and infrastructure incident … focus on automation. Experience with containerisation and workload orchestration technologies. Scripting and programming experience with tools such as Python and Bash. Strong understanding of observability, reliability, and operational best practices. Knowledge of information security principles and experience embedding security throughout the software delivery lifecycle. If you're interested in this ...

Senior / Principal DevOps Engineer

Hiring Organisation
Hays Technology
Location
Bury, Greater Manchester, United Kingdom
Employment Type
Contract
Contract Rate
£700 - £800/day (depending on level)
best practices across engineering teams and onboard products onto shared platforms. Build and maintain secure, scalable, and high-performing cloud infrastructure in AWS. Implement observability, monitoring, and operational insights across multiple environments. Improve deployment processes, reduce friction, and enable self-service capabilities for development teams. Support cloud and infrastructure incident … focus on automation. Experience with containerisation and workload orchestration technologies. Scripting and programming experience with tools such as Python and Bash. Strong understanding of observability, reliability, and operational best practices. Knowledge of information security principles and experience embedding security throughout the software delivery lifecycle. If you're interested in this ...

Senior / Principal DevOps Engineer

Hiring Organisation
Hays Technology
Location
Ramsbottom, Lancashire, United Kingdom
Employment Type
Contract
Contract Rate
£700 - £800 per day
best practices across engineering teams and onboard products onto shared platforms. Build and maintain secure, scalable, and high-performing cloud infrastructure in AWS. Implement observability, monitoring, and operational insights across multiple environments. Improve deployment processes, reduce friction, and enable self-service capabilities for development teams. Support cloud and infrastructure incident … focus on automation. Experience with containerisation and workload orchestration technologies. Scripting and programming experience with tools such as Python and Bash. Strong understanding of observability, reliability, and operational best practices. Knowledge of information security principles and experience embedding security throughout the software delivery lifecycle. If you're interested in this ...

SRE Managing Consultant - Cloud Operating Model

Hiring Organisation
Capgemini
Location
Manchester, United Kingdom
Employment Type
Full Time
Budgets: Establish service measures and targets (SLIs/SLOs) and introduce Error Budgets to enable data-driven trade-offs between reliability and delivery velocity. Observability & Operational Insight: Shape observability approaches (metrics/logs/traces) and operational monitoring models that make reliability risks visible and actionable, improving operational decision-making. … large‐scale delivery contexts; associate‐level certifications are desirable but not mandatory. Design, establish, and evolve SRE‐led centres of excellence (e.g. Reliability, Observability, or Operational Excellence), setting enterprise‐level standards for SLIs/SLOs, incident management, observability, and continuous improvement across cloud and hybrid platforms. Exposure to modern observability ...

DevOps Engineer

Hiring Organisation
Oscar Associates (UK) Limited
Location
Manchester, North West, United Kingdom
Employment Type
Permanent
Salary
£70,000
scalable, reliable and cost-efficient as it moves into full production. Working closely with engineering teams, you'll drive automation, improve deployment pipelines, strengthen observability and ensure the platform performs under high-volume, real-time workloads. This is a hands-on position with genuine ownership and plenty of opportunity … enhancing CI/CD pipelines with blue/green deployments and automated rollback Driving platform reliability, resilience and scalability Developing monitoring, alerting and observability across the environment Managing cloud costs and implementing best FinOps practices Participating in a small production on-call rota Technology AWS ECS Fargate Terraform Aurora ...

Vice President, DevOps Production Services

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
enterprise applications and ensure platform stability, resiliency, and availability. Monitor application health, system performance, batch jobs, interfaces, and alerts using enterprise monitoring and observability tools. Investigate, troubleshoot, and resolve production incidents within defined SLAs. Perform root cause analysis (RCA) for recurring issues and drive permanent fixes. Analyze production logs, identify … Cloud experience preferred. Knowledge of automation/scripting using Python, Shell, or PowerShell. Exposure to DevOps/SRE practices, CI/CD pipelines, and observability tooling. Strong communication skills with the ability to provide concise incident and executive status updates. ...

Vice President, Production Services Application Support

Hiring Organisation
BNY
Location
Manchester, North West, United Kingdom
Employment Type
Permanent
urgency, recover priority incidents under pressure, and maintain core support coverage across on-site and offshore support hours. Use SQL scripting, automation, monitoring, and observability tools to improve operational resilience, service health, reliability, and incident response. To be successful in this role, we're seeking the following: Excellent SQL scripting skills. … solutions for alert correlation, anomaly detection, predictive monitoring, and service optimisation. Strong understanding of Site Reliability Engineering (SRE) principles, including service health, reliability, availability, observability, incident reduction, and continuous service improvement. Experience with SRE practices such as monitoring and alert tuning, incident management, post-incident reviews, root cause analysis ...

Vice President, Production Services Application Support

Hiring Organisation
BNY
Location
Manchester, Lancashire, United Kingdom
urgency, recover priority incidents under pressure, and maintain core support coverage across on-site and offshore support hours. Use SQL scripting, automation, monitoring, and observability tools to improve operational resilience, service health, reliability, and incident response. To be successful in this role, we're seeking the following: Excellent SQL scripting skills. … solutions for alert correlation, anomaly detection, predictive monitoring, and service optimisation. Strong understanding of Site Reliability Engineering (SRE) principles, including service health, reliability, availability, observability, incident reduction, and continuous service improvement. Experience with SRE practices such as monitoring and alert tuning, incident management, post-incident reviews, root cause analysis ...

Vice President, Production Services Application Support

Hiring Organisation
BNY
Location
Manchester, North West England, United Kingdom
urgency, recover priority incidents under pressure, and maintain core support coverage across on-site and offshore support hours. Use SQL scripting, automation, monitoring, and observability tools to improve operational resilience, service health, reliability, and incident response. To be successful in this role, we’re seeking the following: Excellent SQL scripting … solutions for alert correlation, anomaly detection, predictive monitoring, and service optimisation. Strong understanding of Site Reliability Engineering (SRE) principles, including service health, reliability, availability, observability, incident reduction, and continuous service improvement. Experience with SRE practices such as monitoring and alert tuning, incident management, post-incident reviews, root cause analysis ...

Network Monitoring & Observability Architect

Hiring Organisation
Pontoon
Location
Chester, Cheshire, United Kingdom
Employment Type
Contract
Join Our Team as a Network Monitoring & Observability Architect! Contract Length: 12 months Location: Chester Working Pattern: 3 days per week in the office, Via Umbrella Company Are you ready to take your skills to the next level? We're looking for a talented Monitoring Architect to join our dynamic ...

Senior AI Engineer

Hiring Organisation
Adria Solutions Ltd
Location
Manchester, United Kingdom
Employment Type
Permanent
Salary
£75,000 - £110,000/annum
solutions securely within enterprise environments. Ensure solutions leverage Private Endpoints, secure networking, identity management, and enterprise-grade governance controls. Establish monitoring, evaluation, and observability frameworks for AI systems, including hallucination detection, model drift monitoring, performance tracking, and cost optimisation. Partner with operational and commercial stakeholders to identify high-value … evaluation. Experience applying Data Science methodologies to solve complex business problems and identify opportunities for AI adoption. Experience with GenAIOps, LLMOps, MLOps, and AI observability platforms. Exposure to Computer Vision, OCR, Voice AI, Conversational AI, or multimodal AI solutions. Experience working within operational, retail, automotive, logistics, or customer-centric organisations. ...

AI Engineer

Hiring Organisation
Hyre AI Limited
Location
Paddington, Warrington, United Kingdom
Employment Type
Permanent
Salary
GBP 60,000 - 80,000 Annual
tool-calling patterns Extend the MCP server with new tools and capabilities Enforce structured outputs and validation across LLM boundaries 2. LLM Quality, Evals & Observability Build the layer that lets the team ship LLM features with confidence. You will: Design and grow the eval platform - golden datasets, regression suites … judge Integrate observability and tracing across providers and prompt versions Track cost, latency, and quality per prompt, model, and client Build guardrails for prompt injection, PII, and output safety Drive prompt engineering practice - versioning, A/B testing, platform overlays 3. Cloud & Data Infrastructure Own the cloud substrate that runs ...