326 to 350 of 547 Remote/Hybrid Observability Jobs

Forward Deployed Engineer

Hiring Organisation
Tact
Location
United Kingdom
longer the bottleneck. The models are good enough. What stops AI agents reaching production is everything around the model: the architecture, the deployment, the observability, and getting an organisation to trust the system enough to let it run. Closing that gap is the job. What you'll do Embed with ...

Principal Wintel Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Server, SharePoint 2019 VMware: VMware Cloud Foundation vSphere, ESXi, NSXT, and vSAN Endpoint & Configuration: Windows 10 & 11 Microsoft Endpoint Configuration Manager (MECM) Monitoring & Observability: Microsoft System Centre Operations Manager (SCOM) PKI Technologies: Microsoft Certificate Services, Hardware Security Modules (HSMs), and lifecycle key management Security Clearance This role is subject ...

Enterprise Solutions Architect

Hiring Organisation
Jobleads-UK
Location
Oxford, England, United Kingdom
Success Factors Strong integration design capability: Domain Driven Design, event‐based integration, API design principles, resilience patterns, operational considerations (SLAs, observability, incident readiness) Excellent stakeholder management and communication: can influence at exec level, simplify complex trade‐offs, and align diverse teams behind common patterns and outcomes Desirable attributes: TOGAF certification ...

Platform Engineer

Hiring Organisation
itecopeople
Location
London, United Kingdom
Employment Type
Permanent
Salary
£54000 - £60900/annum
large-scale enterprise environment. An exciting opportunity working on a greenfield Kubernetes platform built using modern engineering practices across Azure, GitOps, service mesh, observability and event-driven architecture. The Role You will be responsible for building, operating and improving a shared Kubernetes platform used by application, AI and integration engineering … teams. Hands-on role covering infrastructure as code, Kubernetes operations, CI/CD, networking, observability and platform reliability. Working closely with architects and engineering teams shaping the future of the platform while helping maintain high standards across automation, security, scalability and operational excellence. Key Responsibilities Build and operate Azure Kubernetes ...

Platform Engineer

Hiring Organisation
itecopeople
Location
London, England, United Kingdom
enterprise environment. This is an exciting opportunity to work on a greenfield Kubernetes platform built using modern engineering practices across Azure, GitOps, service mesh, observability and event-driven architecture. The Role As Platform Engineer, you will be responsible for building, operating and improving a shared Kubernetes platform used by application … integration engineering teams. This is a hands-on role covering infrastructure as code, Kubernetes operations, CI/CD, networking, observability and platform reliability. You'll work closely with architects and engineering teams to shape the future of the platform while helping maintain high standards across automation, security, scalability and operational ...

Lead Devops Engineer

Hiring Organisation
Venquis
Location
Essex, England, United Kingdom
engineers Drive infrastructure automation and Infrastructure as Code Manage and improve CI/CD pipelines Own cloud infrastructure and platform reliability Improve scalability, monitoring, observability and security Work closely with software engineering and architecture teams Influence DevOps strategy and technical direction Technical Environment AWS and/or Azure Kubernetes Terraform … Docker CI/CD pipelines Linux Monitoring & observability tooling Infrastructure as Code Experience Required Proven experience in a DevOps, Platform Engineering or SRE leadership role Strong cloud infrastructure background Experience leading or mentoring technical teams Excellent automation and CI/CD knowledge Strong Kubernetes and Terraform experience preferred Ability ...

Go Full Stack Developer

Hiring Organisation
itecopeople
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£60,000
event-driven services Contribute to CI/CD pipelines and cloud-native deployments Review code and champion engineering best practices Improve application performance, observability and reliability Collaborate within Agile delivery teams across multiple projects Support technical decision-making and continuous improvement Skills & Experience We are looking for candidates with strong … reviews, testing and engineering governance Experience with any of the following would be highly advantageous: Microsoft Azure Python GitOps tooling (Argo CD/Flux) Observability tooling (Prometheus, Grafana, OpenTelemetry) AI/LLM-enabled applications Event-driven architectures and messaging platforms What's on Offer Opportunity to work on cutting-edge ...

Principal Artificial Intelligence (AI) Platform Engineer/Architect

Hiring Organisation
WTW
Location
Greater London, United Kingdom
Employment Type
Full Time
engagement—building credibility and driving adoption across the organization Provide escalation pathways for architecture questions and unblock teams on complex integration challenges Implement monitoring, observability, and governance systems that provide transparency without creating bottlenecks Collaborate with security, compliance, and data teams to embed safety guardrails into platform capabilities Participate … experience) Proven ability to design systems that abstract complexity and enable teams to self-serve at scale Strong software engineering fundamentals (system design, testing, observability, operational excellence, SDLC practices) Experience building or maintaining developer-facing platforms, SDKs, or internal tools Comfortable articulating technical architecture, vision, and strategy to both technical ...

ML Infrastructure Lead

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
versioning, reproducibility, experimentation, feature management and release management Own and improve the production environment for machine learning systems, ensuring strong standards for availability, performance, observability and resilience Define and implement monitoring across model and platform layers, including system health, data quality, drift, latency, throughput and cost efficiency Build or optimise … pipelines, infrastructure-as-code and workflow orchestration Experience with tools such as Airflow or similar platform and orchestration technologies Good understanding of model observability, data quality, feature pipelines, lineage and reproducibility Experience designing scalable infrastructure for ML workloads, including training, batch inference and real-time serving Strong appreciation of reliability ...

SRE Engineer: High-Availability, Hybrid – London

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
approach. The candidate will work with Product Engineering teams to manage high availability and uptime, utilizing a code-first approach. Responsibilities include implementing observability standards, managing incidents, and optimizing platform costs. Offering a competitive salary of £85,000 - £90,000, the role is ideal for those passionate about reliability ...

Site Reliability Engineer

Hiring Organisation
WTW
Location
Cambridgeshire, United Kingdom
Employment Type
Full Time
their technology. This role will have the opportunity to help the team and product deal with exciting, complex and large-scale client propositions where observability will be essential and help transform how the product is designed and deployed. You will join a cross-team guild of Site Reliability Engineers, which … enables you to not only influence direction within your product family, but to also help shape how we handle observability and monitoring across ICT. This role is open to flexible and hybrid working arrangements, with presence in the Cambridge office a minimum of two days per week. The Role Collaborate ...

BDR Language Speaker

Hiring Organisation
Pareto
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£30,000 - £35,000 per annum
must speak Filipino fluently to qualify for this role* Our client is a global data platform that helps turn data into action for Observability, IT, Security and more. Leaders in their field, our client is growing at an exciting rate and as such are now looking for new bi-lingual ...

Principal Architect

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
cross‐team delivery of strategic outcomes; fewer blockers; consistent adherence to standards Developer experience and reliability Partner with Platform to uplift CI/CD, observability, SLOs and incident learning; advocate for fitness functions and paved roads Improved flow and stability metrics; faster, safer releases; measurable DX improvements People leadership Mentor ...

Python Engineer - up to £60,000 + Bonus - Hybrid

Hiring Organisation
Involved Solutions
Location
Ireland
Employment Type
Full-Time
Salary
£50,000 - £60,000 per annum
driving engineering best practice across the software delivery lifecycle. The Python Engineer role is suited to an engineer who enjoys clean coding principles, automation, observability and modern DevOps practices. Responsibilities for the Python Engineer: Design, develop, test and maintain backend services and microservices Build and enhance RESTful APIs aligned … ensure code quality and reliability Containerise applications using Docker and support CI/CD deployment pipelines Implement logging, monitoring and metrics to improve platform observability Collaborate with QA, DevOps and architecture teams across delivery initiatives Troubleshoot and resolve production and application issues Contribute towards continuous improvement of engineering standards ...

Site Reliability Engineer (SRE)

Hiring Organisation
Pertemps Reading
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£45,000
platform automation, CI/CD, and developer tooling. This is a hands-on role split between supporting engineers and building scalable infrastructure, automation, and observability solutions. Youll work closely with the Head of Technology and engineering teams to improve reliability, developer experience, and platform performance. What Youll Be Doing Developer … Build reusable Terraform modules and manage infrastructure-as-code standards Develop internal tooling, automation scripts, self-service tooling, and platform improvements Own and improve observability across monitoring, dashboards, alerting, and runbooks Identify opportunities to automate manual processes and improve platform reliability Contribute to scalable, maintainable, and secure infrastructure practices What ...

Senior DevOps Engineer JBLE1 NI

Hiring Organisation
MCS Group
Location
Belfast, UK
Driving the migration of legacy systems into cloud-native architectures Developing and maintaining infrastructure-as-code using Terraform or comparable tooling Improving system reliability, observability, and cost-efficiency across environments Modernising CI/CD practices to enable safe, rapid delivery across multiple environments Supporting incident response and improving operational visibility … Strong scripting skills in Bash, Python, Go, or similar Useful but not essential: Experience with Ansible, Cloudflare, PostgreSQL, or MySQL in production Familiarity with observability platforms such as Datadog, Splunk, or ELK Exposure to multi-tenant SaaS architectures Experience leading or contributing to cloud migration projects The details: Location: Belfast ...

Cloud Native DevOps Engineer

Hiring Organisation
Anson McCade
Location
England, United Kingdom
scalability are all core design considerations. The brief will suit someone comfortable operating across infrastructure engineering, platform automation, CI/CD, container platforms, and observability, while working closely with technical and non-technical stakeholders in Agile delivery settings. Employer Overview The employer is a major global technology and transformation organisation … Implement and optimise CI/CD pipelines to support secure, reliable, continuous delivery for critical applications - Monitor system health, performance, and security using modern observability and logging tooling - Work in Agile delivery teams, engaging stakeholders to translate requirements into iterative platform and infrastructure improvements Candidate Profile/Technical Skillset - Proven ...

Senior Platform Engineer

Hiring Organisation
AJ Bell
Location
Salford, Lancashire, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
evolving our core engineering platforms, including: Backstage and internal developer portal capabilities Engineering data platforms, including ELT workflows, DBT and SQL-based data pipelines Observability and monitoring Grafana platforms Internal automation and workflow platforms that support software delivery and engineering operations You’ll also contribute to broader platform engineering initiatives … Strong understanding of cloud platforms, containerisation and infrastructure as code Experience building self-service tooling, templates and developer enablement capabilities Experience with monitoring and observability Good understanding of security best practices in software delivery and platform design Strong problem-solving, communication and collaboration skills Ability to provide technical leadership, mentor ...

Platform Engineer

Hiring Organisation
hireful
Location
London, United Kingdom
Employment Type
Permanent
Salary
£80000 - £85000/annum £80,000 - £85,000 + 10% Bonus + Bene
We are recruiting founding Platform Engineers on behalf of a fast-growing enterprise level (global, 500+ staff) software business with a strong engineering culture and a genuine commitment to doing things the right way. They ...

Linux Support Engineer - Nvidia/GPU workload experience essential

Hiring Organisation
Swisstech Recruitment
Location
United Kingdom
Employment Type
Contract
Contract Rate
GBP 350 - 500 Daily
Data Centre, Infrastructure Support, RMAs, Platform upgrades etc.) - Provide technical expertise and contribute to the build out and configuration of our internal observability platform. - Create and improve documentation around key operational activities. - Identify and drive improvements in performance, stability, and security. The successful candidate must have experience running Nvidia/ ...

Principal Engineer (Post-Purchase)

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Tulip apps), fulfilment, shipping and CS tooling. Scale, reliability and delivery: Lead cross‐team initiatives that increase throughput and reduce cost‐to‐serve. Improve observability and operability across the flow from “buy” to “delivered,” reducing WISMO and manual interventions. Data and tooling coherence: Assist in enabling a 360° order view ...

Enterprise Network Architect

Hiring Organisation
Jobleads-UK
Location
Bournemouth, England, United Kingdom
tools.Deep understanding of security frameworks, firewalls, endpoint protection, and SIEM tools.Strong knowledge of data management platforms, databases, data lakes, Fabric and ETL processes.Experience with observability tools and practices, including monitoring, logging, tracing, and metrics collection using platforms such as ELK stack, Grafana, Solarwinds & Azure Monitor.Ability to design and implement observability ...

DevOps Engineer

Hiring Organisation
Reed
Location
County Durham, England, United Kingdom
Employment Type
Full-Time
Salary
£45,000 - £60,000 per annum, Inc benefits
pipelines using Azure DevOps Supporting monitoring, reliability, and operational readiness Working alongside engineers to embed better DevOps and platform practices Contributing to security, observability, and continuity planning What they’re looking for Proven experience in an Azure-focused DevOps or platform engineering role Hands-on Terraform experience used in live … essential) DevSecOps exposure Cloud cost management/FinOps awareness Understanding of .NET/C# based platforms Scripting with PowerShell, Bash or Python Experience with observability and monitoring tools Interest in using AI tools to improve engineering productivity Working setup & culture Hybrid working with a flexible, trust-based approach Supportive, inclusive ...

SRE DevOps Engineer

Hiring Organisation
WTW
Location
Surrey, United Kingdom
Employment Type
Full Time
product team to develop and support operationally resilient cloud infrastructure. The ideal candidate will have a track record in Microsoft Azure and Observability platforms in complex SaaS environments and have excellent communication skills. You will be joining our growing engineering organization building a wide range of market-leading InsurTech solutions … with focus on high cadence and cost effectiveness Implement infrastructure as code Support the team in infrastructure and networking related issues Maintain and configure observability platforms such as Datadog Proactively monitor production and other environments to ensure stability, availability, security and integrity Participate in incident response, troubleshooting, and root cause ...

Principal Site Reliability Engineer

Hiring Organisation
F5 consultants
Location
Reading, Berkshire, South East, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£95,000
improve platform reliability across complex Kubernetes and OpenShift environments. You'll work within a modern cloud-native environment leveraging Kubernetes, OpenShift, GitOps, service mesh, observability tooling, and automation-first engineering practices. This is a technically hands-on role where you'll take a leading voice in platform stability, mentor others … Kubernetes and OpenShift (non-negotiable) Experience working in complex multi-cloud or hybrid environments Proficiency in service mesh technologies such as Istio Experience with observability stacks including Prometheus, Grafana, Loki, and Tempo Strong Infrastructure as Code experience using Kustomize or Helm, with scripting skills in Bash and/or Python ...