651 to 675 of 1,265 Observability Jobs

Azure SRE Engineer

Hiring Organisation
Oscar Associates (UK) Limited
Location
Glasgow, Lanarkshire, United Kingdom
Employment Type
Permanent
Salary
GBP 575 - 625 Daily
Contract. We're looking for two experienced Azure Site Reliability Engineers to join a major Financial Services programme focused on platform health, reliability, and observability across a large-scale Azure environment ...

Infrastructure & DevOps Engineer (m/f/d)

Hiring Organisation
iVentureGroup GmbH
Location
Hammerbrook, Hamburg, Germany
Employment Type
Permanent
Salary
EUR Annual
Responsibility for our operational IT operations (24/7), while you simultaneously drive modern platform initiatives forward. Whether Kubernetes clusters, CI/CD pipelines, or observability - you are in your element. ...

GCP SRE for BI Platform — Reliability & Incidents

Hiring Organisation
Jobleads-UK
Location
United Kingdom
experienced Site Reliability Engineer to oversee the health of GCP-hosted APIs and services. This role involves monitoring uptime, leading incident responses, and building observability infrastructures. The ideal candidate has 2+ years in a Site Reliability or DevOps role, practical GCP experience, and a solid grasp of cloud security. Join ...

Lead Platform Engineer – Cloud Native, Kubernetes & Mentorship

Hiring Organisation
Jobleads-UK
Location
United Kingdom
London to manage teams and stakeholders while working with cutting edge technology. This role involves shaping platform strategy, mentoring engineers, and ensuring the observability and reliability of systems. With an annual salary of £80,000 to £100,000, the company promotes professional growth by funding multiple Kubernetes certifications ...

Senior SDET: Architect Modern Testing & Quality Enablement

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
promoting a test-first mindset within our eCommerce Engineering organization. The ideal candidate will be experienced in microservice testing, CI/CD processes, and observability tools. We value diverse perspectives and are committed to creating an inclusive environment. ...

Lead Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
throughput workloads. Full Lifecycle Ownership: A strong "DevSecOps" mindset with expertise in building and maintaining CI/CD pipelines, infrastructure-as-code, and robust observability (monitoring, logging, tracing) for production systems. Quality as a Feature: A deep commitment to quality, demonstrated by implementing comprehensive testing strategies (unit, integration … Interest or prior experience in traditional financial markets, trading systems, or investment platforms. Containerization & Deployment: Proficiency with containerization technologies such as Docker or Kubernetes. Observability: Hands-on experience with modern observability tooling (e.g., Prometheus, DataDog, Jaeger, OpenTelemetry). Data Governance: Experience with data privacy (GDPR/CCPA) and security compliance ...

Software Engineer

Hiring Organisation
Acceler8 Talent
Location
City of London, London, United Kingdom
training efficiency across 1,000+ GPU clusters Improve utilisation, throughput, and reliability across distributed training infrastructure Build tooling for orchestration, monitoring, scheduling, and observability Work closely with research teams to accelerate large-scale model training 🔧 What They’re Looking For Deep GPU infrastructure/distributed systems experience Strong knowledge ...

Senior Software Engineer (£60k + benefits)

Hiring Organisation
Jobleads-UK
Location
Wigan, England, United Kingdom
production. As a Senior Software Engineer you’d play a key role in system design, helping to modernise their existing microservices and improve observability and testability by using modern approaches like hexagonal architecture and agentic behaviour driven design. The money is good too – up to £60k plus benefits including ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater Manchester, England, United Kingdom
Engineer (SRE) to join a high-performing team supporting multiple data product and platform groups. This role is focused on improving the reliability, scalability, observability, deployment, and operational support of critical data-driven platforms and services operating within complex production environments. Responsibilities Work closely with engineering, platform, and operational support ...

Senior Software Engineer (Python)

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£90,000 - £100,000 per annum
cross-service features Make pragmatic architecture and design decisions Own services end-to-end, including performance, reliability, and incident response Set standards for testing, observability, and security Mentor engineers and contribute to strong team practices Collaborate closely with Product, Design, and Data teams What we're looking for: Strong experience ...

AI Engineer - Up To £100,000 P/A - London - Hybrid

Hiring Organisation
Hunter Bond
Location
London Area, United Kingdom
real-time audio processing Optimize LLM performance through fine-tuning, prompt engineering, and evaluation frameworks Architect scalable AI infrastructure, including model serving, monitoring, and observability in production environments ...

AI Engineer

Hiring Organisation
IO Associates
Location
Swindon, Wiltshire, South West, United Kingdom
Employment Type
Permanent
Salary
£750 - £950 per day + Remote
solutions leveraging Azure-native tooling and services Integrate AI systems into existing enterprise data platforms and business applications Optimise performance, reliability, governance, and observability of deployed AI systems Collaborate with architecture, data, and security teams to ensure production readiness If this role is something you'd like to explore further ...

Data Architect

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Leading the design of cloud data platforms (AWS) – including services like Redshift, S3, Glue, Athena Evaluating and introducing new technologies (e.g. Iceberg, Delta Lake, observability tools) Driving data governance, standards, and best practice Acting as a trusted advisor to both technical and business stakeholders What We’re Looking For Proven ...

AI Platform Engineer

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£90,000 - £130,000 per annum
environment Work closely with product and design to ensure AI features are user focused and production ready Help establish best practice around observability, governance and responsible AI development Your Skills and Experience You will be a product minded engineer who enjoys building platforms from the ground up. Strong commercial experience ...

Agentic Quality Platform Lead

Hiring Organisation
Oliver Bernard
Location
London Area, United Kingdom
scaling pipelines Knowledge of AI and how to implement AI tools and processes to reduce manual workloads Prior hands-on experience across monitoring and observability to establish feedback loops and improve platform reliability (e.g. Datadog, Prometheus, etc.) Previous performance testing experience, and knowledge of how to integrate this into engineering ...

Security Architect

Hiring Organisation
FBI &TMT
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£700 - £750 per day
Bedrock, Azure OpenAI, and Microsoft 365, to support teams in adopting these capabilities securely. Provide guidance on model security, deployment patterns, integration architectures, observability, and evaluation techniques across various business use cases. Translate platform-specific risks, limitations, and best practices into actionable engineering patterns and governance controls. Collaborate with ...

Software Engineering Manager - Cilium - Isovalent

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
passionate engineers. The Engineering Manager will play a pivotal role in driving the development of our innovative networking, security and observability solutions for the Cilium project. The ideal candidate will have a strong technical background in software engineering, excellent leadership skills, a proven track record of delivering high-quality software ...

Backend Software Engineer - Python

Hiring Organisation
Talent Locker
Location
London, United Kingdom
Employment Type
Permanent
system design Collaborate with data scientists and ML engineers to deploy models into production Mentor engineers and contribute to engineering best practices Improve observability, monitoring, and incident response processes Write maintainable, well-tested code and contribute to code reviews Requirements 5+ years of experience building and operating backend systems ...

VMware Cloud Foundation (VCF) Consultant

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
design/build/operate experience Strong NSX (T0/T1, DFW, VRFs, EVPN), vSAN (ESA/OSA) Automation with Terraform, Ansible, PowerCLI, APIs Observability with Aria Ops/Ops for Networks/Logs Migration experience with HCX Strong communication, documentation, and stakeholder engagement Preferred Skills Kubernetes on vSphere ...

Data Platform Manager

Hiring Organisation
Harnham
Location
London Area, United Kingdom
data engineering and architecture teams to improve platform design, performance, reliability, and scalability Drive best practice across platform management including access control, cost optimisation, observability, incident management, and operational governance Challenge and guide technical decisions to ensure solutions are robust, reusable, and aligned to long term platform strategy Support ...

Data Quality Lead

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
with business stakeholders to define critical data elements, data definitions, and “fit for use” requirements. Familiarity with data quality tooling and modern orchestration/observability practices. Comfortable building processes from scratch in a newly formed team. Resourceful, motivated self-starter with the ability to collaborate across business and technology Strong ...

Software Architect

Hiring Organisation
Spectrum IT Recruitment
Location
Uxbridge, England, United Kingdom
models and integration patterns Reviewing significant technical changes and guiding solution design Improving platform-wide reliability, security, maintainability and developer productivity Ensuring security, resilience, observability and scalability are built into solution designs Identifying architectural risks, technical debt and platform constraints Working closely with engineering and product leadership on major technical ...

Enterprise Tech Arch Sr Manager

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
security and cost outcomes: Embed zero‐trust IAM, security‐by‐design, HA/DR, and operational controls. Define SLOs/SLIs and bake in observability (metrics, logs, traces). Apply AIOps to reduce noise, accelerate incident triage, and improve reliability, and embed FinOps to manage performance and run‐cost value ...

Director, Solutions Engineering Splunk UKI

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
within the UKI region. Experience across multiple customer segments (Enterprise, Public Sector, Service Provider, Commercial). Strong domain expertise in enterprise software (e.g., Cybersecurity, Observability, Cloud & AI, IT Operations, Application Performance Management, or Big Data). Exceptional communication and articulation skills; ability to translate complex technical ideas into clear business ...

Platform Engineering Director — Hybrid Cloud & On-Prem

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
drive operational excellence across platforms and enterprise applications, including incident and problem management, service restoration, capacity planning, performance optimisation and continual improvement through automation, observability and site reliability engineering practices. You will act as Product Owner for key platforms and services, defining roadmaps, prioritising enhancements and ensuring delivery ...