226 to 250 of 294 Observability Jobs in London

Senior Software Engineer

Hiring Organisation
Oracle
Location
London, UK
Employment Type
Full-time
network security services and related attack/defense patterns. Solid networking knowledge: TCP/IP, IPv4/IPv6, BGP fundamentals; DNS/DHCP understanding. Observability experience (metrics, tracing, alerting) and operational excellence mindset. Preferred qualifications Experience with anycast routing, traffic steering, and multi-region service readiness. Exposure to SDN, programmable … post-incident improvements. Build automation-first workflows: CI/CD pipelines, test frameworks, canary/blue-green releases, and infrastructure-as-code. Create robust observability (metrics, logs, traces) and capacity/scale modeling for high-throughput, high-availability systems. Partner with product, SRE, and network engineering to deliver roadmap features ...

Senior Software Engineer

Hiring Organisation
Oracle
Location
South London, UK
Employment Type
Full-time
network security services and related attack/defense patterns. Solid networking knowledge: TCP/IP, IPv4/IPv6, BGP fundamentals; DNS/DHCP understanding. Observability experience (metrics, tracing, alerting) and operational excellence mindset. Preferred qualifications Experience with anycast routing, traffic steering, and multi-region service readiness. Exposure to SDN, programmable … post-incident improvements. Build automation-first workflows: CI/CD pipelines, test frameworks, canary/blue-green releases, and infrastructure-as-code. Create robust observability (metrics, logs, traces) and capacity/scale modeling for high-throughput, high-availability systems. Partner with product, SRE, and network engineering to deliver roadmap features ...

AI Prompt Engineer

Hiring Organisation
Staffworx Limited
Location
East London, London, United Kingdom
Employment Type
Contract
Contract Rate
market rates, outside IR35, remote first, UK but 1-2 days on site
AI Prompt Engineer Technically Sharp & Systems-Minded You'll design and optimize prompts, architect LLM-powered systems and deploy scalable GenAI workflows that connect people and intelligent systems in new, high-impact ways. What You'll Do ...

Kafka Data Architect (Streaming And Payment)

Hiring Organisation
IBU
Location
Greater London, England, United Kingdom
SLAs Access controls Retention and lineage Implement security best practices: Data classification KMS-based encryption Tokenization where required Least-privilege IAM Immutable audit logging Observability, Reliability & FinOps Build observability for streaming and data platforms using: CloudWatch, Prometheus, Grafana Track operational KPIs: Throughput (TPS) Processing lag Success/error rates Cost ...

Senior DevOps Engineer

Hiring Organisation
Xact Placements Limited
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£90,000 per annum
doing Designing and evolving distributed, multi-region infrastructure Solving complex scaling, reliability and performance challenges Driving DevOps performance across deployment, availability and recovery Improving observability, fault tolerance and operational maturity Championing infrastructure-as-code, automation and secure-by-design practices Collaborating across engineering, product and security teams What they … looking for Significant experience building and operating distributed systems at scale Strong cloud background (AWS/Azure), Terraform and Kubernetes Experience with observability tooling (Prometheus, Grafana, EFK) and messaging systems (Kafka) Solid understanding of networking fundamentals and global architecture Comfortable operating at Principal level and influencing technical direction ...

Full Stack Software Engineer

Hiring Organisation
Firenze
Location
London Area, United Kingdom
scalable architecture. This is a hands-on role with plenty of opportunity to develop your technical depth, gain exposure to DevOps, cloud, and observability practices, and help define how we build software at Firenze. With no legacy systems, you’ll be shaping our platform from the ground … design resilient data flows. Quality & Testing: Contribute to our testing culture (JUnit, Playwright, CI pipelines) and learn how to scale reliable delivery. Performance & Observability: Gain hands-on experience with monitoring tools and learn how to build observable systems from day one. Documentation & Developer Experience: Help create clear technical docs ...

Full Stack Software Engineer

Hiring Organisation
Firenze
Location
City of London, London, United Kingdom
scalable architecture. This is a hands-on role with plenty of opportunity to develop your technical depth, gain exposure to DevOps, cloud, and observability practices, and help define how we build software at Firenze. With no legacy systems, you’ll be shaping our platform from the ground … design resilient data flows. Quality & Testing: Contribute to our testing culture (JUnit, Playwright, CI pipelines) and learn how to scale reliable delivery. Performance & Observability: Gain hands-on experience with monitoring tools and learn how to build observable systems from day one. Documentation & Developer Experience: Help create clear technical docs ...

Senior Platform Engineer

Hiring Organisation
Prism Digital
Location
City of London, London, United Kingdom
ambiguous platform work Build and enhance Azure landing zones and internal platform services Deliver infrastructure-as-code, CI/CD, self-service tooling and observability end-to-end Challenge design assumptions with a “show me the code” approach Pair with engineers to unblock delivery and lift team-wide engineering standards … governance, landing zones Azure PaaS: App Service, Functions, container platforms (ACA/AKS) CI/CD: GitHub Actions or Azure DevOps with full automation Observability: logging, metrics, dashboards and alerting Incident Response: diagnosing and resolving complex platform issues Why Join: Shape a secure, scalable Azure platform in a regulated financial ...

Senior Platform Engineer

Hiring Organisation
Prism Digital
Location
London Area, United Kingdom
ambiguous platform work Build and enhance Azure landing zones and internal platform services Deliver infrastructure-as-code, CI/CD, self-service tooling and observability end-to-end Challenge design assumptions with a “show me the code” approach Pair with engineers to unblock delivery and lift team-wide engineering standards … governance, landing zones Azure PaaS: App Service, Functions, container platforms (ACA/AKS) CI/CD: GitHub Actions or Azure DevOps with full automation Observability: logging, metrics, dashboards and alerting Incident Response: diagnosing and resolving complex platform issues Why Join: Shape a secure, scalable Azure platform in a regulated financial ...

Senior Manager, DDOS Engineering and Development

Hiring Organisation
Oracle
Location
London, UK
Employment Type
Full-time
rigorous post-incident improvements. Champion automation-first operations: CI/CD, test frameworks, canary/blue-green releases, and infrastructure-as-code. Build robust observability (metrics, logs, traces) and near-real-time telemetry/streaming pipelines for detection at scale. Security, compliance, and risk Govern threat modeling, architecture reviews … more: Java, Go, Python, C++, or Rust; strong preference for Java for control-plane/services. Demonstrated leadership in incident management, resilience engineering, observability, and operational maturity. Excellent stakeholder management and executive communication; data-driven prioritization and tradeoff decision-making. Preferred qualifications Experience with anycast routing, traffic steering, multi-region ...

Senior Manager, DDOS Engineering and Development

Hiring Organisation
Oracle
Location
South London, UK
Employment Type
Full-time
rigorous post-incident improvements. Champion automation-first operations: CI/CD, test frameworks, canary/blue-green releases, and infrastructure-as-code. Build robust observability (metrics, logs, traces) and near-real-time telemetry/streaming pipelines for detection at scale. Security, compliance, and risk Govern threat modeling, architecture reviews … more: Java, Go, Python, C++, or Rust; strong preference for Java for control-plane/services. Demonstrated leadership in incident management, resilience engineering, observability, and operational maturity. Excellent stakeholder management and executive communication; data-driven prioritization and tradeoff decision-making. Preferred qualifications Experience with anycast routing, traffic steering, multi-region ...

Azure DevOps Engineer

Hiring Organisation
McCabe & Barton
Location
Central London, London, United Kingdom
Employment Type
Permanent
build failures, manage YAML pipeline configurations, support deployment processes across Azure environments, manage service connections, and collaborate with development teams on release automation. Monitoring & Observability - Proficient in implementing and managing Azure Monitor, Log Analytics workspaces, Application Insights, and Azure dashboards. Experience creating alert rules, action groups, workbooks, and analysing metrics … Kusto Query Language). Skilled in performance troubleshooting, implementing Azure Service Health monitoring, and setting up distributed tracing. Ideally, knowledge and experience of Datadog Observability tooling. Security & Compliance - Strong understanding of Azure security best practices including Azure Security Center/Microsoft Defender for Cloud, encryption using Azure Key Vault, network ...

Software Engineering Manager - Unified Client Experience (UCX)

Hiring Organisation
Hargreaves Lansdown
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
guiding teams through design implementation, collaborating with product and design using tools like Figma. Familiar with cloud-native environments (AWS, Docker, Kubernetes) and observability tools like Prometheus and Grafana. Champions quality and security, embedding testing and scanning into development pipelines. Passionate about mentoring engineers, conducting code reviews …/CSS Figma/Git Testing frameworks: Jest, Cypress, Appium CI/CD pipelines: GitHub Actions, CircleCI, Bitrise Cloud-native architecture: AWS, Docker, Kubernetes Observability tools Interview Process 3 Stage Interview Stage 1 - Discussion with our Hiring Manager (30mins): A chance to talk with our Hiring Manager in more detail ...

Principal Software Engineer - Unified Client Experience (UCX)

Hiring Organisation
Hargreaves Lansdown
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£90,000
practices. Comfortable designing and operating in both on-prem and cloud-native environments, with working knowledge of AWS, Docker, and Kubernetes. Advocates for observability and service health, using tools like Prometheus and Grafana to ensure reliability and performance. Champions quality and security, embedding testing and scanning into CI/… RDBMS (Oracle, Sybase)/NoSQL (Document DB) AWS/Docker/Kubernetes CI/CD pipelines: GitHub Actions, Harness Testing frameworks: Jest, Cypress, Appium Observability tools: Prometheus, Grafana Interview Process 3 Stage Interview Stage 1 - Discussion with our Hiring Manager (30mins): A chance to talk with our Hiring Manager ...

Platform Engineer

Hiring Organisation
Ncounter LTD
Location
East London, London, United Kingdom
Employment Type
Permanent
Principal Platform Engineer, Observability £150,000-170,000 + bonus Join a high-performance trading platform and take ownership of their Observability and SRE evolution. You'll work at scale, shaping data ingestion and monitoring pipelines that handle huge real-time trading flows. What you'll work with Python … DevOps/SRE setting Linux, Kubernetes, public cloud Prometheus, Grafana, telemetry and full Observability tooling GitLab, Bitbucket and modern CI/CD Bonus: Slurm, HPC What they're looking for 8+ years engineering with Python or Go Strong systems engineering mindset Confident in design discussions, delivering clean and reliable code Keen ...

Principal Platform Engineer

Hiring Organisation
XACT PLACEMENTS LIMITED
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£90,000
doing Designing and evolving distributed, multi-region infrastructure Solving complex scaling, reliability and performance challenges Driving DevOps performance across deployment, availability and recovery Improving observability, fault tolerance and operational maturity Championing infrastructure-as-code, automation and secure-by-design practices Collaborating across engineering, product and security teams What they're looking … Significant experience building and operating distributed systems at scale Strong cloud background (AWS/Azure), Terraform and Kubernetes Experience with observability tooling (Prometheus, Grafana, EFK) and messaging systems (Kafka) Solid understanding of networking fundamentals and global architecture Comfortable operating at Principal level and influencing technical direction ...

Principal Platform Engineer

Hiring Organisation
Xact Placements Limited
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£90,000 per annum
doing Designing and evolving distributed, multi-region infrastructure Solving complex scaling, reliability and performance challenges Driving DevOps performance across deployment, availability and recovery Improving observability, fault tolerance and operational maturity Championing infrastructure-as-code, automation and secure-by-design practices Collaborating across engineering, product and security teams What they … looking for Significant experience building and operating distributed systems at scale Strong cloud background (AWS/Azure), Terraform and Kubernetes Experience with observability tooling (Prometheus, Grafana, EFK) and messaging systems (Kafka) Solid understanding of networking fundamentals and global architecture Comfortable operating at Principal level and influencing technical direction ...

Technical Architect

Hiring Organisation
Inara
Location
London, UK
Employment Type
Full-time
security, and product teams to turn business requirements into robust architectural solutions and clear technical roadmaps. Drive standards around cloud governance, infrastructure-as-code, observability, cost optimisation, and high availability. Provide hands-on technical leadership — code reviews, technical oversight, solution design — while mentoring teams and supporting delivery. What … Excellent knowledge of Python, Terraform, and modern automation patterns Experience designing scalable, secure, cloud-native platforms A solid DevOps mindset: CI/CD, IaC, observability, reliability Ability to influence technical direction while remaining hands-on Strong communication skills and confidence working with engineering, product, and security stakeholders ...

Technical Architect

Hiring Organisation
Inara
Location
South London, UK
Employment Type
Full-time
security, and product teams to turn business requirements into robust architectural solutions and clear technical roadmaps. Drive standards around cloud governance, infrastructure-as-code, observability, cost optimisation, and high availability. Provide hands-on technical leadership — code reviews, technical oversight, solution design — while mentoring teams and supporting delivery. What … Excellent knowledge of Python, Terraform, and modern automation patterns Experience designing scalable, secure, cloud-native platforms A solid DevOps mindset: CI/CD, IaC, observability, reliability Ability to influence technical direction while remaining hands-on Strong communication skills and confidence working with engineering, product, and security stakeholders ...

AI Platform Engineer

Hiring Organisation
The Portfolio Group
Location
City of London, London, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
Databricks, including ingestion and embedding pipelines. Scale and operate vector search infrastructure (Weaviate, OpenSearch, Algolia, AWS Bedrock Knowledge Bases). Implement strong observability, CI/CD, security, and governance across AI workloads. Enable future architectures such as multi-model orchestration and agentic workflows. Required Skills & Experience Strong experience designing … Kubernetes using Terraform. Solid understanding of distributed systems, cloud architecture, and API design, with a focus on scalability and reliability. Demonstrable ownership of observability, performance, cost efficiency, and operational robustness in production environments. Why Join? You'll own the foundational AI platform behind a growing suite of generative ...

AI Platform Engineer

Hiring Organisation
The Portfolio Group
Location
London, Castle Baynard, United Kingdom
Employment Type
Permanent
Databricks, including ingestion and embedding pipelines. Scale and operate vector search infrastructure (Weaviate, OpenSearch, Algolia, AWS Bedrock Knowledge Bases). Implement strong observability, CI/CD, security, and governance across AI workloads. Enable future architectures such as multi-model orchestration and agentic workflows. Required Skills & Experience Strong experience designing … Kubernetes using Terraform. Solid understanding of distributed systems, cloud architecture, and API design, with a focus on scalability and reliability. Demonstrable ownership of observability, performance, cost efficiency, and operational robustness in production environments. Why Join? You'll own the foundational AI platform behind a growing suite of generative ...

.NET Developer

Hiring Organisation
Avanti
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£55,000 - £65,000 per annum
native services) • PostgreSQL • Next.js (separate frontend team handles this) • Playwright for E2E testing • Databricks + ML tooling • GitHub Copilot/AI-assisted development • Modern observability stack (Application Insights, telemetry, logging) You will also work on large-scale integrations with global systems and external APIs. What We’re Looking For • Product … minded, ownership-driven approach • Strong backend engineering experience with .NET • Cloud experience (Azure, AWS or GCP) • Understanding of observability and reliable system design • Exposure to at least one additional programming language (Go, Python, Rust, Node, Java etc), commercial or side-project experience both count • Ability to work autonomously and solve ...

Full Stack Engineer (Python/Js/MS Stack) Agentic AI Project

Hiring Organisation
GCS
Location
City, London, United Kingdom
Employment Type
Contract
Contract Rate
GBP Annual
deliver end-to-end features, from robust Back End APIs to polished Front End experiences, on Azure, with a strong emphasis on reliability, observability, and secure data handling. What you'll do Design & build scalable services in Python (FastAPI/Django/Flask) and front-ends in JavaScript/TypeScript … agents (eg, task planning, tool calling) and LLM apps (chat, summarisation, RAG, structured outputs). Engineering excellence: CI/CD, automated testing, code reviews, observability, and secure development practices. Nice to have Experience with Azure OpenAI, OSS LLMs, model evaluation (BLEU/ROUGE/BERTScore, human-in-the-loop ...

Azure DevOps Engineer

Hiring Organisation
Prism Digital
Location
City of London, London, United Kingdom
ambiguous platform work Build and enhance Azure landing zones and internal platform services Deliver infrastructure-as-code, CI/CD, self-service tooling and observability end-to-end Challenge design assumptions with a “show me the code” approach Pair with engineers to unblock delivery and lift team-wide engineering standards … governance, landing zones Azure PaaS: App Service, Functions, container platforms (ACA/AKS) CI/CD: GitHub Actions or Azure DevOps with full automation Observability: logging, metrics, dashboards and alerting Incident Response: diagnosing and resolving complex platform issues Why Join: Shape a secure, scalable Azure platform in a regulated financial ...

Azure DevOps Engineer

Hiring Organisation
Prism Digital
Location
London Area, United Kingdom
ambiguous platform work Build and enhance Azure landing zones and internal platform services Deliver infrastructure-as-code, CI/CD, self-service tooling and observability end-to-end Challenge design assumptions with a “show me the code” approach Pair with engineers to unblock delivery and lift team-wide engineering standards … governance, landing zones Azure PaaS: App Service, Functions, container platforms (ACA/AKS) CI/CD: GitHub Actions or Azure DevOps with full automation Observability: logging, metrics, dashboards and alerting Incident Response: diagnosing and resolving complex platform issues Why Join: Shape a secure, scalable Azure platform in a regulated financial ...