176 to 200 of 243 Observability Jobs in London

Site Reliability Engineer

Hiring Organisation
VIQU IT
Location
United Kingdom, Whitechapel, Greater London
Employment Type
Permanent
Salary
£40000 - £50000/annum
Engineer to help improve the reliability, scalability and automation of their AWS estate. This is a hands-on engineering role working across cloud infrastructure, observability, CI/CD and platform tooling, helping development teams deliver faster and more reliably. You’ll be joining a collaborative engineering environment with the opportunity … scalable AWS infrastructure. Develop and manage Infrastructure as Code using AWS CDK. Support CI/CD pipelines and deployment automation. Improve monitoring, logging and observability across distributed systems. Support incident management, root cause analysis and platform reliability improvements. Work closely with engineering and architecture teams to improve operational performance ...
Hybrid / Remote Options

Dynatrace Expert

Hiring Organisation
BGTS LTD
Location
London, United Kingdom
Employment Type
Permanent
Salary
£65000 - £80000/annum
Microservices Integration The candidate will be responsible for integrating Dynatrace monitoring within our AWS cloud infrastructure and microservices ecosystem. This includes ensuring seamless observability across containerized environments (e.g., Kubernetes, Docker) and serverless architectures. The expert will collaborate closely with development and DevOps teams to embed monitoring best practices into … system administrators, and project managers. The ability to document monitoring strategies, root cause analyses, and best practices clearly is crucial for maintaining a robust observability culture within the organization. Preferred Qualifications Dynatrace Associate or Professional Certification. Experience with OpenTelemetry (OTEL) implementation. Familiarity with other monitoring and logging tools (e.g., Splunk ...

Engineering Manager

Hiring Organisation
Visa
Location
London, UK
Employment Type
Full-time
Job Description About Us: Visa is a world leader in payments technology, facilitating transactions between consumers, merchants, financial institutions and government entities across more than 200 countries and territories, dedicated to uplifting everyone, everywhere by being ...

Networking Specialist

Hiring Organisation
Ncounter
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£160,000 - £175,000 per annum
essential, alongside confidence working with modern data centre technologies. Nice to Haves: • Experience with automation using Python, Ansible, or similar tools • Exposure to observability and monitoring platforms • Understanding of network security and secure routing design • Hands-on experience with Arista and/or Cisco in production environments • Industry certifications such ...

Bid Solution Architect - LONDON - PART TIME

Hiring Organisation
Reed
Location
Southwark, London, England, United Kingdom
Employment Type
Temporary
Salary
Salary negotiable
ready. Assure the full scheduling system architecture, focusing on performance and resilience. Validate integration assumptions, API patterns, data flows, and control mechanisms. Ensure system observability, failover, and peak-load behavior are credible and evidenced. Design or validate security controls across application, infrastructure, and operations. Ensure alignment of IAM, encryption, logging ...

Senior Data Platform Engineer

Hiring Organisation
ITSS Recruitment
Location
London, United Kingdom
Employment Type
Permanent
Salary
£70000 - £100000/annum Bonus + Fantastic benefits
consistency and reusability across environments. * Build and optimise CI/CD pipelines using Azure DevOps and GitHub Actions to support rapid, reliable deployments. * Implement observability practices including logging, metrics, and alerting using observability tools. * Collaborate with the Lead Engineer and Architects to align implementation with platform standards and patterns. * Provide … Fabric. * Proven experience with infrastructure-as-code using Terraform and building CI/CD pipelines via Azure DevOps and GitHub Actions. * Strong grasp of observability practices, including logging, metrics, alerting, and performance optimisation. * Deep understanding of cloud security, with experience applying secure-by-design principles in Azure and/ ...

Senior DevOps Engineer

Hiring Organisation
INTEC SELECT LIMITED
Location
London, UK
Employment Type
Full-time
Azure and Terraform expertise, who is comfortable operating in a hands-on capacity, while also mentoring others and driving improvements across CI/CD, observability, and security. Role & Responsibilities Design, build, and manage Azure infrastructure using Terraform, including modules, state management, and pipelines Develop and maintain CI/CD workflows (GitHub … Actions, Azure DevOps or similar) Improve platform reliability, observability, and security across environments Take ownership of infrastructure and deployment processes within a fast-moving delivery team Collaborate closely with engineers to embed DevOps best practices and scalable patterns Mentor team members on infrastructure, automation, and platform engineering principles Identify ...

Senior DevOps Engineer (Azure / Terraform)

Hiring Organisation
INTEC SELECT LIMITED
Location
City of London, London, England, United Kingdom
Employment Type
Contractor
Contract Rate
£600 - £650 per day
Azure and Terraform expertise, who is comfortable operating in a hands-on capacity, while also mentoring others and driving improvements across CI/CD, observability, and security. Role & Responsibilities Design, build, and manage Azure infrastructure using Terraform, including modules, state management, and pipelines Develop and maintain CI/CD workflows … GitHub Actions, Azure DevOps or similar) Improve platform reliability, observability, and security across environments Take ownership of infrastructure and deployment processes within a fast-moving delivery team Collaborate closely with engineers to embed DevOps best practices and scalable patterns Mentor team members on infrastructure, automation, and platform engineering principles Identify ...

Senior DevOps Engineer

Hiring Organisation
INTEC SELECT LIMITED
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£550 - £650 per day
Azure and Terraform expertise, who is comfortable operating in a hands-on capacity, while also mentoring others and driving improvements across CI/CD, observability, and security. Role & Responsibilities Design, build, and manage Azure infrastructure using Terraform, including modules, state management, and pipelines Develop and maintain CI/CD workflows … GitHub Actions, Azure DevOps or similar) Improve platform reliability, observability, and security across environments Take ownership of infrastructure and deployment processes within a fast-moving delivery team Collaborate closely with engineers to embed DevOps best practices and scalable patterns Mentor team members on infrastructure, automation, and platform engineering principles Identify ...

Site Reliability Engineer

Hiring Organisation
Arrows
Location
London Area, United Kingdom
/CircleCI) 🔄 Operate and optimise Kubernetes environments (EKS primarily, GKE exposure a bonus) ☸️ Build and manage Infrastructure as Code using Terraform 🏗️ Champion reliability engineering: observability 👀, incident response 🚨, performance & cost optimisation 💡, and security best practices 🔐 Drive automation across environments and collaborate with cross-functional teams 🤝 ✅ What You’ll Bring Strong hands … pipelines end-to-end 🚀 A senior, self-sufficient communicator who can mentor and work across multiple teams 💬 ⭐ Nice to Have Experience with service mesh & observability tools (Istio, Prometheus, Grafana, Datadog) 📊 Policy as code exposure 📜 Scripting skills (Bash/Python/Go) 💻 Experience with GKE or multi-cloud environments 🌍 👉 Interested ...

DevOps Manager

Hiring Organisation
Harvey Nash
Location
London Area, United Kingdom
optimisation Drive CI/CD strategy using GitHub and modern DevOps tooling Champion Infrastructure as Code using Terraform/ARM Implement and maintain observability and monitoring solutions Partner closely with security teams to meet regulatory and cyber‐security standards Manage third‐party vendors and ensure service delivery standards Mentor engineers … background in DevOps and Azure cloud operations Proven experience leading engineering teams CI/CD, Git, GitHub pipelines Infrastructure as Code (Terraform, ARM, Ansible) Observability tools such as Prometheus and Grafana Containers and orchestration (Docker, Kubernetes) Scripting (PowerShell) Experience in regulated environments such as banking, trading, financial services or similar ...

Go Full Stack Developer

Hiring Organisation
itecopeople
Location
London, United Kingdom
Employment Type
Permanent
Salary
£54000 - £61000/annum
event-driven services Contribute to CI/CD pipelines and cloud-native deployments Review code and champion engineering best practices Improve application performance, observability and reliability Collaborate within Agile delivery teams across multiple projects Support technical decision-making and continuous improvement Skills & Experience We are looking for candidates with strong … reviews, testing and engineering governance Experience with any of the following would be highly advantageous: Microsoft Azure Python GitOps tooling (Argo CD/Flux) Observability tooling (Prometheus, Grafana, OpenTelemetry) AI/LLM-enabled applications Event-driven architectures and messaging platforms What's on Offer Opportunity to work on cutting-edge ...

Staff Software Engineer

Hiring Organisation
Visa
Location
London, UK
Employment Type
Full-time
standards Apply distributed systems principles including idempotency and safe retries, failure isolation and graceful degradation, schema and API versioning Build systems with clear SLAs, SLOs, and observability Maintain a strong security posture across services and data access Data-Intensive & Reporting Systems: Work closely with data engineering teams on canonical data models, regime-specific … more of Java, Python (or similar) Strong system design skills across API-driven architectures, Data-intensive services, Batch and event-driven workflows Deep understanding of reliability, observability, and operational excellence Data & Analytics Awareness: Strong understanding of data modelling concepts such as canonical models and dimensional models Experience working alongside modern data platforms (e.g. Snowflake, BigQuery, Redshift) Ability to reason about Data ...

Contract Observability Engineer

Hiring Organisation
Xpertise
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
GBP 650 Daily
Real Time Observability Engineer | Contract | £650 INSIDE IR35 | Hybrid/Global Platform | High Performance Systems We are working with a global Real Time data platform operating across multiple markets and high-throughput trading environments. Key responsibilities Build and maintain application-layer observability components that aggregate and correlate telemetry across distributed … Extend low-level instrumentation approaches where required, including Kernel-level or high-efficiency data capture techniques What you bring Strong software engineering background building observability, telemetry, or distributed data systems Experience with Real Time or streaming data environments Proficiency in systems-level programming (C++, Go, or Rust) alongside a higher ...

Service Architect- 6 month contract

Hiring Organisation
Opus Recruitment Solutions Ltd
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£400 - £420 per day
Design for a platform-based product delivery model that underpins hundreds of products Use of Dynamic CIs for product discovery and association Integration with observability tools such as Datadog • Familiarity with AWS or Confluent Cloud an advantage Details: 6 months (likely extension) Outside IR35 Fully remote ...

Back End Developer

Hiring Organisation
NearTech Search
Location
City of London, London, United Kingdom
backend initiatives end-to-end, from architecture to rollout • Strengthen testing strategy across unit and integration layers • Improve data and integration workflows with observability and resilience • Optimise Postgres (RDS) and MongoDB performance, modelling and migrations The role requires... • Strong commercial experience with Node.js and TypeScript • Deep API design expertise, including ...

Principal Product Manager

Hiring Organisation
ZEREN
Location
City of London, London, United Kingdom
serious scale. What you'll be working on: • 0-1 build of the strategy and roadmap for a GenAI platform spanning infrastructure, tooling, and observability • Designing platform capabilities that make experimentation and deployment of features frictionless - measured through DORA and Core4 metrics • Partnering with security, compliance, and data governance teams ...

Senior Frontend Developer

Hiring Organisation
SEEKR
Location
City of London, London, United Kingdom
bridges so builders can wire their products into hundreds of third‐party tools without hand‐rolling every integration. It handles managed auth, real‐time observability and connector sprawl so product teams can focus on great agent experiences instead of glue code. Your job is to make the surface they ...

Senior Software Engineer

Hiring Organisation
Harrington Starr
Location
London Area, United Kingdom
business-critical trading platform. The role combines software engineering with reliability engineering. You’ll be involved in designing and building internal tooling, improving observability, automating operations, supporting development teams, and helping ensure trading systems remain stable, scalable, and high performing. It would suit someone who enjoys solving technical problems … speed, resilience, and continuous improvement matter. What you will do Build tools, automation, and internal services that improve platform reliability Implement monitoring, telemetry, and observability standards across distributed systems Analyse performance across application, OS, and network layers to identify bottlenecks Help define and improve SLOs/SLAs for critical services ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
West London, UK
performant infrastructure that underpins critical public-sector services. You'll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. You'll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation
Profile 29
Location
South East London, London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£65,000
performant infrastructure that underpins critical public-sector services. You'll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. You'll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … using Terraform Build and operate scalable infrastructure in Amazon Web Services (AWS) Design, implement, and maintain robust CI/CD pipelines Improve system reliability, observability, performance, and security Implement monitoring, logging, and alerting solutions Troubleshoot production incidents and perform root cause analysis Collaborate with development teams to improve application resilience ...

Engineering Manager (.NET) - Contract

Hiring Organisation
La Fosse
Location
City of London, London, United Kingdom
resource/capacity management and delivery ownership. - Experience writing executive updates and technical summaries for senior stakeholders. - Strong knowledge of CI/CD, automation, observability, and DevOps maturity models. - Evidence of driving adoption of new tools, frameworks, or processes across multiple teams. Technical Skills & Tools - Languages & Frameworks: C#/.NET … Framework and Core), React - Platforms & Infrastructure: Azure, AKS, Docker, on-prem Windows Server, SQL Server. - IAM and App Gateways: Okta, APIM, Apigee - Monitoring & Observability: Dynatrace, Application Insights - CI/CD & DevOps: Azure DevOps pipelines, SonarCloud, Github - Architecture & Patterns: Microservices, event-driven architecture, domain-driven design, modern scalable design principles ...

Junior Site Reliability Engineer

Hiring Organisation
RevTech
Location
London Area, United Kingdom
role in platform automation and CI/CD improvement, working closely with engineering teams to support production systems, streamline deployments, and enhance observability across the estate. This role offers genuine ownership, strong technical exposure, and a clear progression path into a mid-level SRE position within 18 months. Required Skills … write clean, maintainable production-quality code in Python or Bash Solid networking knowledge including DNS, load balancing, and CDN concepts Experience with monitoring/observability platforms such as New Relic, Datadog, Prometheus, or Grafana Comfortable working in incident management and on-call environments Experience using AI-assisted development tools such ...

Platform Engineer

Hiring Organisation
Albert Bow
Location
City of London, London, United Kingdom
preparation, turning compliance into a competitive advantage Build and maintain robust CI/CD pipelines across backend, frontend, and data services Establish company-wide observability — logging, metrics, tracing, alerting, and on-call culture Take ownership of cloud cost management, optimising spend without compromising performance Champion operational excellence across the engineering … What You'll Bring Technical Cloud & IaC: Azure (AWS a bonus), Terraform, AKS/Kubernetes, Docker, GitHub Actions Observability: Hands-on experience with logging, metrics, and distributed tracing frameworks Security: Secrets management, security scanning, and infrastructure hardening best practices Networking: VPCs, DNS, load balancers, VPNs, firewalls — you know your ...

Staff SW Engineer-1

Hiring Organisation
Visa
Location
London, UK
Employment Type
Full-time
practices Apply distributed systems principles including idempotency and safe retries, failure isolation and graceful degradation, schema and API versioning Build systems with clear SLAs, SLOs, and observability Maintain a strong security posture across services and data access Data-Intensive & Reporting Systems: Work closely with data engineering teams on canonical data models, regime-specific … more of Java, Python (or similar) Strong system design skills across API-driven architectures, Data-intensive services, Batch and event-driven workflows Deep understanding of reliability, observability, and operational excellence Data & Analytics Awareness: Strong understanding of data modelling concepts such as canonical models and dimensional models Experience working alongside modern data platforms (e.g. Snowflake, BigQuery, Redshift) Ability to reason about Data lineage ...