1 to 25 of 41 Permanent Observability Jobs in Central London

Solace Administrator

Hiring Organisation
BGC Group
Location
City of London, London, United Kingdom
high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). ...

DevOps Engineer AWS EKS Linux

Hiring Organisation
Client Server
Location
City of London, London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£70,000
well as developing Infrastructure-as-Code using Terraform, supporting scalable and maintainable infrastructure deployments. You will manage and evolve the Kubernetes based platform, enhance observability across the platform and improve CI/CD pipelines for speed, reliability and quality of releases. Collaborating with engineering and product teams you will help ...

Senior Backend Software Engineer

Hiring Organisation
algo1
Location
City of London, London, United Kingdom
storage layers Integrate AI-driven personalisation and real-time insights into user flows Contribute to overall system design: service boundaries, data ownership, scaling, and observability Essential Qualifications: Built modern full-stack applications w/focus on backend (e.g. Python, Java, Go) Worked with a variety of storage technologies (e.g. Postgres ...

Systems/SRE Engineer

Hiring Organisation
Thurn Partners
Location
City of London, London, United Kingdom
Proficient in one or more programming languages such as Python, Go, Ruby, or Perl. Strong experience with Linux system administration. Hands-on experience with observability tools like Prometheus, Grafana, Thanos, and the ELK stack. Familiarity with Kubernetes, Docker, AWS, and GCP. ...

Machine Learning Engineer

Hiring Organisation
algo1
Location
City of London, London, United Kingdom
serving latency or pipeline robustness. Month 3: Own and deliver a major infrastructure component (e.g., feature store, training orchestration, or model registry); improve system observability with logging, metrics, and alerting. Month 6: Lead the end-to-end productionisation of our foundation model, meeting latency, throughput, and reliability SLAs; mentor teammates ...

Staff Software Engineer (Python | AI £180k)

Hiring Organisation
Paradigm Talent
Location
City of London, London, United Kingdom
orchestration (Docker, Kubernetes). Message queues and streaming systems (e.g. Kafka, SQS, RabbitMQ). Relational and/or NoSQL databases. CI/CD pipelines, observability, monitoring, and logging as first-class concerns. Exposure to computer vision, 3D data, or similar domains is a strong plus, but not required. ...

Senior Java Developer (Low-Latency Payments Systems)

Hiring Organisation
RE Partners
Location
City of London, London, United Kingdom
ensure production-grade solutions. Participate in architectural discussions, advocate for best practices, and provide mentorship to peers. Drive performance tuning, fault tolerance, and observability improvements across services. Required Skills & Experience: 5+ years of experience in Core Java development with a focus on performance and memory optimization. Proficient in SpringBoot ...

Principal Engineer

Hiring Organisation
Motive Group
Location
City of London, London, United Kingdom
experience with Kubernetes and container orchestration. A strong grasp of Infrastructure-as-Code (Terraform) and configuration management tools (Ansible, Puppet, or similar). Strong observability experience using tools like Prometheus/Mimir, Loki, Tempo, Grafana, Alertmanager. Experience deploying and operating large-scale GPU clusters or HPC systems (Ideally). Working ...

Senior Java Software Engineer

Hiring Organisation
Paritas Recruitment
Location
City of London, London, United Kingdom
. Hands-on experience with the full software lifecycle in agile environments and CI/CD pipelines. Strong understanding of performance, scalability, security, and observability in distributed systems. Ability to translate complex requirements into resilient, production-ready software. Excellent collaboration and communication skills. Why This Role This is your chance ...

Senior Data Engineer, Azure

Hiring Organisation
ARC IT Recruitment Ltd
Location
City of London, London, United Kingdom
Employment Type
Permanent
Server and Power BI Solid understanding of data warehousing, lakehouse and datalake architectures Familiarity with modern data engineering patterns (ETL/ELT, medallion architecture, observability) Experience working with Apache Spark and large-scale data processing For a full consultation on this exciting new role, please get in touch with ...

Head of Product Operations and Support

Hiring Organisation
Gray Global Placements LTD
Location
Central London, London, United Kingdom
Employment Type
Permanent
Requirements: - 15+ years of experience in leading support/operations roles in enterprise SaaS or technology environments. - Familiarity with cloud-based environments (AWS) and observability platforms. - Background in managing support across hybrid or multi-tenant platforms. - Proven experience in building and scaling global support teams and operational processes. - Expertise ...

Cloud Engineer

Hiring Organisation
Quantum Technology Solutions Inc
Location
City of London, London, United Kingdom
across Azure: · Identity and access models · RBAC and least-privilege enforcement · Secrets management and key rotation · Encryption at rest and in transit · Implement strong observability and auditability, including logging, monitoring, alerting, and security events. · Design systems assuming attack, failure, and misuse as default scenarios. · Lead cloud-level incident response ...

Site Reliability Engineer

Hiring Organisation
Revybe IT Recruitment Ltd
Location
City of London, London, England, United Kingdom
Employment Type
Full-Time
Salary
£65,000 - £75,000 per annum
shape how platform engineering is done as the team continues to scale. Tech stack AWS (Core services - EC2, RDS, S3, IAM, etc.) Monitoring and Observability Grafana, Prometheus Kubernetes (building and managing production clusters) Terraform (IaC provisioning) Python, Bash or Go (scripting, automation) GitHub Actions (CI/CD pipelines) What They … Looking For Experience in AWS cloud infrastructure (ideally in a regulated or high-traffic environment) Previous experience working with Monitoring and Observability Tools Hands-on Kubernetes know-how, specifically with EKS. Solid IaC experience with Terraform. Experience with containerisation (Docker, Helm) and CI/CD (GitHub Actions or similar) Solid ...

Backend Engineer (TypeScript/Ruby)

Hiring Organisation
Oliver Bernard
Location
City of London, London, United Kingdom
Kubernetes Why it’s exciting Work at scale in a high-traffic, consumer-facing environment Strong engineering culture: automated testing, reliable CI/CD, observability, distributed systems True end-to-end ownership and impact 📩 Interested? DM your CV and notice period today! 🔥 Software Engineer – Build a Platform Used by Millions ...

Cloud Platform Network Engineer

Hiring Organisation
Skillsbay Limited
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£80,000
across cloud and on-premise environments Troubleshoot and resolve connectivity issues quickly and effectively Automate network configuration using Terraform, PowerShell and Azure CLI Maintain observability using Azure Monitor, Log Analytics and Network Watcher Ensure deployments align with security and compliance standards Produce technical documentation and support knowledge sharing Required Experience ...

Forward Deployed Engineer (Graduate/Early Career)

Hiring Organisation
Plexe AI
Location
City of London, London, United Kingdom
engagements and related product features. • Product Development: leverage your forward deployment experience to improve the Plexe platform, including distributed ML infra, multi-agent systems, observability, client-facing APIs, and more. • Cross-Functional Collaboration: work closely with founders to translate business vision and customer feedback into actionable engineering plans. • Ownership: take ...

Palantir Consultant

Hiring Organisation
Staffworx Limited
Location
Central London, London, United Kingdom
Employment Type
Permanent
Scalability, Reliability & Operations Help investigate performance issues (e.g. parallelisation, partitioning, caching, compute configuration) with mentorship from more senior colleagues. Contribute to monitoring, alerting and observability setup for pipelines, applications and integrations. Participate in incident response and root cause analysis for platform and application issues. Assist in applying non-functional requirements ...

Senior Software Engineer - Quant Firm

Hiring Organisation
Dex
Location
City of London, London, United Kingdom
close to the metal" to optimize performance across networking, I/O, and compute layers, squeezing maximum efficiency out of hardware. Build World-Class Observability: Create robust monitoring and telemetry systems to provide real-time insights into pipeline health, trading activity, and model behavior. Work with the Best: Work side ...

Platform / DevSecOps Engineer - Remote - GCP

Hiring Organisation
Opus Recruitment Solutions
Location
Central London / West End, London, United Kingdom
running Kubernetes at scale, with strong experience in GCP, Terraform, and CI/CD pipelines. You’ll play a key role in improving security, observability, and reliability, so experience in these areas is a must-have. In return, our client is offering remote working, 4 weeks holiday, well-being expenses ...

Python Software Engineer

Hiring Organisation
Durlston Partners
Location
City of London, London, United Kingdom
time market data, trade execution, and reconciliation. Optimise performance and scalability across key trading infrastructure components. Partner with cross-functional teams to improve tooling, observability, and automation. Deliver robust, production-ready solutions in a fast-paced environment. Requirements Min 2+ years of Python experience in a professional setting (HFT, trading ...

Staff Software Engineer

Hiring Organisation
Annapurna
Location
City of London, London, United Kingdom
organisation Produce clear written documentation to align engineering, product and operations Tech Environment Full-stack TypeScript, React, Postgres Modern cloud infrastructure (AWS, Terraform) Strong observability and monitoring practices Data platforms and analytics tooling AI-powered systems working with complex, unstructured real-world data (Exact tooling is less important than strong ...

Software Engineer

Hiring Organisation
Capi Money
Location
City of London, London, United Kingdom
business owners in Africa Implement IT security and data protection best practices in a regulated environment Proactively drive architectural decisions e.g., improving scalability, observability, and modularity of the codebase Champion code quality through robust testing, documentation, and reviews Operational Observe opportunities for improvements internally to help automate our non-tech ...

Backend Engineer (Distributed Systems Engineer)

Hiring Organisation
TechChain Talent
Location
City of London, London, United Kingdom
coordination and execution • Fault-tolerant systems that scale across thousands of nodes • Performance-critical modules for consensus, networking, and data pipelines • Internal tooling for observability, testing, and simulation • Contributions to open-source infra and protocol-level primitives What You Bring • 5+ years of backend engineering experience in systems-level environments ...

Network Specialist

Hiring Organisation
Ncounter LTD
Location
City of London, London, United Kingdom
Employment Type
Permanent, Work From Home
essential, alongside confidence working with modern data centre technologies. Nice to Haves: Experience with automation using Python, Ansible, or similar tools Exposure to observability and monitoring platforms Understanding of network security and secure routing design Hands-on experience with Arista and/or Cisco in production environments Industry certifications such ...

Services AI Data Solution Principal (Services Technical PreSales), based London

Hiring Organisation
Dell
Location
City, London, United Kingdom
Employment Type
Permanent
Salary
GBP Annual
product and partner ecosystem e.g. NVAIE, Run.ai, H2O.ai, ClearML, OpenShift, etc. Provide expert guidance on modern data stack components: data quality, metadata management, observability, data products, feature stores, with governance and Dell's maturity model frameworks. Stay current on emerging AI and associated Data Management technologies. Actively contribute field feedback ...