Observability Jobs

DV Cleared Platform Engineer

swindon, wiltshire, south west england, united kingdom

Global Technology Solutions Ltd

the provisioning and management of systems using Infrastructure as Code (IaC) Support containerisation and orchestration technologies such as Docker and Kubernetes Monitor platform performance, availability, and security using modern observability tools Collaborate with DevOps, security, and application teams to ensure seamless and secure delivery pipelines Implement and maintain CI/CD pipelines and deployment automation Manage secure configurations, patching, and More ❯

Posted: 2 days ago

Principal DevOps Engineer

London, England, United Kingdom

Devoteam

tools, including experience with some of the following tools: GitLab CI, GitHub Actions, Concourse CI, Jenkins X, TeamCity, Artifactory, etc.; Infrastructure provisioning (at least one of Terraform, Ansible, CloudFormation); Observability and Application monitoring (ELK stack, TICK stack, Grafana, Prometheus, New Relic, Datadog, etc.); Networking concepts - Bastion hosts, Reverse Proxies, Load Balancing, TLS, etc. Key Soft Skills required: Naturally resilient, tenacious More ❯

Posted: Today

Senior Infrastructure Engineer - Full Stack - Cloud - Financial Services

London, England, United Kingdom

ZipRecruiter

and firewalls. Experience with load balancers (F5, HAProxy, Nginx) and network monitoring tools. Experience in DNS management and troubleshooting. Experience in network security best practices. Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disaster recovery More ❯

Posted: Yesterday

Senior DevOps Engineer (SC Cleared)

City of London, London, United Kingdom
Hybrid / WFH Options

Amber Labs

working in Agile teams using tools like Git, Jira, and Confluence Eligible for SC and NPPV3 clearance Desirable: Container orchestration with Kubernetes HashiCorp tools: Vault, Consul, Packer Monitoring and observability with Grafana, Prometheus, or similar Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for technology Strong problem-solving mindset More ❯

Posted: 2 days ago

Senior DevOps Engineer (SC Cleared)

London Area, United Kingdom
Hybrid / WFH Options

Amber Labs

Posted: 2 days ago

Platform Engineer

London, England, United Kingdom

Capgemini

ARM, or Pulumi. Experience in building secure applications and infrastructure. Strong communication skills, with the ability to convey and understand complex technical concepts clearly and concisely. SRE skills including observability and telemetry monitoring. Familiarity with the HashiCorp Suite (Packer, Terraform, Vault, Vagrant, Consul). Experience in containerization using Docker, Kubernetes, OpenShift, and Helm. Programming skills in languages such as Python More ❯

Posted: 4 days ago

Senior Infrastructure Engineer I

London, England, United Kingdom

American Express

and firewalls. Experience with load balancers (F5, HAProxy, Nginx) and network monitoring tools. Experience in DNS management and troubleshooting. Experience in network security best practices. Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disaster More ❯

Posted: Today

Vice President, DevOps Engineer (NE)

London, England, United Kingdom
Hybrid / WFH Options

BlackRock, Inc

access to the best tools available. We combine problem-solving skills with software and systems engineering to take a proactive approach in building fault-tolerant and secure systems, improving observability and zealously automating away toil. In this role you will: Use your site reliability expertise to design, operate and support Preqin's infrastructure, middleware and internal services. Improving their performance More ❯

Posted: Today

Senior Machine Learning Ops Engineer

London, England, United Kingdom

DailyPay

and high availability CI/CD Pipeline Development: Develop and maintain robust CI/CD pipelines for continuous integration and deployment of ML models and related infrastructure Monitoring and Observability: Build and maintain comprehensive monitoring and alerting systems for our ML infrastructure and models, leveraging tools like DataDog to ensure system health and performance Collaboration and Mentorship: Collaborate effectively with More ❯

Posted: Yesterday

Junior Delivery Engineer

United Kingdom
Hybrid / WFH Options

Sportserve

operating system for effective troubleshooting activities Awareness of any cloud infrastructure principles (like AWS, GCP or OCI), understanding basic principles of secure software delivery is a plus Familiar with Observability tools like Grafana or Prometheus, understanding the importance of giving the correct visibility to our platforms and environments We highly value ownership and initiative with capabilities to drive projects independently More ❯

Employment Type: Permanent

Salary: GBP Annual

Posted: 2 days ago

Cloud Technical Architect / Data DevOps Engineer

Bristol, Gloucestershire, United Kingdom

Hewlett Packard Enterprise Development LP

etc. Infrastructure as Code and CI/CD paradigms and systems such as: Ansible, Terraform, Jenkins, Bamboo, Concourse etc. Monitoring utilising products such as: Prometheus, Grafana, ELK, filebeat etc. Observability - SRE Big Data solutions (ecosystems) and technologies such as: Apache Spark and the Hadoop Ecosystem Edge technologies e.g. NGINX, HAProxy etc. Excellent knowledge of YAML or similar languages The following More ❯

Employment Type: Permanent

Salary: GBP Annual

Posted: 2 days ago

Manual Tester (DV Security Clearance)

Basingstoke, England, United Kingdom

Onyx-Conseil

Manual Tester (DV Security Clearance) Position Description Are you an experienced Test Analyst with a background in secure or classified programmes, ready to contribute to projects of national importance? Step into a role where you'll challenge the complex to More ❯

Posted: 2 days ago

Senior Platform Delivery Consultant IRC250319

London, England, United Kingdom

GlobalLogic

Consultant IRC250319 Job: IRC250319 Location: United Kingdom - London Designation: Senior Consultant Experience: 5-10 years Function: Engineering Skills: Cloud(Azure/AWS/GCP), Containers, DevOps Practices, Grafana, Kubernetes, Observability stack, SRE Management, Terraform Work Model: Hybrid We are seeking an experienced Platform Engineering leader with a hands-on engineering background, who can articulate the business benefits that Observability and … on the responsibility of handling client engagements from both technical and business perspectives. Requirements: We are ideally looking for someone with a strong background and experience in the following: Observability and SRE Practices: In-depth understanding of observability and Site Reliability Engineering practices. Familiarity with tools in the LGTM stack (Loki, Grafana, Tempo, Mimir) or equivalent observability platforms. Containerisation: Strong More ❯

Posted: Today

Senior Site Reliability Engineer - AWS Kubernetes

London, England, United Kingdom

SGI

and firewalls. Experience with load balancers (F5, HAProxy, Nginx) and network monitoring tools. Experience in DNS management and troubleshooting. Experience in network security best practices. Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disaster … performance. Experience in tools like df, du, lsblk, and fdisk for managing and troubleshooting file systems and disk partitions. Familiarity with tools like Prometheus and Grafana for monitoring and observability More ❯

Posted: 3 days ago

Lead Site Reliability Engineer

Belgium

Tenth Revolution Group

engineering teams to deploy faster and more confidently, without compromising stability or uptime. As the SRE Lead, you'll mentor a growing team of SREs, drive best practices in observability, automation, and incident management, and collaborate cross-functionally to ensure a seamless experience for both our internal teams and customers. What You'll Be Doing: Leadership & Strategy - Lead and grow … Proven experience in an SRE or DevOps leadership role. - Deep understanding of networking, containers (Docker, Kubernetes), and cloud infrastructure (AWS/GCP/Azure). - Strong skills in monitoring, observability, and alerting systems (Prometheus, Grafana, Datadog, etc.). - Proficiency with infrastructure-as-code tools like Terraform or Pulumi. - Experience with CI/CD pipelines and GitOps practices. - Excellent communication and More ❯

Employment Type: Permanent

Salary: EUR Annual

Posted: 4 days ago

Senior Site Reliability Engineer - AWS Kubernetes

London, England, United Kingdom

Source Technology

Posted: Today

Site Reliability Engineer - Canada Life Limited

London, England, United Kingdom
Hybrid / WFH Options

ZipRecruiter

premises infrastructure to the cloud and understanding the challenges involved Familiarity with cloud security best practices, and access management (IAM), and encryption techniques Microsoft Azure certifications are a plus Observability Designing, implementing and day-to-day use of logging and monitoring tools to capture data for alerting and issue identification and resolution using DataDog, App Insights or similar tools. Designing … applications and infrastructure for observability, security, and reliability. Networking & Security Monitor and enhance network performance, ensuring high levels of security and scalability across all cloud environments. Enforce security best practices in AKS, including network policies, RBAC (Role-Based Access Control), and integration with Azure Active Directory Core Services Azure core services such as Azure Storage, including Blob, Azure VMs, Azure More ❯

Posted: 4 days ago

Senior Platform Developer

Edinburgh, United Kingdom
Hybrid / WFH Options

Registers of Scotland

WAF, CloudFront, API GW, AWS Organizations, S3, ECS, EKS, Route 53, ELBs, OpenShift, Kubernetes, Docker Languages: TypeScript, Python Security & Scanning: AWS Guardrails, Checkov, Prisma Cloud, OSV Scanner, SonarQube, Renovate Observability & Logging: CloudWatch, OpenSearch Operating System Management: RedHat Satellite, AMI lifecycle management, Ubuntu Landscape Testing Tools: Pytest, Jest, Cypress APIs/Microservices: RESTful APIs, API Gateway, containerised services Version Control: GitLab … to as Senior DevOps Engineer. On a typical day you will Design, build, and maintain scalable, high-quality software and platform systems Implement and manage CI/CD pipelines, observability, security automation, automated testing, and engineering standards Lead feature development from concept to production with focus on quality and performance Troubleshoot issues, ensuring resilience, reliability, and minimal user disruption Contribute More ❯

Employment Type: Permanent

Salary: GBP Annual

Posted: 2 days ago

Staff SRE

London, United Kingdom

Index Exchange

emergency events outside of your local time-zone. Here's what you need: Technical Expertise In-depth understanding of the Linux operating environment: kernel tuning, network stack tuning, system observability & instrumentation, and security & access management. Solid understanding of layer 2-7 networking fundamentals and the relationship between servers & services, and the transit of their packets through network hardware. In-depth … experience engineering and maintaining a private-cloud infrastructure: Bare-metal, vSphere, KVM, Kubernetes. Experience with tools like Ansible, Terraform, Docker, Kafka, Nexus Experience with observability platforms: InfluxDB, Prometheus, ELK, Jaeger, Grafana, Nagios, Zabbix Familiarity with Big Data tools: Hadoop, HDFS, Spark, HBase Ability to write code in Go, Python, Bash, or Perl for automation. Work Experience 5-7+ years More ❯

Employment Type: Permanent

Salary: GBP Annual

Posted: 2 days ago

Lead FX Trading Platform Specialist (DIR)

London, England, United Kingdom

London, United Kingdom

on industry trends and emerging technologies related to Forex trading and banking Responsibilities: Architecture & Design • Define the target microservice/event-driven architecture (latency budget, throughput, HA & DR, observability). • Own protocol & data model standards for client-to-venue and internal flows (e.g. FIX 4.4/5.0, REST/gRPC, protobuf, JSON) Hands-on Engineering • Lead development of price … binary encoded protocols (SBE, FAST), RTDS, gRPC, Kafka, HTTPS API, UDP • Pipeline: GitHub, Jenkins, TeamCity, Sonar, XLDeploy, Docker, Kubernetes • Infra as code: Terraform, Ansible, Azure cloud • Datastores: PostgreSQL, OCP • Observability: ELK, Grafana, OpenTelemetry • Batch: Airflow (Python) • Security & Compliance: TLS, OAuth2/OIDC, data masking, GDPR/MiFID controls • Project & Process: Scrum/Kanban, backlog grooming, metrics-driven retrospectives Why join More ❯

Posted: 3 days ago

GCP Technical Lead

Slough, England, United Kingdom

JR United Kingdom

a fast-paced, dynamic environment. Previous experience working on large App/Data migrations engagements. Cloud Platforms and Technology Experience Core Skills: GCP – Networking, Security tool/Best Practices Observability - Operations suite, Logging, Monitoring, Alerting. Additional Skills: Good understanding of Linux OS. Bash, Scripting, Automation, Ansible, Networking, Security. Hands-on experience with DevOps Principles and Tools. Hands-on with Terraform More ❯

Posted: Today

Solace Messaging Administrator

London Area, United Kingdom

BGC Group

built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging More ❯

Posted: Yesterday

Solace Messaging Administrator

City of London, London, United Kingdom

BGC Group

Posted: Yesterday

Lead DevOps Engineers

United Kingdom

InterQuest Group (UK) Limited

Go • Significant experience with AWS cloud infrastructure • Deep understanding of IaC tools: Terraform, Packer, CloudFormation • Proven leadership in multidisciplinary delivery teams • Skills in Databases: MongoDB/Atlas, Messaging: Kafka, Observability: Prometheus, Grafana, Splunk • Experience of working in a DevOps environment - favouring and implementing Continuous Integration & Deployment over manual processes. • Experience of designing, implementing, securing and supporting Unix/Linux based More ❯

Employment Type: Contract

Rate: £600 - 650 per day

Posted: 21 days ago