1,226 to 1,250 of 1,337 Observability Jobs

Platform Engineer

Hiring Organisation
NJF Global Holdings Ltd
Location
Brighton, East Sussex, UK
Employment Type
Full-time
join the Platform Engineering team which designs, builds, and operates the firm's global production trading infrastructure—from hardware and Linux systems to Kubernetes, observability, and build platforms. They work with a mix of open-source and in-house technologies to solve scaling, reliability, and performance problems in a highly … with a Platform Engineer who enjoys building highly scalable systems, adopting new technologies, and debugging complex production issues. Systems and platforms owned: In-house observability platform (ClickHouse, Redpanda, Rust) Firm-wide build and distribution systems Linux systems engineering for production trading On-prem Kubernetes clusters Hardware automation and operational tooling ...

Platform Engineer

Hiring Organisation
NJF Global Holdings Ltd
Location
Stoke-on-Trent, Staffordshire, UK
Employment Type
Full-time
join the Platform Engineering team which designs, builds, and operates the firm's global production trading infrastructure—from hardware and Linux systems to Kubernetes, observability, and build platforms. They work with a mix of open-source and in-house technologies to solve scaling, reliability, and performance problems in a highly … with a Platform Engineer who enjoys building highly scalable systems, adopting new technologies, and debugging complex production issues. Systems and platforms owned: In-house observability platform (ClickHouse, Redpanda, Rust) Firm-wide build and distribution systems Linux systems engineering for production trading On-prem Kubernetes clusters Hardware automation and operational tooling ...

DevOps Advisory Managing Consultant

Hiring Organisation
Stealth iT Consulting
Location
London Area, United Kingdom
KPIs to measure engineering performance and drive continuous improvement. In-depth understanding of hybrid and multi-cloud environments, CI/CD pipelines, DevSecOps, SRE, observability, and ITIL practices. Experience working with developers to implement and evolve monitoring and observability strategies is a plus. Why Join Us? Client Variety: Work with ...

DevOps Advisory Managing Consultant

Hiring Organisation
Stealth iT Consulting
Location
City of London, London, United Kingdom
KPIs to measure engineering performance and drive continuous improvement. In-depth understanding of hybrid and multi-cloud environments, CI/CD pipelines, DevSecOps, SRE, observability, and ITIL practices. Experience working with developers to implement and evolve monitoring and observability strategies is a plus. Why Join Us? Client Variety: Work with ...

Senior DataOps Engineer

Hiring Organisation
Harnham
Location
Leeds, England, United Kingdom
DataOps Engineer, you’ll take a strategic, high-level view of the data platform while still diving deep when needed. You will focus on observability, automation, pipeline performance, operational excellence, and cloud cost optimisation. You’ll work cross-functionally with Data Engineering, DevOps, and FinOps teams, helping ensure that data … self-serve with confidence. What You’ll Be Doing Taking an overview of how pipelines run across the platform, improving performance and throughput Enhancing observability and monitoring across Azure-based data workloads Identifying bottlenecks and opportunities to streamline operational processes Using scheduling/orchestration tools to optimise workflows and improve ...

Software Engineering Manager

Hiring Organisation
NETGEAR
Location
Cork, Ireland
Employment Type
Permanent
Salary
EUR 80,000 - 100,000 Annual
Stripe, PayPal, or Apple Pay is an advantage. Demonstrated experience adopting or promoting AI tooling to enhance software delivery and team efficiency. Experience with observability and performance monitoring tools such as Datadog, New Relic, or AppDynamics, and an understanding of how observability can improve mobile app responsiveness and user experience. ...

Cloud Engineer

Hiring Organisation
Flowmentum, Inc
Location
England, United Kingdom
using Terraform and PowerShell Partner closely with application engineering, QA, and SRE to improve reliability, security, and operational maturity Lead improvements to deployment workflows, observability, and platform resiliency in production environments Diagnose and resolve issues end-to-end, including legacy system constraints and cross-team dependencies Qualifications Deep expertise … production Proven experience building and maintaining Azure DevOps pipelines at scale Advanced PowerShell scripting and infrastructure provisioning with Terraform Familiarity with monitoring and observability tooling such as Azure Monitor, Application Insights, or Log Analytics is a plus What to Expect A distributed, senior-leaning team that values ownership and follow ...

Site Reliability Engineer

Hiring Organisation
Block MB
Location
Slough, Berkshire, UK
Employment Type
Full-time
error budgets, operational readiness) across product teams. Optimise performance, availability and cost across cloud environments while maintaining strong security and resilience. Build and enhance observability stacks for metrics, logging, tracing and alerting, and guide incident response. Mentor engineers in distributed systems, reliability engineering and infrastructure design, acting as a technical … scale, plus strong skills in automation, CI/CD and cloud engineering. Strong understanding of distributed systems, performance and failure modes, and experience with observability tooling. Comfortable with several languages (e.g. Python, Java, Kotlin, TypeScript) and technologies like Kafka, Flink, Redis and clustered Postgres. Familiarity with networking concepts (routing, load ...

Site Reliability Engineer

Hiring Organisation
Block MB
Location
City of London, London, United Kingdom
error budgets, operational readiness) across product teams. Optimise performance, availability and cost across cloud environments while maintaining strong security and resilience. Build and enhance observability stacks for metrics, logging, tracing and alerting, and guide incident response. Mentor engineers in distributed systems, reliability engineering and infrastructure design, acting as a technical … scale, plus strong skills in automation, CI/CD and cloud engineering. Strong understanding of distributed systems, performance and failure modes, and experience with observability tooling. Comfortable with several languages (e.g. Python, Java, Kotlin, TypeScript) and technologies like Kafka, Flink, Redis and clustered Postgres. Familiarity with networking concepts (routing, load ...

Site Reliability Engineer

Hiring Organisation
Block MB
Location
London Area, United Kingdom
error budgets, operational readiness) across product teams. Optimise performance, availability and cost across cloud environments while maintaining strong security and resilience. Build and enhance observability stacks for metrics, logging, tracing and alerting, and guide incident response. Mentor engineers in distributed systems, reliability engineering and infrastructure design, acting as a technical … scale, plus strong skills in automation, CI/CD and cloud engineering. Strong understanding of distributed systems, performance and failure modes, and experience with observability tooling. Comfortable with several languages (e.g. Python, Java, Kotlin, TypeScript) and technologies like Kafka, Flink, Redis and clustered Postgres. Familiarity with networking concepts (routing, load ...

Network Engineer

Hiring Organisation
Autonomai Recruitment
Location
City of London, London, United Kingdom
hyperscale infrastructure). They have experience building networks from 0→1 and are comfortable operating across everything from bare‐metal Linux to modern build, observability, and automation stacks. This role sits at the intersection of advanced networking, large‐scale ML/AI platforms, and high‐end automation. Network Engineer – Overview … distributed networks with expert knowledge of routing, switching, and multicast concepts, ensuring predictable performance under extreme load. Build and evolve network monitoring, alerting, and observability, integrating telemetry into operational and analytical data stores to support ML/AI and systematic strategies. Manage and optimise Kubernetes cluster networking and container orchestration ...

Network Engineer

Hiring Organisation
Autonomai Recruitment
Location
London Area, United Kingdom
hyperscale infrastructure). They have experience building networks from 0→1 and are comfortable operating across everything from bare‐metal Linux to modern build, observability, and automation stacks. This role sits at the intersection of advanced networking, large‐scale ML/AI platforms, and high‐end automation. Network Engineer – Overview … distributed networks with expert knowledge of routing, switching, and multicast concepts, ensuring predictable performance under extreme load. Build and evolve network monitoring, alerting, and observability, integrating telemetry into operational and analytical data stores to support ML/AI and systematic strategies. Manage and optimise Kubernetes cluster networking and container orchestration ...

Software Developer Apprentice

Hiring Organisation
QA
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£18,000 per annum
programming languages. Integrate applications with AWS services, particularly AWS Simple Queue Service (SQS) and Lambdas for messaging and event-driven architectures. Implement and maintain observability solutions, including logging, monitoring, and tracing, to ensure system health and performance. Integrate applications with ForgeRock Identity and Access Management (IAM) suite for authentication … authorization. Troubleshoot and resolve technical issues across the application stack, including those related to observability, AWS, and other integrations. Collaborate with cross-functional teams, including security, infrastructure, and product owners, to deliver high-quality solutions. Participate in code reviews to ensure code quality and adherence to best practices. Contribute ...

Application Support Engineer

Hiring Organisation
Pioneer Search Ltd
Location
City, London, United Kingdom
Employment Type
Permanent
Salary
GBP 80,000 - 85,000 Annual
Event Grid, API Management .NET/C# microservices, APIs and integration layers CI/CD pipelines (Azure DevOps) and deployment troubleshooting Application monitoring and observability using tools such as Azure Monitor, App Insights, Log Analytics, dashboards and alert rules Triaging and resolving incidents across data pipelines, messaging services, background jobs … patterns and identify root causes Skilled in troubleshooting APIs, microservices, SQL queries, authentication issues, configuration errors and cloud platform dependencies Experience using monitoring and observability tooling: Azure Monitor, Application Insights, Log Analytics, Kibana, Grafana or equivalent Strong understanding of operational processes including incident management, SLAs, problem management and service improvement ...

Lead Web Developer

Hiring Organisation
Reed
Location
Worthing, West Sussex, England, United Kingdom
Employment Type
Full-Time
Salary
£64,000 - £84,000 per annum, Inc benefits
design and delivery of scalable, secure, and accessible web applications Defining architecture across front end, APIs, and services (component libraries, CI/CD, observability) Writing high-quality code and conducting thoughtful code reviews Guiding technical standards: performance budgets, testing strategy, documentation, and reliability practices Collaborating with product, design, and data … integration/e2e), and cloud platforms (AWS, Azure, or GCP) Familiarity with performance optimisation (Core Web Vitals), accessibility standards, and SEO fundamentals Comfort with observability tools (logs, metrics, tracing) and secure coding practices Excellent communication skills and the ability to align stakeholders and developers Bonus: Experience with headless CMS, GraphQL ...

Lead Web Developer

Hiring Organisation
Reed
Location
Brighton, East Sussex, England, United Kingdom
Employment Type
Full-Time
Salary
£45,000 - £60,000 per annum, Inc benefits
design and delivery of scalable, secure, and accessible web applications Defining architecture across front end, APIs, and services (component libraries, CI/CD, observability) Writing high-quality code and conducting thoughtful code reviews Guiding technical standards: performance budgets, testing strategy, documentation, and reliability practices Collaborating with product, design, and data … integration/e2e), and cloud platforms (AWS, Azure, or GCP) Familiarity with performance optimisation (Core Web Vitals), accessibility standards, and SEO fundamentals Comfort with observability tools (logs, metrics, tracing) and secure coding practices Excellent communication skills and the ability to align stakeholders and developers Bonus: Experience with headless CMS, GraphQL ...

Lead Web Developer

Hiring Organisation
Reed
Location
Lewes, East Sussex, England, United Kingdom
Employment Type
Full-Time
Salary
£60,000 - £70,000 per annum, Inc benefits
design and delivery of scalable, secure, and accessible web applications Defining architecture across front end, APIs, and services (component libraries, CI/CD, observability) Writing high-quality code and conducting thoughtful code reviews Guiding technical standards: performance budgets, testing strategy, documentation, and reliability practices Collaborating with product, design, and data … integration/e2e), and cloud platforms (AWS, Azure, or GCP) Familiarity with performance optimisation (Core Web Vitals), accessibility standards, and SEO fundamentals Comfort with observability tools (logs, metrics, tracing) and secure coding practices Excellent communication skills and the ability to align stakeholders and developers Bonus: Experience with headless CMS, GraphQL ...

Software Developer

Hiring Organisation
Transunion
Location
Alderley Edge, Cheshire, United Kingdom
Employment Type
Permanent
build reliable backend systems and infrastructure tooling Use TDD to write high-quality, maintainable code and build out automated test suites Own reliability, observability, and performance of key services Collaborate with clients to understand requirements, debug issues, and propose solutions Drive improvements to system architecture, automation, and deployment processes Mentor … Desirable Skills & Experience: Experience owning backend systems in production environments Experience with Cloud Platforms AWS or GCP Infrastructure-as-code, CI/CD, and observability tooling Experience scaling systems under sustained load Contributions to internal tooling or open source Experience with large datasets and machine learning models Impact ...

Senior Cloud Platform engineer UK/Remote

Hiring Organisation
Tenth Revolution Group
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£350.00 - £400.00 per hour
cloud Platform engineer with Snowflake experience. Key Responsibilities: Design and implement Infrastructure as Code (IaC) solutions.Optimize networking, security, and containerization within Azure DevOps environments.Support observability and monitoring strategies.Contribute to self-service enablement for BI and data platforms. Required Skillset: 5+ years Azure DevOps expertise across: IaCNetworkingSecurityContainerizationStrong exeprience with Snowflake.Experience with … observability and monitoring tools.Ability to collaborate on self-service BI capabilities. Location: Remote (UK only) Contract Type: 6 months (Possibility for extension) Start Date: January Daily rate - £400 (outside IR35) Sponsorship is not possible for this role. ...

Azure & Snowflake DevOps Engineer

Hiring Organisation
Tenth Revolution Group
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£400/day
Responsibilities Design and implement Infrastructure as Code (IaC) solutions using Azure DevOps. Configure and manage networking, security, and containerisation within Azure environments. Drive observability and monitoring across platforms. Support creation of self-service capabilities for BI and analytics teams. Collaborate with stakeholders to ensure robust, scalable, and secure cloud solutions. … DevOps and related tooling. Expertise in IaC, networking, security, and containerisation. Exposure to Snowflake or other modern data platforms. Solid understanding of monitoring and observability frameworks. To discuss this role further please submit your CV or contact Brandon Forbes Tenth Revolution Group are the go-to recruiter for Data ...

Azure & Snowflake DevOps Engineer

Hiring Organisation
Tenth Revolution Group
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£400 per day
Responsibilities Design and implement Infrastructure as Code (IaC) solutions using Azure DevOps. Configure and manage networking, security, and containerisation within Azure environments. Drive observability and monitoring across platforms. Support creation of self-service capabilities for BI and analytics teams. Collaborate with stakeholders to ensure robust, scalable, and secure cloud solutions. … DevOps and related tooling. Expertise in IaC, networking, security, and containerisation. Exposure to Snowflake or other modern data platforms. Solid understanding of monitoring and observability frameworks. To discuss this role further please submit your CV or contact Brandon Forbes Tenth Revolution Group are the go-to recruiter for Data ...

Agentic Developer - Building guardrails for autonomous AI

Hiring Organisation
governr
Location
London Area, United Kingdom
level proficiency in Python, Rust, or Go (you write systems that can't fail) • Deep understanding of distributed systems, real-time data processing, and observability architectures • Production ML/AI experience : You've deployed models, debugged their failures, and built monitoring around them • System design mastery : You can architect … autonomous decision-making, goal-directed behaviour, tool use, memory systems • Familiarity with AI safety concepts : alignment, interpretability, robustness, adversarial examples • Experience with monitoring/observability : instrumentation, logging, tracing, alerting in complex systems Working Style: • You ship to production regularly and own what you deploy • You write documentation that others ...

Agentic Developer - Building guardrails for autonomous AI

Hiring Organisation
governr
Location
City of London, London, United Kingdom
level proficiency in Python, Rust, or Go (you write systems that can't fail) • Deep understanding of distributed systems, real-time data processing, and observability architectures • Production ML/AI experience : You've deployed models, debugged their failures, and built monitoring around them • System design mastery : You can architect … autonomous decision-making, goal-directed behaviour, tool use, memory systems • Familiarity with AI safety concepts : alignment, interpretability, robustness, adversarial examples • Experience with monitoring/observability : instrumentation, logging, tracing, alerting in complex systems Working Style: • You ship to production regularly and own what you deploy • You write documentation that others ...

Staff Backend Engineer (Python | AI Lab | £170,000)

Hiring Organisation
Paradigm Talent
Location
Slough, Berkshire, UK
Employment Type
Full-time
Role: Staff Software Engineer (Python | Backend | Infrastructure) Location: Hybrid - 2-3 days in London Office Compensation: Up to £170,000 + equity We're working with a frontier AI lab pushing the boundaries of computational ...

Staff Backend Engineer (Python | AI Lab | £170,000)

Hiring Organisation
Paradigm Talent
Location
City of London, London, United Kingdom
Role: Staff Software Engineer (Python | Backend | Infrastructure) Location: Hybrid - 2-3 days in London Office Compensation: Up to £170,000 + equity We’re working with a frontier AI lab pushing the boundaries of computational ...