like organic search retrieval and personalized recommendations. Continuously assess and improve the health of the team's data stack, driving efforts to increase reliability, observability, and efficiency. Advocate for modern data engineering practices (e.g., data quality, monitoring, reproducibility) across the squad and broader org. Contribute to technical direction and strategic More ❯
Five or more years' successful track record selling complex software into the capital markets space specialising in technology that operates in the monitoring or observability space. A working understanding of how financial trading technology is deployed and used within capital markets and a working knowledge of the application infrastructures and More ❯
best. To learn more about life at LogicMonitor, check out our Careers Page. What You'll Do: LM Envision, LogicMonitor's leading hybrid observability platform powered by AI, helps modern enterprises gain operational visibility into and predictability across their IT stacks, so they can continue to deliver extraordinary employee More ❯
obsess over developer experience and write code we'd actually want to use ourselves. We're applying that same engineering mindset to Logfire, our observability platform built for today's development reality: AI workloads, multi-language environments, and cloud infrastructure that's designed to be straightforward to set up and More ❯
involves things like Working closely with internal engineers to identify pain points Making sure the product experience is as good as possible Setting up observability around how the platform is performing but also how users are interacting with the platform Why Join Neo4j? Neo4j is, without question, the most popular More ❯
software deployment and scalability. CI/CD Expertise: Automate software build, test, and deployment pipelines following agile methodologies. Terraform Exposure: Beneficial experience with Terraform. Observability Tools: Experience with Grafana and Splunk is beneficial, particularly in developing and applying an observability strategy across a large organization. Learn More For more information More ❯
is dedicated to empowering development teams by creating toolchains, guidelines, and standards. Our focus is on enabling seamless automation and CI/CD, comprehensive observability, and unwavering reliability in a secured cloud-native environment. The Opportunity The Staff Engineer position within the Platform As a Service team offers a compelling … adept professional with a rich background in architecting, constructing, and managing scalable infrastructures. This role specifically concentrates on three key areas: CI/CD, Observability, and application hosting. As a senior member of the Platform As a Service team, you will wield significant influence over Algolia's Search Products. Your … and executing systems pivotal to ensuring reliability, scalability, and cost optimisation. You will be instrumental in architecting robust CI/CD pipelines, establishing comprehensive observability frameworks, and managing hosting solutions focused on API Management and micro-services management. Moreover, as an expert within the team, you will actively participate in More ❯
london, south east england, united kingdom Hybrid / WFH Options
Harrington Starr
and help build the next generation of scalable, cloud-native infrastructure. This role sits in a high-impact platform engineering team focused on automation, observability, and empowering development teams to ship faster and more securely. Why You Should Apply: Work with a forward-thinking, global financial firm Hybrid setup … and maintaining AWS-based infrastructure using Terraform Improving CI/CD pipelines with Python and Git workflows Supporting containerised environments (Docker/K8s) Driving observability with Grafana and proactive monitoring tools Enhancing developer experience through smart automation and tooling What We’re Looking For: 3 years of experience in Platform More ❯
Reigate, Surrey, United Kingdom Hybrid / WFH Options
Willis Towers Watson
in a product team to develop and support operationally resilient cloud infrastructure. The ideal candidate will have a track record in Microsoft Azure and Observability platforms in complex SaaS environments and have excellent communication skills. You will be joining our growing engineering organization building a wide range of market-leading … on high cadence and cost effectiveness Implement infrastructure as code with Pulumi Support the team in infrastructure and networking related issues Maintain and configure observability platforms such as Datadog Proactively monitor production and other environments to ensure stability, availability, security and integrity Participate in incident response, troubleshooting, and root cause … skills (PowerShell, Terraform, ARM, Pulumi, Bicep etc.) Experience of Microsoft Azure in areas such as networking, storage, integration, compute and analytics Experience of cloud observability concerns (logging, tracing, metrics, monitoring & alerting) Experience of Windows & Linux containers and orchestration platforms (Docker, Kubernetes) Strong interpersonal skills, with the ability to work effectively More ❯
ensuring the platform is stable. Drive and own the Monitoring strategy, defining clear goals, objectives, and deliverables. Optimise and reduce operational overheads through observability and service automation. Lead the definition and tracking of Service Level Objectives (SLOs) to measure service availability, in combination with the service, product and engineering communities. Collaborate … to prioritize and manage multiple tasks in a fast-paced environment. Experience in software development, infrastructure, or operations roles. Strong background in, or appreciation of, observability principles, techniques and toolsets. Demonstrable knowledge of developing and managing RESTful API services written in a modern OO language such as Java or Python. Knowledge … C#. Understanding of, or experience working within, an Incident Management Process (ITSM). Desirable Requirements: AWS Linux - Debian, CentOS, Alpine and AWS Linux; Terraform, Docker, Kubernetes, Git; Observability/APM Platforms; Jenkins, Nginx, MySQL. Benefits: We are actively committed to promoting a fully diverse and inclusive workforce and we welcome applications for this More ❯
of platform engineering maturity. You'll innovate to keep our products' global platform reliable, secure and fast. You'll help enhance our system availability, observability, security and reliability whilst resolving issues before they impact our customers. You'll also play a role in the continued evolution of software delivery tooling … easily with third-party providers, engineering teams and business stakeholders to ensure you offer the best possible experience. Self-motivated: You're passionate about observability, availability and issue management. You enjoy solving problems and like to challenge yourself by quickly identifying and mitigating an issue before moving on to the … next one. What You'll Do: Implement best practices for CI/CD pipelines, infrastructure-as-code, and observability that supports our continuous delivery practices. Lead initiatives to improve system reliability, performance, and security. Writing tooling to support our self-service automation portal to improve visibility for engineers. Creating and More ❯
Sheffield, Yorkshire, United Kingdom Hybrid / WFH Options
TieTalent
projects simultaneously using Agile practices. The ideal candidate will also have knowledge around or an interest in learning other key DevOps areas such as observability, CI/CD pipeline development and config management. The company have a personal development budget available to all staff for such courses and accreditations, to … services and architecture Strong experience working with Terraform (or other IaC technology) Proven team leadership experience Experience working with CI/CD pipelines (Jenkins), Observability (Grafana) & Configuration Management (Ansible, Chef, Puppet) Excellent communication skills are a must Along with an excellent work/life balance, this company also offer a More ❯
City, Edinburgh, United Kingdom Hybrid / WFH Options
TSB Bank
and implement improvements to processes and tooling to ensure engineering excellence. You'll have a strong understanding of operational requirements, and ensure Scalability, Resiliency, Observability, Security, Cost and Maintainability are at the forefront of all engineering activities. What you'll bring: Bachelor's or Master's in Computer Science is … of software engineering including testing frameworks, and secure code delivery (i.e. SCA, SAST, DAST). Scripting/Coding (Bash, Python). End-to-end observability solutions (logging, monitoring, alerting). Strong problem-solving skills including level 3 support (not out of hours) and root cause analysis experience. What we offer More ❯
and mitigate technical debt, ensuring a balance between quick wins and long-term stability. Oversee the refactoring and modernization of legacy applications, ensuring observability, performance, and security. Collaborate with Chief Engineers and engineering teams to define best practices for ETL, data integration, and cloud adoption. Ensure engineering … related tools. Strategic Thinking – Ability to drive long-term engineering strategy while delivering incremental value. Technical Debt Management – Experience identifying and remediating inefficient architectures. Observability & Performance Optimization – Familiarity with monitoring and logging tools (e.g., Datadog, Splunk, Prometheus, New Relic). Stakeholder Management – Ability to engage with senior leadership, product managers More ❯
london, south east england, united kingdom Hybrid / WFH Options
MarkJames Search
Job Title: Site Reliability Engineering (SRE) Lead – Observability Location: Stratford, London (Hybrid – 2 days per week onsite) Contract Length: 6 months Rate: £450–£500 per day (Inside IR35) Industry: Financial Services A leading Financial Services organisation in London is seeking a Site Reliability Engineering (SRE) Lead – Observability to join their … hybrid role requiring two days per week onsite at their Stratford, London offices. The role sits Inside IR35. Key Responsibilities: Lead the SRE Observability team and champion observability practices across multiple product groups. Provide thought leadership from the Cognizant delivery team on all things SRE. Leverage hands-on experience … with Datadog to implement and enhance observability capabilities. Guide and oversee the day-to-day operation and maintenance of observability tools. Partner directly with engineering teams to support delivery of observability backlogs. Collaborate with product teams to create monitoring and alerting blueprints, patterns, and automation. Capture, analyse, and report on More ❯
Senior MLOps (Full Stack) Engineer London Foundation Models Job details Posted 30 April 2025 Salary £80,000 - £110,000 per annum Benefits: Equity Location: London Job type: Permanent Discipline: AI/Machine Learning Reference: BK-45-1 What you'll More ❯
York, Yorkshire, United Kingdom Hybrid / WFH Options
Cathcart Technology
System Administrator Location: York (Hybrid - 1 day in the office per week) Hours: 37.5 per week, flexitime available Salary: Up to 55k - depending on experience About the Role: I'm looking for an experienced System Administrator to join a well More ❯
East Riding, Yorkshire, United Kingdom Hybrid / WFH Options
Cathcart Technology
System Administrator Location: York (Hybrid - 1 day in the office per week) Hours: 37.5 per week, flexitime available Salary: Up to 55k - depending on experience About the Role: I'm looking for an experienced System Administrator to join a well More ❯
Platform Engineer Location: Leeds Salary: Competitive salary & Package (Depending on level of experience) Please Note: Any offer of employment is subject to satisfactory BPSS and SC security clearance which requires 5 years continuous UK address history at the point of More ❯
Blip is a leading tech company focused on software engineering solutions for sports entertainment. We operate at scale. As part of Flutter Entertainment, we play an essential role in the Group's goal of becoming the global leader in online More ❯
Bexhill-on-sea, Sussex, United Kingdom Hybrid / WFH Options
Hastings Direct
us. Please see our Data Protection Statement and Job Application Terms & Conditions here for details on how your information will be stored. Senior Monitoring & Observability Engineer We're a digital insurance provider with ambitious plans to become The Best and Biggest in the UK market. Over the past few years … individuals who are eager to make a difference and contribute to our success. Job Details As a Senior Site Reliability engineer for Monitoring and Observability, you will be part of the Technology Engineering team within CIO supporting the definition, maintenance and implementation of SRE strategies and principles around Monitoring and … Observability for Hastings. Support the definition, maintenance and implementation of SRE strategies including observability and event management. Design, build, and maintain scalable and reliable monitoring and observability solutions for the IT infrastructure and applications, using various technologies and platforms. Design, build and maintain scalable and reliable event management capabilities to More ❯