Remote Observability Jobs in Norwich

3 of 3 Remote Observability Jobs in Norwich

Database Reliability Engineer - Core Team

Norwich, Norfolk, UK
Hybrid/Remote Options
ClickHouse
private cloud companies. With over 2,000 customers and ARR that has more than quadrupled over the past year, ClickHouse leads the market in real-time analytics, data warehousing, observability, and AI workloads. ClickHouse's incredible momentum was confirmed in its recent $350M Series C financing, which included new tier-one investors Khosla Ventures, BOND, IVP, Battery Ventures and Bessemer …
Employment Type: Full-time

Engineering Manager - Platform

Norwich, Norfolk, UK
Hybrid/Remote Options
Ripjar
developing teams that take pride in operational excellence. Key responsibilities: Lead and manage a team of platform and operations engineers delivering the public/private cloud infrastructure, automation, and observability capabilities that underpin Ripjar's mission-critical products. Coach, mentor, and develop your team through regular 1:1s, feedback, and tailored growth plans. You'll champion psychological safety and a … fault tolerant. Oversee operational excellence, including incident management, root cause analysis, and continuous improvement of system reliability. Refine and evolve platform processes, such as deployment pipelines, change management, and observability practices, to drive speed and stability as we deploy new capabilities. Foster a DevOps culture, empowering engineering teams to own their services end-to-end while providing robust shared tooling … workloads are distributed on bare metal using Spark, HBase and HDFS. We make extensive use of Terraform and Ansible for IaC, our CI/CD uses GitHub Actions, our observability is provided via Datadog, and we increasingly orchestrate our workloads using K8s. Skilled in agile delivery, incident management, and balancing operational priorities with longer-term company objectives. Thought leadership …
Employment Type: Full-time

Senior Site Reliability Engineer

Norwich, Norfolk, UK
Hybrid/Remote Options
TechNET IT Recruitment Ltd
improvements across the platform. Participate in an on-call rotation (one week every 4–5 weeks) to ensure 24x7 availability of critical systems. Collaborate with internal teams to improve observability, monitoring and alerting across services. Identify and implement operational improvements to existing monitoring, logging and incident response processes. Use scripting and automation (primarily Bash and Python) to reduce toil and … Practical scripting skills in Bash and/or Python for automation and tooling. Familiarity with IaC tools such as Ansible or Puppet. Good understanding of monitoring, alerting, logging and observability best practices. Excellent communication skills and the ability to own incidents end-to-end, including post-incident reviews.
Employment Type: Full-time