Senior Site Reliability Engineer
ClearScore is a workplace like no other. Over the past ten years, we’ve disrupted an entire industry and built a user base of millions. At the heart of this success is our culture: we work hard, embrace change, and treat each other with respect, enabling everyone to reach their potential and deliver results that truly make a difference. We’re on a mission to positively impact the lives of our users. Our product puts their needs at the centre of our thinking. Through cutting-edge technology, insightful analytics and beautiful design, we help our users build financial confidence and make better financial decisions. We want our people to perform at their best, so we trust everyone to work in a way that suits them, focusing on output rather than time spent at a screen. We have an inclusive culture where everyone is encouraged to look after their wellbeing while growing and developing their career. What you will be doing:
- Drive architectural change by participating in RFCs, architecture forums and platform-wide initiatives to improve reliability, scalability and efficiency
- Lead and evolve ClearScore’s Kubernetes platform, designing, upgrading and optimising clusters at scale while shaping how we use Kubernetes across the company
- Troubleshoot and resolve complex production issues independently, using a deep understanding of distributed systems and containerisation to mitigate and prevent incidents
- Design and contribute to Kubernetes controllers and automation tools to improve our infrastructure and developer experience
- Enhance our AWS estate, ensuring cost efficiency, security and scalability while promoting best practices across teams
- Collaborate with developers to improve service observability, implement metrics and alerting strategies, and build meaningful dashboards for complex systems
- Build and maintain CI/CD pipelines from scratch for new use cases, manage migrations, and introduce new tooling where beneficial
- Contribute to open source projects through fixes, feedback or new tools aligned with our mission
- Mentor and guide mid-level SREs and other engineers, helping them develop deep technical expertise and operational excellence
- Expert-level Kubernetes knowledge, including experience with cluster upgrades, networking (CNI), container runtimes and troubleshooting node-level issues
- Strong AWS expertise, including architecture, networking and cost management, with an awareness of industry standards and the ability to influence adoption across teams
- Deep understanding of Linux internals, containerisation and operating system-level performance tuning
- Proficiency in at least one compiled language (for example Go, Rust or C++) and one interpreted language (for example Python or Bash)
- Proven ability to automate infrastructure, deployments and monitoring with strong scripting skills across multiple languages
- Experience designing, deploying and operating distributed systems with complex failure modes
- Strong networking fundamentals, capable of debugging complex routing or firewall issues and designing resilient architectures
- Hands-on experience with CI/CD pipelines and tooling such as Jenkins, ArgoCD or Spinnaker, including building and managing large-scale migrations
- Deep observability expertise, from instrumenting applications and building dashboards to managing large monitoring stack upgrades and integrations
- 25 paid holidays and a “duvet day” on your birthday
- Hybrid Work Environment
- Private health and dental cover - including mental health support through Bupa
- GP office visits
- Life assurance scheme
- Up to 6% matched pension
- Regular Lunch and Learns with guest speakers
- Dog-friendly office
- Daily breakfast and free snacks
- Access to discounts via Cobens Extras
- Free sports and social clubs
- Continued investment into learning and development
- Leadership-led training
- In-house psychotherapist
- Financial coach to help you plan and achieve your goals
- No clock-watching culture
- Generous maternity and paternity plans
- Culture and inclusion representatives
- Transparent pay structure and a career growth plan
- Levels 1-5: Minimum 2 days per week in-office
- Level 6 and above: Minimum 3 days per week in-office
- Flexibility to manage your work and life
- Dedicated in-office days for team building and collaborative projects
- Office facilities (with plants!) designed for productive interactions
- Clear expectations and support for maintaining our hybrid schedule