Assistant Vice President, Site Reliability Engineer

About CLS:

CLS is the trusted party at the centre of the global FX ecosystem. Utilized by thousands of counterparties, CLS makes FX safer, smoother and more cost effective. Trillions of dollars’ worth of currency flows through our systems each day.

Created by the market for the market, our unrivalled global settlement infrastructure reduces systemic risk and provides standardization for participants in many of the world’s most actively traded currencies. We deliver huge efficiencies and savings for our clients: in fact, our approach to multilateral netting shrinks funding requirements by over 96% on average, so clients can put their capital and resources to better use.

CLS products are designed to enable clients to manage risk most effectively across the full FX lifecycle – whether through more efficient processing tools or market intelligence derived from the largest single source of FX executed data available to the market.

Our ambition to make a positive difference starts with our people. Our values – Protect, Improve, Grow – underpin everything that we do at CLS and define and shape a supportive and inclusive working environment in which everyone is encouraged to be open and forward-thinking.

Job information:

  • Functional title – Site Reliability Engineer
  • Department – Technology
  • Corporate level – Assistant Vice President
  • Reports to – Vice President
  • Location – London, onsite 2 days per week

Job Purpose:

The role is primarily responsible for developing SRE methodologies and ensuring they are applied to the cloud-hosted environment. It also acts as a central point of expertise for SRE automation across the Platform Operations team.

Essential Job Functions:

  • Drive the implementation of SRE methodologies within the CLS environment, collaborating closely with other infrastructure teams to optimize infrastructure and deployment processes, with a focus on automation and operational excellence.
  • Drive continuous improvement in system observability, alerting and capacity planning by defining and implementing SLAs, SLOs and SLIs.
  • Define and enhance frameworks for identifying, analysing and remediating toil, eliminating or automating recurring tasks and issues wherever possible.
  • Develop secure, high-quality production code, and review and debug code written by others.
  • Build out and enhance GitOps capabilities for the cloud-hosted environments using tools such as Terraform and Ansible Automation Platform.
  • Provide on-call support and escalation for cloud and automation related issues, ensuring that production stability is the primary requirement.
  • Ensure risks and stability issues in the cloud-hosted environment are understood and, where possible, addressed through SRE best practices as part of incident postmortems.

Minimum Education Required:

  • Bachelor’s degree or equivalent
  • Industry-standard IT certification desired (e.g. AWS, Terraform)

Minimum Job-Related Experience Required:

  • Strong technical operational support experience within an infrastructure services team, including on-call duties such as handling tickets, owning incidents and investigating their root causes.
  • Minimum of 2 years' experience applying SRE methodologies within a support team, with an understanding of the associated service level metrics.
  • Strong knowledge of at least one scripting language, preferably Python or Ansible. PowerShell is also a plus.
  • Experience supporting and building multi-environment, multi-region platforms with cloud providers such as AWS or GCP, and managing them through Infrastructure as Code and GitOps methodologies.
  • Experience with observability/APM tools (e.g. Grafana, Datadog, Dynatrace).
  • Experience of working in a regulated financial services / banking organization.
  • Excellent troubleshooting, analytical, and communication skills with both business and technical staff.

Special Skills/Knowledge:

  • Software Development background.
  • Familiar with the ITIL framework.
  • Experience with using Ansible Automation Platform.
  • Strong service-oriented mindset, with the ability to consistently deliver a high level of service to the business.
  • Able to work proactively and provide regular updates to management and stakeholders.

Our commitment to employees:

We are a small company with a big mandate, so every person is essential to our success. We are also committed to employing and retaining the most talented and dedicated people.

What makes us interesting goes beyond our competitive salaries and great benefits. Our work environment is designed around quality outcomes, not output. The FX market would cease to function without our services, and we take pride in being responsible for keeping it running smoothly.

We are different from other financial institutions in that we have a flatter and more transparent structure with accessible leadership. You will be seen, heard and empowered to develop your career.

We are a purpose-driven organization, with an inclusive culture that focuses on doing what is right. The well-being of our people is as important to us as the resilience of our systems. In addition to encouraging our people to ‘locate for their day,’ we run a range of initiatives that support employees’ sense of belonging and physical, emotional and mental well-being.

Our extensive benefits for employees typically include:

  • Vacation/annual leave: 25 days in UK/Asia + 3 life days, 23 in US + 3 life days
  • Private medical and dental cover and life insurance
  • Generous pension contributions in the UK and Asia; matching 401(k) in the US
  • Paid volunteer days
  • ‘Locate for your day’ hybrid working – 2 days a week in office.
  • Access to Discover – our learning platform with 1000+ courses from LinkedIn Learning.
  • Paid parental leave / Coaching and support services
  • Career development / LinkedIn Learning
  • ‘Heads down days’ with no meetings on the last Friday of every month
  • Wellbeing / Mental health support
  • Diversity Council / Affinity groups (Women’s Forum, Black Employee Network, Pride Network, Parents & Caregivers Network, Sustainability Network)
  • Social events

Awards:

  • The Sunday Times Best Places to Work 2023 & 2024 / Big Company / The Sunday Times Awards
  • Third place in Britain’s Healthiest Workplace 2022 / Medium Company / Vitality Awards

Job Details

  • Company – CLS Group
  • Location – City of London, London, United Kingdom