Platform Engineer
Are you a Platform Engineer looking to build secure, scalable, and well-observed infrastructure? Thredd is expanding its Platform function and we’re constantly innovating to keep pace with the fast-moving fintech world. Join us as we build the next generation of infrastructure to support our growing client base As a Platform Engineer, you will work across all Thredd functions to support the continuous release of highly observed, quality-assured code run within a scalable, redundant, and observed cloud platform. What You'll Do As A Platform Engineer
- Own the architecture, design, and evolution of the platform, ensuring scalability, reliability, and security.
- Lead the adoption and governance of Infrastructure as Code (IaC), including module design and best practices.
- Architect and optimise AWS environments across EC2, ECS/EKS, Lambda, S3, RDS, DynamoDB, VPC, and networking components.
- Design and oversee CI/CD strategy, supporting the SRE’s on pipeline design, test automation, and deployment patterns.
- Drive automation initiatives to eliminate manual processes, improve consistency, and accelerate delivery.
- Lead AWS cost optimisation strategies, analysing workloads, rightsizing compute, and establishing cost governance.
- Build platform roadmaps and deliver reusable tooling, enabling engineering teams with self-service workflows and improved DevEx.
- Provide technical leadership and mentorship, guiding engineers in cloud best practices, modern architectures, and operational excellence.
- Collaborate with cross-functional stakeholders (Engineering, Security, Product) to align platform capabilities with organisational goals
- Run our infrastructure with Terraform and configuration management.
- Design, build and maintain AWS platform infrastructure that facilitates scaling in support of global payments and issuance.
- Provide hands-on, prototypes, and implementations incorporating cloud engineering best practices. Develop technical content like white paper etc.
- Define and implement deployment strategies which minimize or eliminate downtime.
- Aid in root cause resolution of incidents.
- Develop ways of providing early feedback on build, test, and deployment stages to developers and QA engineers.
- Demonstrable experience working in a Platform/DevOps/SRE role.
- Demonstrable experience leveraging IaC tooling within cloud platforms, ideally Terraform.
- A deep understanding of AWS serverless architecture.
- Experience in multi-tier traditional application architectures and in service decomposition and micro services.
- Demonstrable experience deploying, running, and observing serverless workloads in ECS.
- Detailed understanding of CI/CD concepts and tooling (Bitbucket pipelines).
- Demonstrable proficiency in a programming language commonly leveraged for infrastructure and service automation e.g. Go, Python.
- A desire to solve issues through root cause analysis and implement appropriate changes to mitigate repeat occurrence.
- Excellent communication and collaboration skills.
- Experience undertaking incident management, ensuring stakeholder engagement and a consistent feedback loop.
- Exposure too/awareness of payments or financial services.
- Own it and deliver – Taking responsibility for your own performance and being successful in your own role
- Collaborate purposefully – Building trusted relationships with colleagues, supporting activities and being successful together
- Think differently – Asking questions to check understanding and sharing your ideas to support continuous improvement
- Act courageously – Stepping out of your comfort zone and embracing change to help you learn and grow