Senior Site Reliability Engineer
- Hiring Organisation
- Realm
- Location
- City of London, London, United Kingdom
Culture centred on pragmatic problem-solving, cross-functional collaboration, and full lifecycle responsibility. Role Overview: Position operating across software, infrastructure, and operations to ensure reliability, scalability, and performance of a globally distributed compute platform. Close collaboration with networking, platform engineering, and physical infrastructure teams to design and operate … systems supporting high-demand computational workloads. Hands-on engineering role requiring strong systems expertise, with responsibility for resolving complex production issues, improving system resilience, and enhancing platform observability. Responsibilities Deployment and management of large-scale compute clusters using automation tooling, with adaptation to customer requirements Validation and optimisation ...