1 of 1 Incident Response Jobs in Wolverhampton

Senior Infrastructure Support Engineer

Hiring Organisation
Nscale
Location
Wolverhampton, West Midlands, UK
Employment Type
Full-time
innovation, and environmental responsibility. At Nscale, our Support and Operations team plays a critical role in maintaining service availability, driving service reliability and rapid response to customer tickets We thrive on a culture of relentless innovation, ownership, and accountability, where every team member takes pride in their work … . Practical experience with GPU drivers and GPU logs investigation tools, e.g. nvidia-smi. Performance diagnostics using NCCL on large scale clusters. Observability and incident response. Build and use alerting stacks and dashboards, interpret metrics and alerts, and drive runbooks to resolution; contribute to SLOs and post‐incident