and troubleshooting interconnectivity issues. Linux Systems: Advanced Linux administration skills, including performance tuning and OS-level troubleshooting. Storage Systems: Experience with parallel/distributed file systems (eg Lustre, Ceph, WEKA, VAST). Automation & Scripting: Proficiency in Bash, Python, and tools like Ansible and Terraform for deployment and maintenance. Monitoring & Resilience: Experience implementing monitoring solutions and ensuring high availability and security More ❯
Dorset, England, United Kingdom Hybrid / WFH Options
Hays Specialist Recruitment Limited
Strong SLURM configuration skills - partitions, priorities, resource management Advanced Linux administration and performance tuning Expertise in high-performance networking (Infiniband, RoCE, RDMA) Experience with distributed file systems (Lustre, Ceph, WEKA, VAST) Proficiency in automation and scripting (Ansible, Terraform, Bash, Python) A solid understanding of monitoring, resilience, and security compliance Excellent documentation skills and a passion for mentoring and knowledge sharing More ❯
teams to ensure infrastructure alignment with scientific computing needs Lead technical planning and implementation of infrastructure improvements Provide technical guidance on architecture decisions affecting scientific workflows Manage and optimize WEKA storage systems and VSphere virtual environments Support Linux-based scientific computing environments, leveraging managed services as appropriate Implement and maintain monitoring solutions for complex computing environments Participate in capacity planning … budgeting Experience with audit responses and compliance documentation Strong experience with Linux administration and engineering Extensive knowledge of virtualization technologies, particularly VSphere Preferred Education, Experience And Skills Experience with WEKA storage systems Knowledge of AI/ML infrastructure requirements Experience supporting bioinformatics workflows Familiarity with container technologies (Docker, Kubernetes) Experience with infrastructure automation tools Understanding of scientific computing software and More ❯