role Remote, £550, Inside IR35, 6-month contract
Key skills needed:
- Designing/implementing Unix/Linux systems and services, open-source solutions, and performance tuning.
- HPC technologies: Lustre, Slurm.
- Configuration systems such as Ansible and Terraform.
- Unix/Linux scripting.
- Networking: TCP/IP, DHCP, VLANs, spanning tree protocol, link aggregation for performance (MTU settings) and reliability requirements.
file systems (e.g., Lustre), and HPC tools (e.g., Bright) • Understanding of networking (InfiniBand/Ethernet) and storage platforms (DDN, NetApp, IBM, Dell EMC) • Experience with batch schedulers (PBS Pro, Slurm, SGE/UGE, Microsoft Scheduler) Get in touch for more details
HPC tools (such as Bright) * Networking knowledge: Mellanox InfiniBand or Ethernet * Experience with storage platforms: DDN, NetApp, IBM, Dell EMC * Familiarity with batch scheduling systems such as PBS Pro, Slurm, SGE/UGE, Microsoft Scheduler. It's still worth applying even if you don't meet every requirement. If you have solid Linux knowledge and a passion for developing …
Employment Type: Full-Time
Salary: £50,000 - £60,000 per annum, inc. benefits, OTE
Stevenage, Hertfordshire, South East, United Kingdom
Sanderson Government and Defence
HPC tools (such as Bright) * Networking knowledge: Mellanox InfiniBand or Ethernet * Experience with storage platforms: DDN, NetApp, IBM, Dell EMC * Familiarity with batch scheduling systems such as PBS Pro, Slurm, SGE/UGE, Microsoft Scheduler. It's still worth applying even if you don't meet every requirement. If you have solid Linux knowledge and a passion for developing …
Oxford District, South East England, United Kingdom
Ellison Institute of Technology
Computing Facility, the HPC Engineer will design, deploy, and optimise systems that enable large-scale data processing, AI-driven analytics, and simulation workloads across. For example, deploying Kubernetes and Slurm to enable real-time data analysis from instruments, MLOps, or scientific workflow managers. We will be hiring either at the regular or senior level, depending on the applicant's … computational research workloads. Evaluate and integrate advanced technologies including GPU/TPU acceleration, high-speed interconnects, and parallel file systems. Manage HPC environments, including Linux-based clusters, schedulers (e.g., Slurm), and high-performance storage systems (e.g., Lustre, BeeGFS, GPFS). Implement robust monitoring, fault-tolerance, and capacity management for high availability and reliability. Develop automation scripts and tools (Python … or cloud computing) in scientific or research settings. Proficiency in Linux system administration, networking, and parallel computing (MPI, OpenMP, CUDA, or ROCm). Experience using HPC job schedulers (Slurm preferred) and parallel file systems (Lustre, BeeGFS, GPFS). At the senior level: Extensive experience designing, deploying, and managing HPC clusters (or cloud computing) in scientific or research settings.
Stevenage, Hertfordshire, South East, United Kingdom
Anson McCade
scripting, particularly Bash, Python, and at least one other language. Clustering: Experience with clustered environments and cluster orchestration tools. Storage: Experience with clustered, parallel file systems (e.g., Lustre). Workload Management: Experience managing batch scheduling systems (PBS Pro, Slurm, SGE/UGE, etc.). HPC Knowledge: Knowledge of HPC management systems (e.g., Bright). Networking/Storage Admin …
You'll help shape the orchestration layer for one of the most advanced AI compute environments in the world. Your work will involve: Designing core platform services for cluster provisioning, workload orchestration, and resource management APIs. Building integrations with schedulers (Kubernetes, Slurm) and container runtimes for reliable, high-performance GPU workloads. Developing automation for deployment, imaging, and multi-tenant …