Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
IBM Spectrum LSF administration and any other cloud-based HPC deployments. Deep technical knowledge in handling large distributed Linux systems. Experience working with network storage solutions such as NetApp, Lustre, Weka, LakeFS, etc. Experience in networking technologies and services such as load balancing, DNS, packet tracing and debugging, etc. Deep understanding of LDAP implementations such as Oracle Unified Directory (OUD …
least one programming language, preferably in Go. Expertise in patch and OS management at scale. Experienced in Linux performance benchmarking, tuning, and troubleshooting. Familiarity with distributed storage solutions like Lustre and Ceph. Knowledgeable in networking technologies and protocols, including Ethernet and ideally InfiniBand. Proactive and solution-oriented mindset. Excellent problem-solving skills. Initiative-driven and able to take ownership. What …
FP16, BF16, INT8, etc.). GPU utilization profiling and tuning. Inference workload modeling and scaling. AI model deployment and performance optimization. Storage: Design and operation of parallel file systems (e.g. Lustre, GPFS). Integration and optimization of NVMe storage tiers. Modeling storage throughput and demand for AI/HPC workloads. …
Central London, London, United Kingdom Hybrid / WFH Options
STK Recruitment
FP16, BF16, INT8, etc.). GPU utilization profiling and tuning. Inference workload modeling and scaling. AI model deployment and performance optimization. Storage: Design and operation of parallel file systems (e.g. Lustre, GPFS). Integration and optimization of NVMe storage tiers. Modeling storage throughput and demand for AI/HPC workloads. We have multiple upcoming roles in the High-Performance Computing industry, so …
London, South East, England, United Kingdom Hybrid / WFH Options
Solutions Through Knowledge
FP16, BF16, INT8, etc.). GPU utilization profiling and tuning. Inference workload modeling and scaling. AI model deployment and performance optimization. Storage: Design and operation of parallel file systems (e.g. Lustre, GPFS). Integration and optimization of NVMe storage tiers. Modeling storage throughput and demand for AI/HPC workloads. We have multiple upcoming roles in the High-Performance Computing industry, so …
s largest and most critical customers. Expected to work directly with customer administrative staff to solve issues. Must be willing to quickly engage with customers to resolve problems. Resolve Lustre file system issues on large, scalable customer systems and ensure customer satisfaction. Create test plans and procedures for customer upgrades and troubleshooting. Work with engineering for enhancing product quality using customer … problem solving. Proven skills and a solid team player. Good verbal and written communication skills (English, second language beneficial). Essential Technical Requirements: 7+ years of experience working with Lustre or similar parallel file systems; administration/implementation/support. Strong knowledge of Linux architecture and fundamentals. Good understanding of the technical fundamentals of the system infrastructure including Storage systems, Linux …