skills in C++ programming are required, with Python experience being a plus. Familiarity with storage solutions, protocols, and technologies such as ZFS, NFS, object stores, S3, Google Cloud Storage, Lustre, and databases. Strong software design, testing, deployment, and monitoring skills in a large distributed compute cluster. Bachelor's or Master's degree in Computer Science, Computer Engineering, or a related More ❯
least one programming language, preferably in Go. Expertise in patch and OS management at scale Experienced in Linux performance benchmarking, tuning, and troubleshooting Familiarity with distributed storage solutions like Lustre and Ceph Knowledgeable in networking technologies and protocols, including Ethernet and ideally Infiniband Proactive and solution-oriented mindset Excellent problem-solving skills Initiative-driven and able to take ownership What More ❯
Skills You'll Need: A desire for operational work as primary job function 2+ years of professional experience with Linux systems High performance computing (HPC), including parallel filesystems (e.g., Lustre, GPFS), batch systems (e.g., Slurm, Grid Engine), and high-performance network interconnects experience is a plus, but not required High proficiency with at least one programming/scripting language (e.g. More ❯
core hours of 10am - 4pm, with no on-call demands, you'll dive deep into the world of cluster computing, Linux kernels, and cutting-edge storage tools like GPFS, Lustre, and Isilon. If you bring strong professional HPC experience, then I want to hear from you! Academic qualifications are secondary to your technical prowess Let's connect and explore how More ❯
Central London, London, United Kingdom Hybrid / WFH Options
STK Recruitment
FP16, BF16, INT8, etc.) GPU utilization profiling and tuning Inference workload modeling and scaling AI model deployment and performance optimization Storage Design and operation of parallel file systems (e.g. Lustre, GPFS) Integration and optimization of NVMe storage tiers Modeling storage throughput and demand for AI/HPC workloads We have multiple upcoming roles in the High-Performance Computing industry, so More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Solutions Through Knowledge
FP16, BF16, INT8, etc.) GPU utilization profiling and tuning Inference workload modeling and scaling AI model deployment and performance optimization Storage Design and operation of parallel file systems (e.g. Lustre, GPFS) Integration and optimization of NVMe storage tiers Modeling storage throughput and demand for AI/HPC workloads We have multiple upcoming roles in the High-Performance Computing industry, so More ❯