ISO 27001, SOC 2). Collaborate with ML/AI Teams Package and deploy large‑language‑model (LLM) training jobs on distributed GPU clusters (Slurm, Ray, Kubeflow, or AWS SageMaker). Optimize model‑serving (Triton, vLLM, TorchServe) for low‑latency, high‑throughput inference. Cost & Performance Optimization Track cloud spend More ❯
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
XMOS
Testing infrastructure including pytest as well as managing machines used for testing on real hardware Jenkins for CI/CD Ansible for provisioning servers Slurm for clustered resource allocation Atlassian suite for knowledge base (confluence) and organisation (JIRA) The ideal candidate You're a fast learner who has some More ❯
south yorkshire, yorkshire and the humber, United Kingdom Hybrid / WFH Options
Chapman Tate Associates
Perform routine maintenance, updates, and capacity planning for HPC infrastructure. Essential Skills: Strong experience with HPC systems, including cluster management and scheduling tools (e.g., Slurm, PBS, or Grid Engine). Proficiency in Linux-based systems and shell scripting. Experience with performance tuning and optimization of large-scale compute systems. More ❯
Stevenage, Hertfordshire, United Kingdom Hybrid / WFH Options
WISE Campaign
CCP-EM, Doppio, and CryoCloud. Experience with modern software development tools and practices, including object-oriented programming, Git/GitHub, DevOps tools, AI tools, Slurm, and Google Cloud. Web development, including browser-based visualisation and plug-ins. Proven expertise in leading the analysis and interpretation of scientific data. Closing More ❯
london (city of london), south east england, United Kingdom
Ncounter Technology Recruitment
Willingness to engage in technical discussion and commit to producing high quality code Enthusiasm to learn and grow in your role Any understanding of Slurm and HPC a bonus Developing in Python within an SRE team spanning across the business with project and product work, there is a huge More ❯
Farnborough, Hampshire, United Kingdom Hybrid / WFH Options
Avature
and build performance extrapolation to future generation of HPC/AI hardware. Interact with customers and the Lenovo sales team to offer insight into workload performance characteristics that drive system configurations. Complete competitive comparison studies of different technologies to showcase Intel technology advantages. Develop seller enablement collateral for Lenovo … than one of OpenMP, MPI, CUDA, ROCm, OpenCL, SYCL paradigms. Experience of production HPC environment: large-scale filesystems (ideally Storage Scale), batch scheduling (ideally SLURM) as well as common HPC SW and management tools. Experience with analysis and profiling tools for HPC/AI codes: Intel OneAPI suite (Vtune More ❯
sheffield, south yorkshire, yorkshire and the humber, United Kingdom
Chapman Tate Associates
Engineer Location: Remote – Sheffield Salary: Up to £45,000 plus an excellent benefits package HPC, Linux Systems Admin, Supercomputing, Scripting, UNIX, Linux, Nvidia, GPU, Slurm, Torque, GPFS, Lustre Chapman Tate Associates seeks a Linux Technical Consultant to join this established technology house that deliver a range of AI, ML … drive continuous improvement. Technical and soft skills needed will include: Proficiency in Linux system administration and shell scripting. Experience with HPC technologies such as Slurm, Torque, OpenMPI, GPFS, Lustre, etc. Strong networking skills, including TCP/IP, DNS, DHCP, and firewall management. Familiarity with monitoring and performance tuning tools. More ❯
london, south east england, United Kingdom Hybrid / WFH Options
The Engage Partnership Recruitment
hear from you.) 💡 The Stack & Environment: A diverse, modern environment spanning: Linux, Windows, MacOS, Microsoft 365, Azure AD, Intune, Teams, NICE DCV, Nvidia CUDA, Slurm, Jira Service Desk, Terraform, Azure Resource Manager 💡 What We’re Looking For: 2+ years of experience administering HPC infrastructure Hands-on experience with … Infiniband, Slurm, and GPU compute platforms (e.g. CUDA) Proficiency in systems administration and troubleshooting Strong documentation habits and a customer-focused mindset Experience with VDI solutions and monitoring tools 💡 Bonus Points: Familiarity with Jira Service Desk and Terraform scripting Exposure to SSL management, infrastructure-as-code, or cloud database More ❯
london, south east england, United Kingdom Hybrid / WFH Options
The Engage Partnership Recruitment
The Stack & Environment: Work across a modern tech landscape including: Linux, Windows, MacOS, Microsoft 365, Azure AD, Intune, Teams (incl. voice integration), NICE DCV, Slurm, CUDA, Jira Service Desk, Terraform, SiteGround, Confluence, and more 💡 What We’re Looking For: 2+ years of experience in IT support or service desk … troubleshooting across Windows, Linux, and MacOS Experience with Microsoft cloud environments and endpoint management Clear communicator with good documentation practices 💡 Bonus Points: Exposure to Slurm, CUDA, Infiniband, or NICE/Amazon DCV Experience with infrastructure automation or scripting (e.g. Terraform) Understanding of SSL management and monitoring tools like Site24x7 More ❯