ensure resolution. Essential Qualifications A bachelor’s degree or master’s degree in Computer Science or related field. 5+ years of experience administering HPC clusters and systems. Experience with SLURM and Grid Engine scheduling software. 5+ years of professional experience in Solution Architecture or Cloud Infrastructure Deployment and support. 7+ years professional experience developing or administering compute solutions for … working with cross-functional IT (Public Cloud skills being a plus) and sciences skillsets. Experience with Python, R, or other related data science programming. Experience with POSIT products (Package Manager, Connect, Workbench) either in an end-user or administrator capacity. Experience working with databases and/or supporting. Experience managing large amounts of data effectively. Experience working with AI …/ML technologies. Experience with containerizing compute workload via Docker or Singularity. Experience with Nvidia DGX systems. Additional information Great talent should benefit from a great work environment. If you join our team, you’ll have access to: A competitive salary and bonus package based on experience. Comprehensive health and wellness benefits, including Medical, Dental, and Vision Insurance. Company More ❯
Farnborough, Hampshire, United Kingdom Hybrid / WFH Options
Lenovo
Analyse and characterise scientific codes and build performance extrapolation to future generation of HPC/AI hardware. Interact with customers and the Lenovo sales team to offer insight into workload performance characteristics that drive system configurations. Complete competitive comparison studies of different technologies to showcase Intel technology advantages. Develop seller enablement collateral for Lenovo Sellers and Business Partners, and … or accelerated applications using more than one of OpenMP, MPI, CUDA, ROCm, OpenCL, SYCL paradigms. Experience of production HPC environment: large-scale filesystems (ideally Storage Scale), batch scheduling (ideally SLURM) as well as common HPC SW and management tools. Experience with analysis and profiling tools for HPC/AI codes: Intel OneAPI suite (Vtune ), AMD (uProf), nVidia toolkit. HPC More ❯
Cambridge, England, United Kingdom Hybrid / WFH Options
ONE NUCLEUS
managerial experience - Previous experience working as part of a broad collaborative project - Familiarity with cloud technologies (Docker, Kubernetes) - Experience with high-performance computing environments and job schedulers such as SLURM Apply now! Benefits and Contract Information - Financial incentives: depending on circumstances, monthly family/marriage allowance of £272 monthly child allowance of £328 per child. Non resident allowance up More ❯
Aberdeen, Scotland, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
Bath, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
Cheltenham, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
Slough, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
Brighton, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
Reading, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
Woking, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
Liverpool, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
High Wycombe, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
Bournemouth, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
Hemel Hempstead, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
Portsmouth, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
Watford, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
Crawley, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
Southampton, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
on-call rotations to support high-priority incidents and escalations. About You Skills & Experience Proven experience supporting HPC and/or AI workloads in production environments. Strong expertise with Slurmworkloadmanager, including tuning and troubleshooting. Proficiency with system-level debugging, including kernel modules and network interfaces. Experience with GPU compute platforms (NVIDIA and/or AMD … settings. Comfort operating in fast-paced, ambiguous, high-growth environments. Nice to have Experience with OpenStack and troubleshooting infrastructure in cloud environments. Kubernetes expertise, particularly in HPC or AI workload contexts. Familiarity with distributed file systems and advanced storage configurations. Understanding of GPU virtualization and multi-tenant HPC architecture. Exposure to machine learning frameworks and AI optimization workflows. Scripting More ❯
following skills and experience: Proven experience in managing HPC environments (on-prem, cloud, or hybrid). Deep understanding of Linux-based systems (CentOS, RHEL, Ubuntu, etc.). Experience with workload managers/schedulers (e.g. Slurm, PBS, LSF). Familiarity with parallel file systems (e.g. Lustre, GPFS) and storage management. Scripting skills in Bash, Python or similar for automation More ❯
handsworth, yorkshire and the humber, united kingdom
Chapman Tate Associates
following skills and experience: Proven experience in managing HPC environments (on-prem, cloud, or hybrid). Deep understanding of Linux-based systems (CentOS, RHEL, Ubuntu, etc.). Experience with workload managers/schedulers (e.g. Slurm, PBS, LSF). Familiarity with parallel file systems (e.g. Lustre, GPFS) and storage management. Scripting skills in Bash, Python or similar for automation More ❯
clearly across technical and non-technical teams Required Skills & Knowledge: Strong understanding of Infrastructure (Azure, On-premises, Cloud) Proficiency in R and Python environments Experience with HPC systems (e.g., Slurm) Basic SAS knowledge Deep understanding of Life Science and Biostatics Desirable: Background in Life Sciences or Clinical Data Broad knowledge of infrastructure solutions If you have the relevant skills More ❯
Chemistry and Biology Can communicate with ML engineers Demonstrates competence and rigor in software development. Has experience working with scientific computing/lab environments (e.g. has used or administered SLURM) Conversant with cloud computing; able to provide requirements to DevOps engineers ABOUT IAMBIC THERAPEUTICS Iambic is a clinical-stage life-science and technology company developing novel medicines using its More ❯
Cambridge, England, United Kingdom Hybrid / WFH Options
Arm
to ensure high availability and observability. Tune platform performance under high-throughput workloads and lead capacity planning. Automate and execute stress/load tests using both synthetic and real workload profiles. Mentor junior engineers and contribute to architectural direction and code quality. Collaborate cross-functionally with infrastructure, tools, and FinOps teams to ensure the platform remains efficient and cost … and production support practices. Proven experience monitoring production systems, designing actionable alerts, and improving reliability through observability (metrics, logs, tracing). “Nice To Have” Skills And Experience Experience with workload orchestration or job scheduling systems (e.g., AWS Batch, Slurm, LSF). Exposure to compute-intensive domains such as EDA, HPC, or large-scale simulation. Knowledge of FinOps practices More ❯