13 of 13 Slurm Workload Manager Jobs in the UK

Platform Engineer

Hiring Organisation
Hlx Life Sciences
Location
Greater London, England, United Kingdom
Familiarity with CI/CD, Git-based workflows, and automation. Strong problem-solving skills and a collaborative mindset. Bonus: Terraform or other IaC tools; Slurm, Kueue, Ray, Spark, or similar systems; GPU tooling (CUDA, Nvidia operators, schedulers); experience supporting ML training or data science teams ...

Senior HPC Engineer - Linux

Hiring Organisation
Ascent People
Location
Banbury, England, United Kingdom
drive continual service improvement initiatives. We would like you to come with most of the following: Linux systems administration; HPC cluster management and scheduling (Slurm, Kubernetes); enterprise storage platforms and parallel filesystems; InfiniBand or high-speed networking; GPU compute workloads and scheduling; virtualisation (VMware, Nutanix AHV); monitoring tools (Prometheus, Grafana ...

Linux Infrastructure Engineer

Hiring Organisation
Sopra Steria
Location
Salisbury, Wiltshire, South West, United Kingdom
Employment Type
Permanent, Work From Home
Salary
25 days holidays, 6% Contributory pension, 4 x life Insurance
Linux capabilities. This role offers exposure to business-as-usual operations, delivered services, and innovative project work. We welcome candidates with experience in Slurm or HPC and Ansible. We have opportunities at different levels in this growing team, so don't shy away if you don't have all the skills. ...

Senior Linux Infrastructure Engineer

Hiring Organisation
Sopra Steria
Location
Portsmouth, Hampshire, South East, United Kingdom
Employment Type
Permanent, Work From Home
Salary
25 days holidays, 6% Contributory pension, 4 x life Insurance
Linux capabilities. This role offers exposure to business-as-usual operations, delivered services, and innovative project work. We welcome candidates with experience in Slurm or HPC and Ansible. We have opportunities at different levels in this growing team, so don't shy away if you don't have all the skills. ...

Machine Learning Engineer PyTorch LLM

Hiring Organisation
Client Server
Location
East London, London, United Kingdom
Employment Type
Permanent, Work From Home
with TypeScript and/or Golang You have distributed systems/training ops experience including practical experience running multi-node jobs on GPU clusters (Slurm, Kubernetes, or managed cloud equivalents) and are familiar with GPU performance tuning: memory usage, mixed precision, throughput vs. latency tradeoffs You have experience within ...

Machine Learning Engineer

Hiring Organisation
Block MB
Location
London Area, United Kingdom
engineering fundamentals — clean, maintainable code and version control best practices. System Knowledge: Hands-on experience with multi-node GPU clusters, orchestration tools (e.g., Kubernetes, Slurm) and performance tuning. Communication: Clear and effective communicator, able to share insights with both technical and non-technical stakeholders. Desirable Qualities Experience with reinforcement ...

Python Developer

Hiring Organisation
Ncounter
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£160,000 - £180,000 per annum
Linux environment. - Experience in building automation tools and managing system configurations. - Knowledge of C++, KDB/q, and experience with technologies like Slurm, Airflow, Kafka, or AMPS. - Background in enhancing system stability, scalability, and performance while conducting root cause analyses to resolve incidents efficiently. - Observability skillset, monitoring and analysis ...

SC Cleared HPC Containerization Expert

Hiring Organisation
Click Digital
Location
Derby, Derbyshire, United Kingdom
Employment Type
Contract
Contract Rate
£700/day
workloads through various customer workshops and interviews, to identify: HPC Environment Assessment & Optimization Conduct rapid reviews of existing HPC infrastructure, including Beowulf clusters, Slurm job schedulers, workflow execution patterns, and key use cases. Analyze current HPC/ML workloads and identify performance constraints, containerization opportunities, and runtime optimization options. … cases, future strategy, HPC market view along with, next steps, and architectural patterns. HPC Tools, Runtimes & Ecosystem Integration Have a good understanding of Slurm scheduling, MPI job patterns, JupyterHub/JupyterLab for data-science workflows, GPU scheduling tools such as Run:AI, and SDLC practices tailored to research/ ...

IT Expert Principal

Hiring Organisation
Hays Specialist Recruitment Limited
Location
Pride Park, Derby, Derbyshire, England, United Kingdom
Employment Type
Contractor
Contract Rate
Salary negotiable
infrastructure and workloads through various customer workshops and interviews, to identify: HPC Environment Assessment & Optimization Conduct rapid reviews of existing HPC infrastructure, including Beowulf clusters, Slurm job schedulers, workflow execution patterns, and key use cases. Analyze current HPC/ML workloads and identify performance constraints, containerization opportunities, and runtime optimization options. … cases, future strategy, HPC market view along with, next steps, and architectural patterns. HPC Tools, Runtimes & Ecosystem Integration Have a good understanding of Slurm scheduling, MPI job patterns, JupyterHub/JupyterLab for data-science workflows, GPU scheduling tools such as Run:AI, and SDLC practices tailored to research/ ...

Principal Cloud Architect – HPC/GPU & AI Platform Solutions

Hiring Organisation
Ll Oefentherapie
Location
London, England, United Kingdom
vision for cloud and AI adoption. Key Responsibilities Architect and deploy large-scale GPU/HPC infrastructure on OCI using tools like Terraform, Ansible, Slurm and Kubernetes. Build automated solutions for cluster provisioning, software deployment, and infrastructure as code. Collaborate with Oracle’s largest enterprise customers to define … architecture in cloud and on‐prem environments. Proficiency in scripting and automation: Python, Bash, PowerShell, Terraform, Ansible. Experience with cluster managers (SLURM, PBS, Bright), Kubernetes, and container orchestration. Knowledge of RDMA, Infiniband, MPI, and distributed file systems. Core cloud-native experience. Familiarity with AI/ML platforms, large language ...

Principal Cloud Architect – HPC/GPU & AI Platform Solutions

Hiring Organisation
Oracle
Location
London, England, United Kingdom
technical consulting, solution engineering, and AI transformation strategy. Key Responsibilities Architect and deploy large‐scale GPU/HPC infrastructure on OCI using Terraform, Ansible, Slurm, Kubernetes, and other tools. Build automated solutions for cluster provisioning, software deployment, and infrastructure as code. Collaborate with Oracle’s largest enterprise customers … architecture in cloud and on‐prem environments. Proficiency in scripting and automation: Python, Bash, PowerShell, Terraform, Ansible. Experience with cluster managers (SLURM, PBS, Bright), Kubernetes, and container orchestration. Knowledge of RDMA, Infiniband, MPI, and distributed file systems. Core cloud‐native experience. Familiarity with AI/ML platforms, large language ...

HPC Systems Administrator

Hiring Organisation
Accenture
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
Systems Administrator, Assoc Manager Salary: Competitive salary and package (Depending on level of experience) Locations: UK, London (must be willing to travel to client sites throughout the UK on an ad hoc basis) Accenture are partnering with scaled … related incidents, implementing preventive measures as needed. Required Skills: • Expertise in an HPC environment, including GPU cluster administration (e.g., NVIDIA, AMD) and workload schedulers such as SLURM or PBS. • Proficiency with AI model training workflows and experience supporting popular AI/ML frameworks (e.g., TensorFlow, PyTorch, CUDA). ...

HPC Engineer

Hiring Organisation
Pearson Whiffin IT & Digital
Location
Derby, Derbyshire, East Midlands, United Kingdom
Employment Type
Contract
scalability of HPC systems. Key Responsibilities Design, deploy, and manage HPC clusters (on-prem, cloud, or hybrid) Install, configure, and optimise job schedulers (e.g. Slurm, PBS, LSF) Tune system performance for CPU, GPU, memory, storage, and network workloads Support users with application optimisation and parallelisation Automate system administration using … system administration experience Hands-on experience with HPC environments and parallel computing Knowledge of MPI, OpenMP, and/or CUDA Experience with job schedulers (Slurm preferred) Familiarity with high-speed interconnects (InfiniBand, Omni-Path) Experience with scripting languages (Bash, Python) Understanding of performance profiling and optimisation techniques Desirable Skills ...