4 of 4 Permanent Slurm Workload Manager Jobs in London

Platform Engineer

Hiring Organisation
Hlx Life Sciences
Location
Greater London, England, United Kingdom
Familiarity with CI/CD , Git-based workflows, and automation Strong problem-solving skills and a collaborative mindset Bonus Terraform or other IaC tools Slurm, Kueue, Ray, Spark, or similar systems GPU tooling (CUDA, Nvidia operators, schedulers) Experience supporting ML training or data science teams ...

Machine Learning Engineer

Hiring Organisation
Block MB
Location
London Area, United Kingdom
engineering fundamentals — clean, maintainable code and version control best practices. System Knowledge: Hands-on experience with multi-node GPU clusters, orchestration tools (e.g., Kubernetes, Slurm) and performance tuning. Communication: Clear and effective communicator, able to share insights with both technical and non-technical stakeholders. Desirable Qualities Experience with reinforcement ...

Python Developer

Hiring Organisation
Ncounter
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£160,000 - £180,000 per annum
Linux environment. - Experience in building automation tools and managing system configurations. - Knowledge of C++, KDB/q, and experience with technologies like Slurm, Airflow, Kafka, or AMPS. - Background in enhancing system stability, scalability, and performance while conducting root cause analyses to resolve incidents efficiently. - Observability skillset, monitoring and analysis ...

HPC Systems Administrator

Hiring Organisation
Accenture
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
Systems Administrator, Assoc Manager Salary: Competitive salary and package (Depending on level of experience) Locations: UK, London (must be willing to travel to client sites throughout the UK on an ad hoc basis) Salary: Competitive salary and package (Depending on level of experience) Accenture are partnering with scaled … related incidents, implementing preventive measures as needed. Required Skills: •Expertise in an HPC environment, including GPU cluster administration (e.g., NVIDIA, AMD) and workload schedulers such as SLURM or PBS. •Proficiency with AI model training workflows and experience supporting popular AI/ML frameworks (e.g., TensorFlow, PyTorch, CUDA). ...