Remote Slurm Workload Manager Jobs in London

10 of 10 Remote Slurm Workload Manager Jobs in London

Research Engineer, Machine Learning

London, England, United Kingdom
Hybrid / WFH Options
Mistral AI
or equivalent proven track record) 4 + years working on large-scale ML codebases Hands-on with PyTorch, JAX or TensorFlow; comfortable with distributed training (DeepSpeed/FSDP/SLURM/K8s) Experience in deep learning, NLP or LLMs; bonus for CUDA or data-pipeline chops Strong software-design instincts: testing, code review, CI/CD Self-starter, low More ❯
Posted:

Quantitative Developer

London, England, United Kingdom
Hybrid / WFH Options
Tower Research Capital
learn, PyTorch) Experience building distributed systems with message buses (Kafka, ZeroMQ) and asynchronous I/O Experience with cloud or on-prem orchestration and scheduling frameworks (Kubernetes, HT Condor, SLURM) Benefits Tower’s headquarters are in the historic Equitable Building, right in the heart of NYC’s Financial District and our impact is global, with over a dozen offices More ❯
Posted:

Machine Learning Software Engineer, Research

London, England, United Kingdom
Hybrid / WFH Options
PhysicsX Ltd
for computer vision, geometry processing, or scientific computing; software engineering concepts and best practices (e.g., versioning, testing, CI/CD, API design, MLOps); container-ization and orchestration (Docker, Kubernetes, Slurm); writing pipelines and experiment environments, including running experiments in pipelines in a systematic way. What we offer Be part of something larger: Make an impact and meaningfully shape an More ❯
Posted:

Machine Learning Software Engineer, Research

London, England, United Kingdom
Hybrid / WFH Options
PhysicsX
for computer vision, geometry processing, or scientific computing; software engineering concepts and best practices (e.g., versioning, testing, CI/CD, API design, MLOps); container-ization and orchestration (Docker, Kubernetes, Slurm); writing pipelines and experiment environments, including running experiments in pipelines in a systematic way What We Offer Be part of something larger: Make an impact and meaningfully shape an More ❯
Posted:

Biostatistics HPC Solution Architect

London, England, United Kingdom
Hybrid / WFH Options
ZipRecruiter
days per week. This is a 6 month temporary contract, to start ASAP. Day rate: Competitive Market rate. The right candidate should have a strong understanding of HPC (Slurm) including the installation and configuration. Key Requirements: Strong understanding of Infrastructure (Azure, On-premises and other cloud techs) Knowledge of Cloud Platforms: Understanding of cloud platforms and Azure, in particular … R : The candidate should have a good understanding of R HPC Skills: The candidate should have a strong understanding of HPC (Slurm) including the installation and configuration Experience with Python: The candidate should have experience with the Python installation and configuration on Linux system Associates should have deep understanding of Biostatistics and Life science domain (especially Clinical) knowledge Basic More ❯
Posted:

GCP Public Cloud Infrastructure Architect (HPC, GKE)

London, England, United Kingdom
Hybrid / WFH Options
Derisk360
What You Bring 10+ years in cloud infrastructure design or DevOps roles. Proven expertise in Google Cloud infrastructure, GKE, and HPC architecture. Strong background in batch scheduling, job queuing (Slurm), and distributed storage systems. Proficient in Kubernetes internals, pod autoscaling, node management. Skilled in Infrastructure as Code (Terraform, Deployment Manager). Hands-on experience with Docker, Helm, Istio … Trivy or Aqua. Fluent in English, with excellent communication and problem-solving skills. Certification: Google Professional Cloud Architect (mandatory). Nice To Have Experience with GPU/TPU workloads, Slurm, Intel MPI/OpenMPI. Exposure to hybrid or multi-cloud setups using Anthos or GCVE. Familiarity with GitOps (ArgoCD, Flux), workload identity, and K8s RBAC. Experience in life More ❯
Posted:

Head of Engineering

London Area, United Kingdom
Hybrid / WFH Options
Enertek Group
engineering teams. Passion for open-source and decentralized infrastructure. Excellent communication and executive presence. Preferred Tech Stack Languages: Go, Rust, Python, Solidity Infrastructure: Kubernetes, Docker, GPU Scheduling (e.g., Kubeflow, Slurm), CI/CD pipelines Blockchain: EVM, Cosmos SDK, ZK/L2 solutions AI Stack (plus): PyTorch, Hugging Face, Ray, ONNX What We Offer Competitive salary + equity/token More ❯
Posted:

Head of Engineering

City of London, London, United Kingdom
Hybrid / WFH Options
Enertek Group
engineering teams. Passion for open-source and decentralized infrastructure. Excellent communication and executive presence. Preferred Tech Stack Languages: Go, Rust, Python, Solidity Infrastructure: Kubernetes, Docker, GPU Scheduling (e.g., Kubeflow, Slurm), CI/CD pipelines Blockchain: EVM, Cosmos SDK, ZK/L2 solutions AI Stack (plus): PyTorch, Hugging Face, Ray, ONNX What We Offer Competitive salary + equity/token More ❯
Posted:

Member of Technical Staff (Infrastructure)

London, England, United Kingdom
Hybrid / WFH Options
Reka AI
logging tools (e.g., Prometheus, Grafana). A deep understanding of cloud computing platforms (e.g., AWS, GCP, Azure). Strongly desired: Experience with HPC/GPU cluster management tools (e.g., Slurm, GPU monitoring tools, distributed file systems). The ability to build in a fast-paced environment under some uncertainty. Reka's Mission Reka's mission is to build useful More ❯
Posted:

Infrastructure Architect (we have office locations in Cambridge, Leeds & London)

London, England, United Kingdom
Hybrid / WFH Options
Genomics England
both on-premise and AWS. About the Tech Stack Our HPC clusters are built in our on-premises data centres and in AWS. We use IBM LSF for our workload management currently. Hardware wise we have a large footprint of FGPA Servers (DRAGEN) both on-premises and in AWS, as well as standard HPC Compute nodes both on-premises … certifications, we are primarily interested in your real-world experience. Essential Skills and Experience: Extensive knowledge and understanding of HPC Technologies – Including but not limited to IBM LSF, NextFlow, Slurm, AWS Batch. Experience working within an On-Premises estate and working to build/design platform on premise considering physical networking, Bare Metal Servers, and Hardware Lifecycles. Strong Experience More ❯
Posted: