Python, C++. Java. Extensive experience with data processing tools and platforms such as Kafka, Flink, Databricks. Skilled in managing distributed computing management systems, like SLURM, LSF. Knowledgeable in both relational and non-relational databases (e.g., SQL, MongoDB). Familiarity with cloud platforms (AWS, Azure). Strong analytical and troubleshooting More ❯
south west london, south east england, united kingdom
General Motors
Python, C++. Java. Extensive experience with data processing tools and platforms such as Kafka, Flink, Databricks. Skilled in managing distributed computing management systems, like SLURM, LSF. Knowledgeable in both relational and non-relational databases (e.g., SQL, MongoDB). Familiarity with cloud platforms (AWS, Azure). Strong analytical and troubleshooting More ❯
Python, C++. Java. Extensive experience with data processing tools and platforms such as Kafka, Flink, Databricks. Skilled in managing distributed computing management systems, like SLURM, LSF. Knowledgeable in both relational and non-relational databases (e.g., SQL, MongoDB). Familiarity with cloud platforms (AWS, Azure). Strong analytical and troubleshooting More ❯
What We're Looking For: 5+ years of experience in HPC environments, including exposure to parallel file-systems (e.g., Lustre, GPFS), batch schedulers (e.g., Slurm, Grid Engine), and high-performance networking (experience with interconnects is a plus) Strong Linux systems administration skills in distributed and high-scale setups Proficiency More ❯
london (city of london), south east england, united kingdom
Selby Jennings
What We're Looking For: 5+ years of experience in HPC environments, including exposure to parallel file-systems (e.g., Lustre, GPFS), batch schedulers (e.g., Slurm, Grid Engine), and high-performance networking (experience with interconnects is a plus) Strong Linux systems administration skills in distributed and high-scale setups Proficiency More ❯
C-Level executives. This requires deep familiarity across the stack - compute infrastructure (Amazon EC2, Amazon EKA), ML frameworks PyTorch, JAX, orchestration layers Kubernetes and Slurm, parallel computing (NCCL, MPI), MLOPs, through to Amazon SageMaker Hyperpod, Amazon Bedrock as well as target use cases in the cloud. This is an More ❯
Willingness to engage in technical discussion and commit to producing high quality code Enthusiasm to learn and grow in your role Any understanding of Slurm and HPC a bonus Developing in Python within an SRE team spanning across the business with project and product work, there is a huge More ❯
london (city of london), south east england, united kingdom
Ncounter Technology Recruitment
Willingness to engage in technical discussion and commit to producing high quality code Enthusiasm to learn and grow in your role Any understanding of Slurm and HPC a bonus Developing in Python within an SRE team spanning across the business with project and product work, there is a huge More ❯
to omics datasets Ability to read machine learning research articles and implement the algorithms described Experience of working with high performance computing clusters (Bash, Slurm etc) Good understanding of MLOps for experiment tracking, model and data versioning, hyperparameter tuning and results visualisation Experience in database technologies: SQL, NoSQL. What More ❯
software engineering skills. Proficiency in Python and related ML frameworks such as JAX, Pytorch and XLA/MLIR. Experience with distributed training infrastructures (Kubernetes, Slurm) and associated frameworks (Ray). Experience using large-scale distributed training strategies. Hands on experience on training large model at scale and having contributed More ❯
be all the more interesting if you also have: Experience in an AI/ML environment. Experience of high-performance computing (HPC) systems and workload managers (Slurm). Worked with modern AI-oriented solutions (Fluidstack, Coreweave, Vast ). Benefits Competitive cash salary and equity. Food: Daily lunch vouchers. More ❯
be all the more interesting if you also have: Experience in an AI/ML environment. Experience of high-performance computing (HPC) systems and workload managers (Slurm). Worked with modern AI-oriented solutions (Fluidstack, Coreweave, Vast...). Benefits Competitive cash salary and equity. Food: Daily lunch vouchers. More ❯
performance of models on accelerated computing (GPU, TPU, AI ASICs) clusters with high-speed networking. Experience scaling model training and inference using technologies like Slurm, ParallelCluster, Amazon SageMaker. Experience in developing and deploying large scale machine learning or deep learning models and/or systems into production, including batch More ❯
performance of models on accelerated computing (GPU, TPU, AI ASICs) clusters with high-speed networking. Experience scaling model training and inference using technologies like Slurm, ParallelCluster, Amazon SageMaker. Experience in developing and deploying large scale machine learning or deep learning models and/or systems into production, including batch More ❯
performance of models on accelerated computing (GPU, TPU, AI ASICs) clusters with high-speed networking. - Experience scaling model training and inference using technologies like Slurm, ParallelCluster, Amazon SageMaker. - Experience in developing and deploying large scale machine learning or deep learning models and/or systems into production, including batch More ❯
performance of models on accelerated computing (GPU, TPU, AI ASICs) clusters with high-speed networking. - Experience scaling model training and inference using technologies like Slurm, ParallelCluster, Amazon SageMaker. - Experience in developing and deploying large scale machine learning or deep learning models and/or systems into production, including batch More ❯
performance of models on accelerated computing (GPU, TPU, AI ASICs) clusters with high-speed networking. - Experience scaling model training and inference using technologies like Slurm, ParallelCluster, Amazon SageMaker. - Experience in developing and deploying large scale machine learning or deep learning models and/or systems into production, including batch More ❯
with an open mind and a positive attitude. We value effectiveness, competence, and a growth mindset. Overview: We are seeking a skilled Technical Account Manager (TAM) to serve as a trusted advisor and strategic partner to our diverse customer base. In this role, you will be responsible for building … Technology, Engineering, or a related field (or equivalent experience). 3+ years of experience in a customer-facing technical role, such as Technical Account Manager, Solutions Architect, or Cloud Support Engineer. Strong understanding of cloud architecture, DevOps practices, and tools such as Docker, Kubernetes, SLURM, CI/CD More ❯
with an open mind and a positive attitude. We value effectiveness, competence, and a growth mindset. Overview: We are seeking a skilled Technical Account Manager (TAM) to serve as a trusted advisor and strategic partner to our diverse customer base. In this role, you will be responsible for building … Technology, Engineering, or a related field (or equivalent experience). 3+ years of experience in a customer-facing technical role, such as Technical Account Manager, Solutions Architect, or Cloud Support Engineer. Strong understanding of cloud architecture, DevOps practices, and tools such as Docker, Kubernetes, SLURM, CI/CD More ❯
hear from you.) 💡 The Stack & Environment: A diverse, modern environment spanning: Linux, Windows, MacOS, Microsoft 365, Azure AD, Intune, Teams, NICE DCV, Nvidia CUDA, Slurm, Jira Service Desk, Terraform, Azure Resource Manager 💡 What We’re Looking For: 2+ years of experience administering HPC infrastructure Hands-on experience with … Infiniband, Slurm, and GPU compute platforms (e.g. CUDA) Proficiency in systems administration and troubleshooting Strong documentation habits and a customer-focused mindset Experience with VDI solutions and monitoring tools 💡 Bonus Points: Familiarity with Jira Service Desk and Terraform scripting Exposure to SSL management, infrastructure-as-code, or cloud database More ❯
london, south east england, united kingdom Hybrid / WFH Options
The Engage Partnership Recruitment
hear from you.) 💡 The Stack & Environment: A diverse, modern environment spanning: Linux, Windows, MacOS, Microsoft 365, Azure AD, Intune, Teams, NICE DCV, Nvidia CUDA, Slurm, Jira Service Desk, Terraform, Azure Resource Manager 💡 What We’re Looking For: 2+ years of experience administering HPC infrastructure Hands-on experience with … Infiniband, Slurm, and GPU compute platforms (e.g. CUDA) Proficiency in systems administration and troubleshooting Strong documentation habits and a customer-focused mindset Experience with VDI solutions and monitoring tools 💡 Bonus Points: Familiarity with Jira Service Desk and Terraform scripting Exposure to SSL management, infrastructure-as-code, or cloud database More ❯
engineering and building a high performance culture within a team. The best of both worlds. You know how to engineer HPC clusters, confident with Slurm for scheduling and GPFS for storage. Linux is the bread and butter and you have exposure to the cloud. If you have experience with More ❯