Slurm Workload Manager Jobs in Cambridgeshire

3 of 3 Slurm Workload Manager Jobs in Cambridgeshire

Senior HPC Engineer

Cambridge, England, United Kingdom
ZipRecruiter
… ensure resolution. Essential Qualifications: A bachelor’s degree or master’s degree in Computer Science or related field. 5+ years of experience administering HPC clusters and systems. Experience with SLURM and Grid Engine scheduling software. 5+ years of professional experience in Solution Architecture or Cloud Infrastructure Deployment and support. 7+ years professional experience developing or administering compute solutions for … working with cross-functional IT (Public Cloud skills being a plus) and sciences skillsets. Experience with Python, R, or other related data science programming. Experience with POSIT products (Package Manager, Connect, Workbench) either in an end-user or administrator capacity. Experience working with databases and/or supporting. Experience managing large amounts of data effectively. Experience working with AI …/ML technologies. Experience with containerizing compute workload via Docker or Singularity. Experience with Nvidia DGX systems. Additional information: Great talent should benefit from a great work environment. If you join our team, you’ll have access to: A competitive salary and bonus package based on experience. Comprehensive health and wellness benefits, including Medical, Dental, and Vision Insurance. Company …

Senior Software Engineer

Cambridge, England, United Kingdom
Hybrid / WFH Options
Arm
… to ensure high availability and observability. Tune platform performance under high-throughput workloads and lead capacity planning. Automate and execute stress/load tests using both synthetic and real workload profiles. Mentor junior engineers and contribute to architectural direction and code quality. Collaborate cross-functionally with infrastructure, tools, and FinOps teams to ensure the platform remains efficient and cost … and production support practices. Proven experience monitoring production systems, designing actionable alerts, and improving reliability through observability (metrics, logs, tracing). “Nice To Have” Skills And Experience: Experience with workload orchestration or job scheduling systems (e.g., AWS Batch, Slurm, LSF). Exposure to compute-intensive domains such as EDA, HPC, or large-scale simulation. Knowledge of FinOps practices …

Performance Engineer – Engineering Platforms

Cambridge, England, United Kingdom
Hybrid / WFH Options
Arm Limited
Experience working in Linux-based environments, particularly in performance-sensitive contexts. General experience working in compute or storage-heavy environments. Exposure to basic job scheduling systems (e.g., LSF, Jenkins, SLURM). Familiarity with monitoring tools like Prometheus, Grafana, or Linux-based telemetry. Familiarity with profiling tools. Ability to troubleshoot issues related to CPU, memory, I/O, or network performance.