ensure resolution. Essential Qualifications A bachelor’s degree or master’s degree in Computer Science or related field. 5+ years of experience administering HPC clusters and systems. Experience with SLURM and Grid Engine scheduling software. 5+ years of professional experience in Solution Architecture or Cloud Infrastructure Deployment and support. 7+ years professional experience developing or administering compute solutions for … working with cross-functional IT (Public Cloud skills being a plus) and sciences skillsets. Experience with Python, R, or other related data science programming. Experience with POSIT products (Package Manager, Connect, Workbench) either in an end-user or administrator capacity. Experience working with databases and/or supporting. Experience managing large amounts of data effectively. Experience working with AI …/ML technologies. Experience with containerizing compute workload via Docker or Singularity. Experience with Nvidia DGX systems. Additional information Great talent should benefit from a great work environment. If you join our team, you’ll have access to: A competitive salary and bonus package based on experience. Comprehensive health and wellness benefits, including Medical, Dental, and Vision Insurance. Company More ❯
Cambridge, England, United Kingdom Hybrid / WFH Options
ONE NUCLEUS
managerial experience - Previous experience working as part of a broad collaborative project - Familiarity with cloud technologies (Docker, Kubernetes) - Experience with high-performance computing environments and job schedulers such as SLURM Apply now! Benefits and Contract Information - Financial incentives: depending on circumstances, monthly family/marriage allowance of £272 monthly child allowance of £328 per child. Non resident allowance up More ❯
Cambridge, England, United Kingdom Hybrid / WFH Options
Arm
to ensure high availability and observability. Tune platform performance under high-throughput workloads and lead capacity planning. Automate and execute stress/load tests using both synthetic and real workload profiles. Mentor junior engineers and contribute to architectural direction and code quality. Collaborate cross-functionally with infrastructure, tools, and FinOps teams to ensure the platform remains efficient and cost … and production support practices. Proven experience monitoring production systems, designing actionable alerts, and improving reliability through observability (metrics, logs, tracing). “Nice To Have” Skills And Experience Experience with workload orchestration or job scheduling systems (e.g., AWS Batch, Slurm, LSF). Exposure to compute-intensive domains such as EDA, HPC, or large-scale simulation. Knowledge of FinOps practices More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
Experience working in Linux-based environments, particularly in performance-sensitive contexts General experience working in compute or storage-heavy environments Exposure to basic job scheduling systems (e.g., LSF, Jenkins, SLURM) Familiarity with monitoring tools like Prometheus, Grafana, or Linux-based telemetry Familiarity with profiling tools Ability to troubleshoot issues related to CPU, memory, I/O, or network performance More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
Deep familiarity with HPC or large-scale cloud-native compute environments Experience tuning network-attached storage (NAS) and managing storage performance Hands-on knowledge of job schedulers like LSF, SLURM, or cloud-native batch systems Familiarity with tools for tracing, profiling, and monitoring production clusters Comfortable tuning kernel/system parameters and working with external vendors Strong, hands-on More ❯