HPC Engineer
We are seeking an experienced and highly motivated High-Performance Computing (HPC) Engineer to join our team. The successful candidate will have a proven record of delivering robust HPC services and infrastructure, combined with the ability to work closely with the scientific and research community to optimise computational workflows.
The role requires an individual with strong technical expertise, an understanding of the evolving HPC landscape, and a commitment to delivering high-quality, scalable, and automated solutions. You will play a key role in building and maintaining advanced computing platforms, including containerised environments and cloud-based research computing services, while applying DevOps principles and Infrastructure-as-Code methodologies.
Key Responsibilities
Design, implement, and maintain secure and scalable HPC infrastructure using Infrastructure-as-Code (IaC) tools such as Terraform.
Develop, deliver, and support advanced research computing services and applications.
Apply Site Reliability Engineering (SRE) principles to ensure high availability, performance, and reliability across HPC environments.
Troubleshoot and resolve complex technical challenges affecting both the platform and user workloads.
Essential Skills and Experience
10+ years of hands-on experience designing, operating, or engineering large-scale computing environments (HPC, HTC, or Big Compute).
Proven ability to drive innovation and integrate emerging technologies into HPC solutions.
Administration experience with cluster and workload management software (e.g., Slurm, LSF, Grid Engine).
Strong knowledge of Linux system administration, TCP/IP networking, and storage systems.
Experience managing parallel file systems (e.g., Weka, GPFS, Lustre).
Hands-on experience with private cloud platforms (e.g., OpenStack).
Proficiency with configuration management tools (e.g., Ansible, Salt, Puppet).
Demonstrated experience in DevOps environments using agile methodologies.
Strong scripting skills in Bash and Python for automation and systems management.
Ability to build and maintain productive relationships with third-party suppliers.
About You
You are a problem solver with a deep understanding of research computing and a passion for leveraging technology to enable discovery. You thrive in complex technical environments, value collaboration, and are driven by the challenge of delivering high-quality, reliable computing services.
- Company
 - Red - The Global SAP Solutions Provider
 - Location
 - London, United Kingdom
 - Employment Type
 - Contract
 - Salary
 - GBP Annual
 - Posted
 