role
Remote, £550, Inside IR35, 6-month contract
Key skills needed:
- Designing and implementing Unix/Linux systems and services, open-source solutions and performance tuning.
- HPC technologies: Lustre, Slurm.
- Configuration systems such as Ansible and Terraform.
- Unix/Linux scripting.
- Networking: TCP/IP, DHCP, VLANs, spanning tree protocol, link aggregation for performance (MTU settings) and reliability requirements.
customers, including the network infrastructure, security, server, storage, end user compute and device management.
Role Overview: The UNIX Systems Specialist reports to the Unix Systems Group lead, Infrastructure Systems Manager (UNIX), and is responsible for design, management and support in the Linux System Administration team, managing the day-to-day running of the UKAEA Linux-based IT Systems, HPC …/BPSS level minimum).
Desirable
o Experience of managing Linux systems at scale.
o Experience managing IT projects.
o Experience setting up and supporting batch queueing systems (e.g. Slurm)
o Experience setting up and supporting NVIDIA GPU systems
o Ability to write well-documented code in a high-level language or script (Python/Perl)
o Experience in …
and engineering. You’ll gain exposure to technologies that power large-scale modeling, including FEA, and data-driven research, and develop your skills across Linux systems, compute clusters, and workload management tools. Responsibilities: Assist in the setup, monitoring, and maintenance of HPC clusters, storage, and interconnects. Support Linux system administration tasks (RHEL, Rocky), with a focus on stability and … uptime. Help configure and troubleshoot workload managers such as Slurm. Work with senior engineers to monitor performance of key applications and identify opportunities for improvement. Contribute to scripting and automation tasks (Bash, Python) to streamline system operations. Support end users by responding to tickets, preparing documentation, and guiding researchers on best practices. Learn about parallel computing concepts (MPI, OpenMP … or Python). Exposure to bare metal environments (installing, configuring, and troubleshooting physical servers). Interest in high-performance computing, scientific computing, or distributed systems. Eagerness to learn about workload managers (Slurm or similar). Good problem-solving skills, with the ability to troubleshoot technical issues. Strong communication skills and a collaborative mindset. This role is ideal for …
South West London, London, United Kingdom Hybrid/Remote Options
Client Server
hands-on role at a global systematic trading firm with $25 billion under management, earning significant bonuses. As an HPC Platform Management Engineer you'll develop and support scalable workload scheduling solutions for HPC environments using tools such as YellowDog within a large-scale computing environment with both on-premise and cloud-based (AWS) services. You'll collaborate with … with flexibility to work from home 1-2 days a week. About you: You have experience of engineering and supporting at least one HPC scheduler, such as YellowDog, Ray, Slurm or IBM Symphony. You have a deep knowledge of Linux. You have a good understanding of both loosely coupled and tightly coupled HPC workloads and experience of working on …
a related field
2. Proven industry experience in building, deploying, and maintaining Linux servers (Red Hat/Rocky Linux)
3. A working knowledge and practical experience with batch queuing systems (Slurm) and cloud computing, particularly AWS
Key Words: Linux Systems Administrator/Scientific Computing/Red Hat/Rocky Linux/Slurm/AWS/Oracle DBA/IT Security
research demands and IT infrastructure. Leverage any scientific computing experience to optimize system performance and manage specialized applications. Assist with management of high-performance compute resources, including experience with Slurm, clustering, and related HPC technologies. Work closely with other technical teams and stakeholders to align IT services with organizational needs. Build and maintain strong stakeholder relationships, communicating complex technical … 9. Proven experience with high-end workstation hardware setups and scientific application support. Demonstrated knowledge of scientific computing and experience in high performance compute environments, including experience with Slurm and clustering, is highly desirable. Strong troubleshooting skills for both hardware and software issues. Desirable Skills: Working knowledge of ServiceNow and its application in incident and service management. Familiarity with …