Senior Linux HPC Systems Administrator/Engineer

This is an excellent opportunity for Senior Linux HPC Systems Administrator/Engineer professionals to be part of leading-edge technology projects. Cognizant’s Cloud, Infrastructure & Security Services Practice provides end-to-end solutions covering architecture, design, implementation, management, and on-going support across the entire enterprise technology infrastructure. Our services include a spectrum of management, consulting, and systems integration services to help our clients maximize value in their infrastructure resources, while optimizing infrastructure performance and cost.

Key Responsibilities

  • Administer, configure, and maintain RHEL environments (specifically RHEL 8 & 9) ensuring stability, performance, and security.
  • Provide hands-on support with high-end workstation hardware for scientists, promptly addressing hardware and software issues.
  • Offer technical support to scientific users, bridging the gap between research demands and IT infrastructure.
  • Leverage any scientific computing experience to optimize system performance and manage specialized applications.
  • Assist with management of high-performance compute resources, including experience with Slurm, clustering, and related HPC technologies.
  • Work closely with other technical teams and stakeholders to align IT services with organizational needs.
  • Build and maintain strong stakeholder relationships, communicating complex technical concepts.
  • Provide in-person support onsite to ensure effective resolution of issues and a high level of customer satisfaction.
  • Utilize ServiceNow for tracking incidents, managing change requests, and ensuring timely resolution of service tickets.
  • Implement and follow IT best practices for incident management, performance monitoring, and network troubleshooting.
  • Manage SSL certificates and configure web servers as needed.
  • Monitor and troubleshoot system performance issues, including understanding the impact of GPUs, networking, and other hardware components.
  • Handle vendor relationships effectively, coordinating with external partners to resolve issues and optimize service delivery.
  • Maintain familiarity with MacOS systems to provide assistance when necessary.

Required Skills

  • Good enterprise IT experience with extensive hands-on expertise in RedHat Enterprise Linux (RHEL), specifically RHEL 8 & 9.
  • Proven experience with high-end workstation hardware setups and scientific application support.
  • Demonstrated knowledge of scientific computing and experience in high performance compute environments, including experience with Slurm and clustering, is highly desirable.
  • Strong troubleshooting skills for both hardware and software issues.

Desirable Skills:

  • Working knowledge of ServiceNow and its application in incident and service management.
  • Familiarity with networking concepts, performance monitoring tools, and GPU technologies.
  • Any experience with scientific applications will be a significant advantage.
  • Exposure to MacOS environments is useful but not essential.
Company
Cognizant
Location
Stevenage, Hertfordshire, UK
Posted
Company
Cognizant
Location
Stevenage, Hertfordshire, UK
Posted