AI /ML Platform Engineer

Apply Now

AI/ML Platform Engineer (Senior Consultant) | Permanent position

Location: Central London
Hybrid model
Salary: Up to £82,500 + Excellent Benefits
Available position: 2 positions

Our client, a leading IT services provider, is seeking an experienced AI/ML Platform Engineer to support a transformative project within the UK education sector. You'll play a critical role in delivering and maintaining a cutting-edge AI/ML Operations platform for a prestigious academic institution, collaborating closely with IT teams, third-party vendors, and internal specialists to deploy a state-of-the-art high-performance computing environment (HPC). The role requires strong hands-on engineering expertise combined with solid consulting capabilities across AI/ML Engineering platforms, Ubuntu Server, NVIDIA Technologies (Run:AI, GPU Operator), Kubernetes containerisation, and High-Performance Computing (HPC) environments to ensure successful platform delivery and ongoing operational excellence.

Key Responsibilities:

Deploy and configure a complete AI/ML Ops platform, including Ubuntu OS, Kubernetes, NVIDIA GPU/Network Operators, and Run: AI orchestration
Shadow vendor installations, conduct knowledge transfer, and validate platform performance against AI/ML development requirements
Provide ongoing operational support and platform ownership, including monitoring, troubleshooting, and implementing updates/patches
Deliver technical guidance to researchers and developers while engaging stakeholders to understand evolving requirements
Create comprehensive documentation covering configuration, maintenance procedures, and troubleshooting guides
Collaborate effectively with IT teams, third-party vendors, and internal specialists throughout implementation and BAU phases

Required Qualifications & Experience:

Strong experience deploying and supporting HPC environments for AI/ML workloads
Strong Ubuntu server administration and extensive Kubernetes cluster management experience
Hands-on experience with NVIDIA GPU technologies (GPU Operator) and AI/ML orchestration platforms (Run: AI highly desirable)
Understanding of networking in containerised environments and experience supporting complex computing infrastructures
Strong documentation and knowledge transfer capabilities

Company: Computappoint
Location: City of London, Greater London, UK
Hybrid / WFH Options
Posted: 4 days ago

Apply Now

Company: Computappoint
Location: City of London, Greater London, UK
Hybrid / WFH Options
Posted: 4 days ago