AI/ML Platform Engineer
AI/ML Platform Engineer (Senior Consultant) | Permanent position
- Location: Central London
- Hybrid model
- Salary: Up to £82,500 + Excellent Benefits
- Available positions: 2
Our client, a leading IT services provider, is seeking an experienced AI/ML Platform Engineer to support a transformative project within the UK education sector. You'll play a critical role in delivering and maintaining a cutting-edge AI/ML Operations platform for a prestigious academic institution, collaborating closely with IT teams, third-party vendors, and internal specialists to deploy a state-of-the-art high-performance computing (HPC) environment. The role requires strong hands-on engineering expertise combined with solid consulting capabilities across AI/ML engineering platforms, Ubuntu Server, NVIDIA technologies (Run:AI, GPU Operator), Kubernetes containerisation, and HPC environments to ensure successful platform delivery and ongoing operational excellence.
Key Responsibilities:
- Deploy and configure a complete AI/ML Ops platform, including Ubuntu OS, Kubernetes, NVIDIA GPU/Network Operators, and Run:AI orchestration
- Shadow vendor installations, conduct knowledge transfer, and validate platform performance against AI/ML development requirements
- Provide ongoing operational support and platform ownership, including monitoring, troubleshooting, and implementing updates/patches
- Deliver technical guidance to researchers and developers while engaging stakeholders to understand evolving requirements
- Create comprehensive documentation covering configuration, maintenance procedures, and troubleshooting guides
- Collaborate effectively with IT teams, third-party vendors, and internal specialists throughout implementation and BAU phases
Required Qualifications & Experience:
- Strong experience deploying and supporting HPC environments for AI/ML workloads
- Strong Ubuntu server administration and extensive Kubernetes cluster management experience
- Hands-on experience with NVIDIA GPU technologies (GPU Operator) and AI/ML orchestration platforms (Run:AI highly desirable)
- Understanding of networking in containerised environments and experience supporting complex computing infrastructures
- Strong documentation and knowledge transfer capabilities
- Company: Computappoint
- Location: City of London, Greater London, UK
- Hybrid / WFH options