Systems Research Engineer
- Hiring Organisation: European Tech Recruit
- Location: Edinburgh, Scotland, United Kingdom
… emerging AI and data-centric workloads. Drive modular design and scalability across CPU, GPU, and NPU clusters, building highly efficient serving and scheduling systems.

Performance Optimization & Profiling: Conduct in-depth profiling and performance tuning of large-scale inference and data pipelines, focusing on KV cache management, heterogeneous …

Understanding of load balancing, state management, fault tolerance, and resource scheduling in large-scale AI inference clusters. Prior experience designing, deploying, and profiling high-performance cloud or AI infrastructure systems.

If this role is of any interest, please apply directly on LinkedIn or send a copy of your ...