Remote Ray Jobs in the City of London

1 of 1 Remote Ray Jobs in the City of London

Distributed Systems Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Stanford Black Limited
fit share your CV! Role: Architect and optimise large-scale compute-intensive workloads spanning significant numbers of nodes and concurrent tasks Design, build, and manage systems with tools like Ray and YellowDog Optimise application performance on distributed platforms Provide architectural guidance on distributed computing design and development Drive efficiency and scalability across the platform, with a focus on ML pipeline … Job/Resource scheduling experience i.e. Yellowdog Cloud platform proficiency (any provider) Experience with large scale systems (1k+ Nodes, 10k+ tasks) Experience monitoring/troubleshooting a distributed environment Advance Ray experience for ML pipelines, tuning, distributed execution Python and Conda proficiency Docker + Kubernetes experience Knowledge of networking (TCP/IP, UDP/IP, LAN/WAN) Identify and access More ❯
Posted: