Senior DevOps Engineer
- Hiring Organisation: Humanoid
- Location: City of London, London, United Kingdom
… operating multi-GPU, cross-cloud platforms that enable efficient, reliable, and scalable model training. You’ll work at the intersection of DevOps, MLOps, and distributed systems, helping push the limits of real-world AI.

What You’ll Do:
- Design, build, and operate scalable multi-GPU infrastructure across cloud environments
- … code and automation for provisioning, orchestration, and lifecycle management
- Build and evolve CI/CD pipelines for both infrastructure and ML training workflows
- Optimize distributed training workloads (scheduling, resource utilization, observability)
- Ensure high standards of reliability, scalability, security, and monitoring across systems
- Collaborate with ML engineers and researchers ...