Principal Engineer
Principle Engineer
This is a role for engineers who like big systems, hard problems, and meaningful ownership. You'll be joining a team operating at the intersection of software, hardware, and AI.
The engineering team is small, ambitious, and deeply technical, building the orchestration systems that keep thousands of GPUs running at peak performance across global data centres.
About You
You're a pragmatic systems builder who thrives in complexity, enjoys autonomy, and understands what it means to own production at scale. You'll likely bring:
- 5+ years' experience building distributed systems in Go within cloud-native environments.
- Deep hands-on experience with Kubernetes and container orchestration.
- A strong grasp of Infrastructure-as-Code (Terraform) and configuration management tools (Ansible, Puppet, or similar).
- Strong observability experience using tools like Prometheus / Mimir, Loki, Tempo, Grafana, Alertmanager.
- Experience deploying and operating large-scale GPU clusters or HPC systems (Ideally).
- Working knowledge of ML infrastructure and familiarity with GPU drivers, CUDA, and container runtimes.
- A low-ego, collaborative approach and a clear, proactive communication style.