Systems Engineer (Ray Framework SME)
Distributed Systems Engineer / Ray Framework SME
A fast-growing, early-stage infrastructure company building at the intersection of AI and distributed compute!
They’re looking for an Nngineer with strong experience in Python-based distributed computing, strong experience with Ray Framework in particular, to help design and scale systems handling complex, high-throughput workloads.
What you’ll be doing
- Designing and building distributed systems for large-scale compute workloads
- Scaling Python-based services across clusters and cloud environments
- Working on ML infrastructure / platform tooling (training, orchestration, or inference pipelines)
- Improving performance, reliability, and fault tolerance across distributed environments
- Collaborating closely with data scientists and ML engineers to productionise models
What they’re looking for
- 6-8 years of experience minimum in Systems/Software Engineering.
- Strong experience with Ray (e.g. Ray Core, Ray Serve or Ray Tune)
- Strong experience with distributed systems or parallel computing
- Solid Python skills (production-level)
- Experience with frameworks like Ray, Spark, Dask, or similar
- Exposure to Kubernetes / containerised environments
Nice to have
- Background in ML infrastructure, data platforms, or backend systems ideal
- Familiarity with PyTorch / TensorFlow in production environments
- Experience building internal platforms or developer tooling
- Cloud experience (AWS, GCP, or Azure)
Why join?
- Work on genuinely complex distributed problems at scale
- High ownership and impact in a growing team
- Opportunity to shape ML infrastructure from the ground up
- Strong engineering culture with a focus on performance and scalability
Location / Package
- Ideally London, onsite, but remote will be considered for the right fit.
- Up to circa £130k + benefits per annum (dependent on experience)