Systems Engineer (Ray Framework SME)

Distributed Systems Engineer / Ray Framework SME

A fast-growing, early-stage infrastructure company building at the intersection of AI and distributed compute!

They’re looking for an Nngineer with strong experience in Python-based distributed computing, strong experience with Ray Framework in particular, to help design and scale systems handling complex, high-throughput workloads.

What you’ll be doing

  • Designing and building distributed systems for large-scale compute workloads
  • Scaling Python-based services across clusters and cloud environments
  • Working on ML infrastructure / platform tooling (training, orchestration, or inference pipelines)
  • Improving performance, reliability, and fault tolerance across distributed environments
  • Collaborating closely with data scientists and ML engineers to productionise models

What they’re looking for

  • 6-8 years of experience minimum in Systems/Software Engineering.
  • Strong experience with Ray (e.g. Ray Core, Ray Serve or Ray Tune)
  • Strong experience with distributed systems or parallel computing
  • Solid Python skills (production-level)
  • Experience with frameworks like Ray, Spark, Dask, or similar
  • Exposure to Kubernetes / containerised environments

Nice to have

  • Background in ML infrastructure, data platforms, or backend systems ideal
  • Familiarity with PyTorch / TensorFlow in production environments
  • Experience building internal platforms or developer tooling
  • Cloud experience (AWS, GCP, or Azure)

Why join?

  • Work on genuinely complex distributed problems at scale
  • High ownership and impact in a growing team
  • Opportunity to shape ML infrastructure from the ground up
  • Strong engineering culture with a focus on performance and scalability

Location / Package

  • Ideally London, onsite, but remote will be considered for the right fit.
  • Up to circa £130k + benefits per annum (dependent on experience)

Job Details

Company
TrueNorth®
Location
United Kingdom
Posted