Founding Data Engineer

Founding Data Engineer

London (hybrid)

£100-140k base + equity

Do you have experience building data infrastructure and pipelines from ground up?

Are you skilled in making cloud-based infrastructure (Kubernetes, Docker, CI/CD) faster and more reliable?

Do you have experience with ML deployments and pipelines?

We're a well-funded generative AI company training diffusion models to create a state-of-the-art platform for synthetic data generation.

We are seeking to recruit a Data Infrastructure Engineer to help design and scale the pipelines that power our core technology. You'll enable our researchers and engineers to train, validate, and deploy AI models faster across cloud and distributed environments.

  • Build and optimize data pipelines for large-scale, multimodal datasets.
  • Design and operate distributed data processing across Spark, Databricks, and Kubernetes.
  • Improve developer productivity through faster builds, better orchestration, and scalable infra.
  • Productionize ML models (PyTorch), from prototype to deployment.

We are seeking someone with the following experience:

  • Strong Python programming and solid data engineering fundamentals.
  • Hands-on with Ray, Spark, Databricks, or similar distributed compute frameworks.
  • Experience managing cloud infrastructure, Kubernetes, and CI/CD pipelines.
  • Familiar with dataset versioning, orchestration, and experiment tracking (DVC, MLflow, Airflow).

Ideally, we are looking for someone with prior experience working at an early-stage start-up.

Job Details

Company
Stealth AI Startup
Location
Slough, Berkshire, UK
Employment Type
Full-time
Posted