Forward Deployed Data Engineer (Genomics / Bioinformatics)

Location: Oxford / London – Hybrid

I’m currently working with a highly ambitious organisation building large-scale data platforms to power the next generation of scientific discovery.

They are looking for Data Engineers with a background in genomics or bioinformatics to work directly alongside scientists and researchers, turning complex biological data into reliable, scalable datasets that can power AI and machine learning models.

This role sits at the intersection of data engineering, genomics, and applied research - ideal for engineers who enjoy collaborating closely with scientists and building systems that enable real-world scientific progress.

What you’ll work on

  • Building robust pipelines to process and curate genomic datasets
  • Turning raw sequencing and biological data into ML-ready training datasets
  • Working with formats such as FASTQ, FASTA, VCF and other high-throughput biological data
  • Collaborating with bioinformaticians, computational biologists, and ML teams
  • Designing reproducible workflows using tools like Nextflow, Python and cloud infrastructure

What they’re looking for

  • Experience in bioinformatics, genomics, or computational biology environments
  • Strong programming skills (Python / SQL)
  • Experience building data pipelines or workflow systems
  • Familiarity with genomic data processing or sequencing workflows
  • Engineers who enjoy working close to research and scientific discovery

This is a rare opportunity to work on scientific data problems at scale, helping translate cutting-edge research into real-world impact.

If you’d like to learn more, feel free to message me directly.

Job Details

Company
Cubiq Recruitment
Location
Oxford, England, United Kingdom
Hybrid / Remote Options
Posted