Bioinformatics Data Engineer

Bioinformatics | Data Engineer

📍 London or Oxford

About the Company

We are a growing TechBio organization using data science and machine learning to accelerate drug discovery. Our teams integrate multi-omics and functional assay data to uncover insights into disease biology and therapeutic development.

The Role

We’re seeking a Bioinformatics Data Engineer to design, build, and optimize data pipelines that integrate large-scale biological, multi-omics, and experimental datasets. You’ll collaborate closely with scientists, bioinformaticians, and ML engineers to deliver robust, compliant, and reusable data solutions that drive research and discovery.

Key Responsibilities

  • Develop and maintain ETL pipelines for bioinformatics and omics datasets across cloud and on-prem environments.
  • Standardize and harmonize diverse data sources, ensuring metadata quality and FAIR compliance.
  • Integrate multi-modal datasets (genomic, transcriptomic, proteomic, imaging, etc.) into unified data models.
  • Automate data validation, quality control, and lineage tracking.
  • Support analytics, visualization, and machine learning workflows.
  • Contribute to data governance practices covering access, privacy, and lifecycle management.

Qualifications

  • Bachelor’s or Master’s in Bioinformatics, Computer Science, Data Engineering, or a related field.
  • 4+ years of experience in data engineering or bioinformatics data management.
  • Strong Python and SQL skills; experience with Pandas, PySpark, Dask, or similar frameworks.
  • Familiarity with Linux, Docker, and modern data architectures (relational, object, non-relational).
  • Experience with orchestration tools (Airflow/Prefect) and cloud platforms (AWS preferred).
  • Proven experience handling large-scale biological or multi-omics datasets.
  • Bonus: exposure to distributed computing (Spark, Databricks, Kubernetes) or data cataloguing systems.

You Are

  • Curious and scientifically minded, with a strong understanding of biological data workflows.
  • Collaborative and able to communicate effectively across computational and experimental teams.
  • Passionate about applying data to accelerate biomedical discovery.
  • Detail-oriented and proactive about data quality and governance.

Join a mission-driven team at the intersection of bioinformatics, data engineering, and AI-driven biology, where your work will directly support innovations in precision medicine and therapeutic discovery.

Company
Hlx Life Sciences
Location
London, UK
Posted