Bioinformatics Data engineer
Bioinformatics | Data Engineer
📍 London OR Oxford
About the Company
We are a growing TechBio organization using data science and machine learning to accelerate drug discovery. Our teams integrate multi-omics and functional assay data to uncover insights into disease biology and therapeutic development.
The Role
We’re seeking a Bioinformatics Data Engineer to design, build, and optimize data pipelines that integrate large-scale biological, multi-omics, and experimental datasets. You’ll collaborate closely with scientists, bioinformaticians, and ML engineers to deliver robust, compliant, and reusable data solutions that drive research and discovery.
Key Responsibilities
- Develop and maintain ETL pipelines for bioinformatics and omics datasets across cloud and on-prem environments.
- Standardize and harmonize diverse data sources, ensuring metadata quality and FAIR compliance.
- Integrate multi-modal datasets (genomic, transcriptomic, proteomic, imaging, etc.) into unified data models.
- Automate data validation, quality control, and lineage tracking.
- Support analytics, visualization, and machine learning workflows.
- Contribute to data governance practices covering access, privacy, and lifecycle management.
Qualifications
- Bachelor’s or Master’s in Bioinformatics, Computer Science, Data Engineering, or related field.
- 4+ years of experience in data engineering or bioinformatics data management.
- Strong Python and SQL skills; experience with Pandas, PySpark, Dask, or similar frameworks.
- Familiar with Linux, Docker, and modern data architectures (relational, object, non-relational).
- Experience with orchestration tools (Airflow/Prefect) and cloud platforms (AWS preferred).
- Proven experience handling large-scale biological or multi-omics datasets.
- Bonus: exposure to distributed computing (Spark, Databricks, Kubernetes) or data cataloguing systems.
You Are
- Curious and scientifically minded, with a strong understanding of biological data workflows.
- Collaborative and able to communicate effectively across computational and experimental teams.
- Passionate about applying data to accelerate biomedical discovery.
- Detail-oriented and proactive about data quality and governance.
Join a mission-driven team at the intersection of bioinformatics, data engineering, and AI-driven biology , where your work will directly support innovations in precision medicine and therapeutic discovery.
- Company
- Hlx Life Sciences
- Location
- London, UK
- Posted
- Company
- Hlx Life Sciences
- Location
- London, UK
- Posted