MLOps Platform Engineer

MLOps Platform Engineer

Location: Oxford, UK (Hybrid)

Organisation: Global AI & Health Innovation Programme

Employment Type: Full-time | Permanent

About the Role

We are seeking a MLOps Platform Engineer to design, implement, and scale the infrastructure that supports high-performance machine learning and AI-driven research workflows . You will play a critical role in bridging the gap between data science, bioinformatics, and engineering — ensuring seamless, secure, and reproducible deployment of ML models in production and research environments.

You’ll collaborate closely with AI Scientists, Data Engineers, and DevSecOps teams , building automation pipelines that accelerate model development and deployment across distributed, cloud-native systems.

Key Responsibilities

  • Design, implement, and maintain end-to-end MLOps pipelines for model training, validation, deployment, and monitoring.
  • Develop and automate workflows using Terraform, Kubernetes, Docker , and CI/CD toolchains (GitHub Actions, Jenkins, Argo, etc.).
  • Manage scalable cloud-based compute environments (Oracle Cloud, AWS, or GCP) for AI workloads and data processing.
  • Build and maintain feature stores, model registries, and versioning systems to ensure traceability and reproducibility.
  • Implement data ingestion and pre-processing pipelines to support ML and bioinformatics workloads.
  • Collaborate with security and DevSecOps teams to enforce best practices in access control, compliance, and governance .
  • Support AI/ML researchers in model experimentation and infrastructure optimization.
  • Monitor model and system performance, implement drift detection, and refine automated retraining pipelines.
  • Contribute to the development of a modular, reusable ML platform architecture supporting multi-modal data (genomic, clinical, imaging, etc.).

Essential Skills and Experience

  • Proven experience as an MLOps Engineer, Platform Engineer, or DevOps Engineer supporting ML or data science teams.
  • Strong hands-on experience with containerization (Docker) and orchestration (Kubernetes) .
  • Expertise in Terraform , Infrastructure as Code (IaC), and cloud provisioning (OCI, AWS, GCP, or Azure).
  • Solid understanding of CI/CD pipelines and automated testing frameworks.
  • Experience with ML frameworks such as PyTorch, TensorFlow, or Scikit-learn.
  • Familiarity with MLflow, Kubeflow, DVC, or similar MLOps tools .
  • Understanding of cloud security principles , IAM, and networking best practices.
  • Proficiency in Python and Bash scripting for automation and tooling development.
  • Version control with Git , and collaborative development practices.

Desirable Experience

  • Exposure to bioinformatics or health data ecosystems (WGS, transcriptomics, clinical data).
  • Knowledge of data governance and compliance frameworks (GDPR, ISO27001, HIPAA).
  • Experience building monitoring dashboards for ML performance metrics.
  • Familiarity with distributed training environments and GPU/TPU orchestration.
  • Oracle Cloud Infrastructure (OCI) certification or equivalent.

Terms of Appointment

Applicants must have the right to work permanently in the UK and be within commuting distance of Oxford . Occasional travel may be required for collaboration across global sites.

Company
Hlx Life Sciences
Location
Oxford, Oxfordshire, UK
Hybrid / WFH Options
Posted
Company
Hlx Life Sciences
Location
Oxford, Oxfordshire, UK
Hybrid / WFH Options
Posted