Machine Learning Systems & Infrastructure Engineer
- Hiring Organisation: Jobleads-UK
- Location: Greater London, England, United Kingdom
underlying storage at petabyte scale (object storage, FUSE mounts, caching layers, shared filesystems, and relational/analytical/embedded metadata stores). ML workflow orchestration and serving: Operate the systems researchers use to launch experiments, data jobs, and production endpoints, including workflow engines (e.g., Kubeflow Pipelines, Airflow), GPU schedulers (e.g., Volcano … scale: SQL fundamentals; relational (e.g., Postgres), analytical (e.g., BigQuery, Snowflake), and embedded (e.g., SQLite) stores; and object storage with caching layers. Familiarity with ML workflow orchestration and experiment tracking (e.g., Kubeflow Pipelines, MLflow). Experience with monitoring and observability tooling (e.g., Prometheus/Grafana, OpenTelemetry) and CI/ ...