Data Engineer
The Role Build the data infrastructure for next-generation robotics. You will curate and manage the massive datasets required to train sophisticated Machine Learning models for physical interaction.
Key Responsibilities
Data Pipelines: Build and scale pipelines for complex training workflows.
Data Integrity: Ensure high-quality, consistent data across all projects.
Collaboration: Partner with ML researchers on versioning and data security.
Metrics: Develop reporting systems for data quality and performance.
Technical Requirements
Core: Python, SQL, and data processing frameworks.
Experience: Large-scale data management (100s of TBs to Petabytes).
Education: Degree in CS, Data Science, or a related field.
Bonus: Background in robotics, CV, or autonomous systems.
Technical Self-Assessment (1-10)
Candidates will be asked to rate their knowledge of:
Languages: Python, Java, Scala
Compute/Distributed: RAY, Spark, Databricks, Hadoop
Orchestration/Cloud: Kafka, Airflow, Prefect, AWS
Storage/Warehouse: Clickhouse, Snowflake, Redshift, Greenplum
Note: Experience with Petabyte/Exabyte scale is a strong signal for this role.
Randstad Technologies Ltd is a leading specialist recruitment business for the IT & Engineering industries. Please note that due to a high level of applications, we can only respond to applicants whose skills & qualifications are suitable for this position. No terminology in this advert is intended to discriminate against any of the protected characteristics that fall under the Equality Act 2010. For the purposes of the Conduct Regulations 2003, when advertising permanent vacancies we are acting as an Employment Agency, and when advertising temporary/contract vacancies we are acting as an Employment Business.