Data Engineer
Penryn, England, United Kingdom
Hybrid / WFH Options
Hybrid / WFH Options
Aspia Space
and optimising our data infrastructure across both on-premise HPCs and cloud platforms. You’ll work closely with ML engineers and researchers to wrangle, clean, and prepare large datasets—including geospatial data—for training our large-scale AI models. Key Responsibilities: •Architect, design, and manage scalable data pipelines and … infrastructure across on-premise and cloud environments (AWS S3, Redshift, Glue, Step Functions). •Ingest, clean, wrangle, and preprocess large, diverse, and often messy datasets—including structured, unstructured, and geospatial data. •Collaborate with ML and research teams to ensure data pipelines align with model training requirements and schedules. •Develop … distributed systems. •Experience working across hybrid environments: on-premise HPCs and cloud platforms. •Proficiency with Linux, bash scripting, and git. •Proven ability to write clean, maintainable, and testable code. •Ability to thrive in a fast-paced, dynamic environment with shifting priorities. •Excellent problem-solving and communication skills. • Proximity to More ❯
Posted: