Platform technologies (Synapse, Data Lakes, ADF) Expertise in data modelling, ETL/ELT pipeline development, and data integration Proficient in SQL and Python (ideally PySpark) Knowledge of tools such as Power BI, Microsoft Fabric, and DevOps (CI/CD pipelines) Experience working with enterprise data sources and APIs (e.g. More ❯
East London, London, United Kingdom Hybrid / WFH Options
McGregor Boyall Associates Limited
Science, Data Science, Mathematics, or related field. 5+ years of experience in ML modeling, ranking, or recommendation systems . Proficiency in Python, SQL, Spark, PySpark, TensorFlow . Strong knowledge of LLM algorithms and training techniques . Experience deploying models in production environments. Nice to Have: Experience in GenAI/ More ❯
schemas (both JSON and Spark), schema management etc - Strong understanding of complex JSON manipulation - Experience working with Data Pipelines using a custom Python/PySpark frameworks - Strong understanding of the 4 core Data categories (Reference, Master, Transactional, Freeform) and the implications of each, particularly managing/handling Reference Data. … Languages/Frameworks - JSON - YAML - Python (as a programming language, not just able to write basic scripts. Pydantic experience would be a bonus.) - SQL - PySpark - Delta Lake - Bash (both CLI usage and scripting) - Git - Markdown - Scala (bonus, not compulsory) - Azure SQL Server as a HIVE Metastore (bonus) Technologies - Azure More ❯