Senior Data Engineer (Data Governance, Databricks, PySpark)
Leeds, Yorkshire, United Kingdom
Hybrid / WFH Options
PEXA Group Limited
end data quality, from raw ingested data to business-ready datasets
Optimise PySpark-based data transformation logic for performance and reliability
Build scalable and maintainable pipelines in Databricks and Airflow
Implement and uphold GDPR-compliant processes around PII data
Collaborate with stakeholders to define what "business-ready" means, and confidently sign off datasets as fit for consumption
Put testing … internal and external customers

Skills & Experience Required
Extensive hands-on experience with PySpark, including performance optimisation
Deep working knowledge of Databricks (development, architecture, and operations)
Proven experience working with Airflow for orchestration
Proven track record in managing and securing PII data, with GDPR compliance in mind
Experience in data governance processes; Alation experience preferred, but similar tools welcome
Strong SQL
Employment Type: Permanent
Salary: GBP Annual