PySpark Developer
Data & AI - LLM Model Developer (PySpark & AWS)
Contract | UK Fully Remote | Inside IR35
We're supporting a large-scale data and AI transformation programme and are looking for an experienced Data & AI - LLM Model Developer with advanced PySpark and AWS expertise to help modernise complex legacy data platforms.
This is a hands-on contract role in which you'll play a key part in the SAS-to-PySpark migration, build scalable, cloud-native data pipelines, and deliver production-ready solutions in a regulated environment.
What you'll be doing
- Designing, developing, and optimising PySpark-based data pipelines on AWS
- Converting legacy SAS workloads to PySpark, using automated migration tools and manual optimisation
- Refactoring and stabilising existing data workflows into modern cloud architectures
- Optimising Spark workloads for performance, scalability, and cost efficiency
- Working closely with engineers and stakeholders to deliver reliable, high-quality data solutions
Essential skills
- PySpark - P3 (Advanced): strong hands-on experience building production-grade Spark solutions
- AWS - P3 (Advanced): EMR, Glue, S3, Athena (and related services)
- Experience using automated migration tools for large-scale code or data modernisation
- Strong SQL and data engineering fundamentals
- Experience working with distributed data processing and cloud platforms
Nice to have
- Exposure to SAS or legacy analytics platforms
- Experience in banking or financial services
- CI/CD, Git-based workflows, or DevOps tooling
Why this role?
- Fully remote (UK-based) contract
- Long-term transformation programme with real technical depth
- Modern cloud and data stack
- Outside-the-box problem solving, not just maintenance work