AWS Databricks Engineer
About the Job you are considering:
We are looking for an experienced AWS Databricks Engineer with strong hands-on expertise in Databricks, AWS cloud services, PySpark, Spark SQL, Delta Lake, Python, SQL, and data engineering. The candidate will be responsible for designing, developing, optimizing, and supporting scalable data pipelines and lakehouse solutions for a banking client.The ideal candidate should have strong experience in building enterprise-grade data platforms, processing large volumes of structured and semi-structured data, and implementing secure, reliable, and high-performance data pipelines in AWS-based environments.
Hybrid working:
The places that you work from day to day will vary according to your role, your needs, and those of the business; it will be a blend of Company offices, client sites, and your home; noting that you will be unable to work at home 100% of the time.
Your Role:
- Design, develop, and maintain scalable data pipelines using Databricks on AWS.
- Build and optimize data processing frameworks using PySpark, Spark SQL, Python, Delta Lake, and Databricks notebooks.
- Develop end-to-end ETL/ELT pipelines for ingestion, transformation, validation, and consumption.
- Work with AWS services such as S3, Glue, Lambda, EMR, IAM, KMS, CloudWatch, Secrets Manager, Athena, and related services.
- Implement lakehouse architecture using Databricks, Delta Lake, and AWS S3.
- Build data ingestion pipelines from multiple sources including databases, APIs, files, streaming platforms, and third-party systems.
- Develop curated, trusted, and consumption-ready data layers for analytics, reporting, regulatory, and business use cases.
- Implement data quality checks, validation rules, reconciliation logic, audit controls, and exception handling.
- Optimize Spark jobs for performance, scalability, memory usage, shuffle, partitioning, and cost efficiency.
- Manage Databricks jobs, clusters, workflows, libraries, notebooks, and production deployments.
- Implement secure data access using AWS IAM, Databricks access controls, secrets management, and encryption standards.
- Support CI/CD implementation using Git, Jenkins, GitHub Actions, GitLab CI, AWS CodePipeline, or similar tools.
- Collaborate with data architects, business analysts, QA teams, DevOps teams, and client stakeholders.
- Troubleshoot production issues, perform root-cause analysis, and implement permanent fixes.
- Create technical documentation, data mapping specifications, runbooks, and deployment guides.
- Mentor junior engineers and support code reviews, design reviews, and engineering best practices.
Your Skills:
- Strong hands-on experience with Databricks on AWS.
- Strong experience with Apache Spark / PySpark.
- Excellent programming skills in Python.
- Strong SQL skills including complex queries, joins, CTEs, window functions, and query optimization.
- Experience with Spark SQL and large-scale distributed data processing.
- Strong understanding of Delta Lake and lakehouse architecture.
- Hands-on experience with AWS S3 as a data lake storage layer.
- Experience in building ETL/ELT pipelines for batch and near-real-time data processing.
- Experience with data formats such as Parquet, Avro, ORC, JSON, and CSV.
- Good understanding of data modelling, data warehousing, and dimensional modelling concepts.
- Experience with data validation, reconciliation, data quality, audit, and logging frameworks.
- Experience with Git-based version control and CI/CD pipelines.
- Develop and maintain Databricks notebooks, workflows, jobs, and libraries.
- Build reusable PySpark frameworks for ingestion, transformation, and data validation.
- Implement Delta Lake features such as ACID transactions, schema evolution, time travel, and optimized storage.
- Design and implement bronze, silver, and gold layers using medallion architecture.
- Tune Databricks clusters for performance and cost optimization.
- Monitor Databricks jobs and handle failures, retries, alerts, and job dependencies.
We are a Disability Confident Employer:
Capgemini is proud to be a Disability Confident Employer (Level 2) under the UK Government’s Disability Confident scheme. As part of our commitment to inclusive recruitment, we will offer an interview to all candidates who:
- Declare they have a disability, and
- Meet the minimum essential criteria for the role.
Please opt in during the application process.
Make It Real (what does it mean for you):
- You’d be joining an accredited Great Place to work for Wellbeing in 2024. Employee wellbeing is vitally important to us as an organization. We see a healthy and happy workforce a critical component for us to achieve our organizational ambitions.
- To help support wellbeing we have trained ‘Mental Health Champions’ across each of our business areas, and we have invested in wellbeing apps such as Thrive and Peppy.
- You will be empowered to explore, innovate, and progress. You will benefit from Capgemini’s ‘learning for life’ mindset, meaning you will have countless training and development opportunities from thinktanks to hackathons, and access to 250,000 courses with numerous external certifications from AWS, Microsoft, Harvard ManageMentor, Cybersecurity qualifications and much more.
- You will be joining one of the World’s Most Ethical Companies®, as recognized by Ethisphere® for 13 consecutive years. We live our values by making ethical business choices every day. Working ethically is at the centre of our culture at Capgemini, meaning you will be helping to create a future we can all be proud of.
Why you should consider Capgemini:
Growing clients’ businesses while building a more sustainable, more inclusive future is a tough ask. When you join Capgemini, you’ll join a thriving company and become part of a collective of free-thinkers, entrepreneurs and industry experts. We find new ways technology can help us reimagine what’s possible. It’s why, together, we seek out opportunities that will transform the world’s leading businesses, and it’s how you’ll gain the experiences and connections you need to shape your future. By learning from each other every day, sharing knowledge, and always pushing yourself to do better, you’ll build the skills you want. You’ll use your skills to help our clients leverage technology to innovate and grow their business. So, it might not always be easy, but making the world a better place rarely is.
About Capgemini:
Capgemini is an AI-powered global business and technology transformation partner, delivering tangible business value. We imagine the future of organizations and make it real with AI, technology and people. With our strong heritage of nearly 60 years, we are a responsible and diverse group of 420,000 team members in more than 50 countries. We deliver end-to-end services and solutions with our deep industry expertise and strong partner ecosystem, leveraging our capabilities across strategy, technology, design, engineering and business operations. The Group reported 2024 global revenues of €22.1 billion.
Make it real | www.capgemini.com