Senior Data Engineer
AWS Data Engineer
Your responsibilities:
Design, develop, and maintain scalable data pipelines on AWS using Glue, EMR, S3, and Athena for batch and real-time processing. Build and optimize ETL workflows using PySpark and SQL, ensuring high data quality, reliability, and performance. Orchestrate and schedule data pipelines using Apache Airflow, enabling seamless data movement across systems. Collaborate with business analysts and stakeholders to translate data requirements into technical solutions and deliver actionable insights. Implement data governance, security, and best practices while working within cloud-native architectures on AWS.
Essential skills/knowledge/experience:
- Strong experience with PySpark, distributed data processing, and largescale ETL/ELT pipelines.
- Advanced proficiency in Python for data engineering, automation
- Hands‐on expertise with AWS services (S3, Glue, Lambda, EMR, Bedrock / custom model hosting).
- Hands-on experience in SQL and ETL.
Desirable skills/knowledge/experience: (As applicable)
Pyspark, Python, SQL,AWS