Graduate Data Engineer
Peterborough, England, United Kingdom
Tata Consultancy Services
… external data to process answering specific business questions and identify opportunities for improvement. Build processes supporting data transformation, data structures, metadata, data quality and workload management. Process and extract value from large datasets. Build and optimise ETL data pipelines to ingest, transform and load the datasets. Your Profile Key … data platform services: EC2, EMR, RDS, Redshift, Glue. Ability to work with object-oriented scripting languages: Python, PySpark. Knowledge of data pipeline and workflow management tools such as Airflow. Strong understanding of relational SQL and NoSQL databases, including MongoDB, and of stream-processing systems such as Spark Streaming and Kinesis.