Data Engineer
!! IMMEDIATE JOINERS !!
💻 DATA ENGINEER (Python & SQL Focus) 📊 IMMEDIATE JOINE
RWe're looking for an enthusiastic and detail-oriented Junior Big Data Develope r to join our data engineering team. This role is ideal for an early-career professional with foundational knowledge in data processing, strong proficiency in Pytho n, and expert skills in SQ L. You'll focus on building, testing, and maintaining data pipelines and ensuring data quality across our scalable Big Data platforms
.
Key Responsibiliti
- esData Pipeline Developmen t: Assist in the design, construction, and maintenance of robus t ETL/E LT pipelines to integrate data from various sources into our data warehouse or data lak
- e.Data Transformation with Pytho n: Write, optimize, and maintain production-grad e Python scrip ts to clean, transform, aggregate, and process large volumes of dat
- a.Database Interaction (SQL): Develop complex, high-performanc e SQL queri es (DDL/DML) for data extraction, manipulation, and validation within relational and data warehousing environment
- s.Quality Assuranc e: Implement data quality checks and monitoring across pipelines, identifying discrepancies and ensuring the accuracy and reliability of dat
- a.Collaboratio n: Work closely with Data Scientists, Data Analysts, and other Engineers to understand data requirements and translate business needs into technical data solution
- s.Tooling & Automatio n: Utilize version control tools lik e G it and contribute to the automation of data workflows and recurring processe
- s.Documentatio n: Create and maintain technical documentation for data mappings, processes, and pipeline
s.
Required Skills and Qualificati
onsCore Technical Ski
llsSkill A
reaRequireme
nts
Program
mingStrong proficiency in Py thon for data manipulation and scripting. Familiarity with standard Python data libraries (e.g., Pandas, N um
Py).Data
baseExpert-level proficiency in SQL (Structured Query Language). Experience writing complex joins, stored procedures, and performing performance tun
ing.Big Data Conc
eptsFoundational understanding of Big Data architecture (Data Lakes, Data Warehouses) and distributed processing concepts (e.g., MapRedu
ce).ETL
/ELTBasic knowledge of ETL principles and data modeling (star schema, snowflake sche
ma).Version Con
trolPractical experience w ith Git (branching, merging, pull reques
ts).
Preferred Qualifications (A
- Plus)Experience with a distributed computing framework like Apache Spark (using PySp
- ark).Familiarity with cloud data servi ces (AWS S3/Redshift, Azure Data Lake/Synapse, or Google BigQuery/Cloud St or
- age).Exposure to workflow orchestration to ols (Apache Airflow, Prefect, or Da gs
- ter).Master's degree in Computer Science, Engineering, Information Technology, or a related f