Hadoop Spark Scala Data Engineer
- Hiring Organisation
- Capgemini
- Location
- Northampton, England, United Kingdom
Spark. Experienced in working with the Hadoop ecosystem, including HDFS, Hive, and YARN, to process and analyze large datasets efficiently. Skilled in building robust ETL pipelines, real-time data processing, and optimizing distributed systems for performance and reliability. Proficient in SQL, data modeling, and integrating data from multiple sources … time. Your Role: Design and develop Hadoop based applications and data pipelines. Build operate monitor and troubleshoot Hadoop clusters. Write scalable ETL processes using tools like Hive Pig and Spark. Develop and maintain data ingestion processes using Sqoop Flume or Kafka. Optimize MapReduce jobs and manage HDFS storage. Collaborate ...