. Knowledge of software engineering practices (coding practices to DS, unit testing, version control, code review). Experience with Hadoop (especially the Cloudera and Hortonworks distributions), other NoSQL (especially Neo4j and Elastic), and streaming technologies (especially Spark Streaming). Deep understanding of data manipulation/wrangling techniques. Experience using development More ❯
knowledge of applying Data Engineering best practices (coding practices to DS, unit testing, version control, code review). Big Data Eco-Systems, Cloudera/Hortonworks, AWS EMR, GCP DataProc or GCP Cloud Data Fusion. Streaming technologies and processing engines, Kinesis, Kafka, Pub/Sub and Spark Streaming. Experience of working More ❯
source Apache code and should be an individual contributor to open-source projects. Required Skills: Apache Hadoop Architecture Yarn Architecture Spark Architecture Cloudera distribution HortonworksMore ❯
source Apache code and should be an individual contributor to open-source projects. Mandatory Skills: Apache Hadoop Architecture Yarn Architecture Spark Architecture Cloudera distribution HortonworksMore ❯
source Apache code and should be an individual contributor to open-source projects. Required Skills: Apache Hadoop Architecture Yarn Architecture Spark Architecture Cloudera distribution HortonworksMore ❯