techniques * Demonstrable knowledge of applying Data Engineering best practices (coding practices to DS, unit testing, version control, code review). * Big Data Eco-Systems, Cloudera/Hortonworks, AWS EMR, GCP DataProc or GCP Cloud Data Fusion. * NoSQL Databases. Dynamo DB/Neo4j/Elastic, Google Cloud Datastore. * Snowflake Data Warehouse more »
Experience building data lakes and data pipelines in cloud using Azure and Databricks or similar tools. Spark Developer certification from any of (Databricks, MAPR, Cloudera or Hortonworks) is added advantage but not required Practice with Unix command line tools Familiarity working with agile methodology Strong database and data analysis skills more »
Greater London, England, United Kingdom Hybrid / WFH Options
First Derivative
development and the opportunity to design your own path. We support a variety of external training courses and accreditations such as AWS, GCP, Azure, Cloudera to name a few and are truly passionate about our Mentor Program, through which our senior colleagues generously set aside personal time to coach and more »
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
First Derivative
development and the opportunity to design your own path. We support a variety of external training courses and accreditations such as AWS, GCP, Azure, Cloudera to name a few and are truly passionate about our Mentor Program, through which our senior colleagues generously set aside personal time to coach and more »
like Apache (Flink/Beam/Spark), Oracle ODI, and Confluent’s Platform. Good knowledge of Hadoop Cluster Architecture and hands-on experience within Cloudera Hadoop ecosystems Good working knowledge with Kubernetes Knowledge of Exasol. Knowledge of any one of the scripting languages, such as Java, Python, Shell Scripting, or more »
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
First Derivative
wide variety of projects to gain hands on business exposure, we also provide external training courses and accreditation's such as AWS, GCP, Azure, Cloudera etc. First Derivative encourage diversity within our company. We encourage and lead inclusivity through our Pride, Multicultural, Alumni and Women networks that are run by more »
Greater London, England, United Kingdom Hybrid / WFH Options
InterEx Group
experience in Big Data implementation projects Experience in the definition of Big Data architecture with different tools and environments: Cloud (AWS, Azure and GCP), Cloudera, No-sql databases (Cassandra, Mongo DB), ELK, Kafka, Snowflake, etc. Past experience in Data Engineering and data quality tools (Informatica, Talend, etc.) Previous involvement in more »
and integrating data Python for ETL processes and automation. Hands-on experience with AWS/Azure data lake and data warehousing technologies. Familiarity with Cloudera, Teradata, and Agile/DevOps methodologies would be a bonus. Strong ability to communicate and collaborate with Data Architects. What you’ll get in return more »
data projects and programmes working in complex, multi-vendor change delivery organisations. You have deep distributed data platform knowledge and experience particularly on the Cloudera CDH/CDP. You have excellent knowledge of distributed data design principles commonly used in Hadoop and a solid understanding of processing large datasets (including more »
a wide variety of projects to gain hands on business exposure, we also provide external training courses and accreditations such as AWS, GCP, Azure, Cloudera etc. First Derivative encourage diversity within our company. We know that our people are vital to our success, and we are proud of the diverse more »
make corrective recommendations. Monitoring – Be able to monitor Spark jobs using wider tools such as Grafana to see whether there are Cluster level failures. Cloudera (CDP) – Knowledge of understanding how Cloudera Spark is set up and how the run time libraries are used by PySpark code. Prophecy – High level understanding more »