distributed systems, using solutions such as Spark, Big Data Technologies would be preferred but not mandatory. Knowledge of Big Data querying tools (Cloudera stack or similar), e.g. Hive or Impala, would be preferred but not mandatory. Experience working on parallel development tracks at the same time is required. Experience in leading smaller development teams is necessary. Adhere to the
Tools, Monitoring utilities, Disaster recovery process/tools. Experience in troubleshooting and problem resolution. Experience in System Integration. Knowledge of the following: Hadoop, Flume, Sqoop, MapReduce, Hive/Impala, HBase, Kafka, Spark Streaming. Experience of ETL tools incorporating Big Data. Shell Scripting, Python. Beneficial Skills: Understanding of: LAN, WAN, VPN and SD Networks. Hardware and Cabling set-up
open/close the workspace during regular business hours as needed. Preferred Requirements • Experience with big data technologies like: Hadoop, Accumulo, Ceph, Spark, NiFi, Kafka, PostgreSQL, ElasticSearch, Hive, Drill, Impala, Trino, Presto, etc. • Experience with containers, EKS, Diode, CI/CD, and Terraform is a plus. Benefits $152,000-$198,000 salary per year, depending on experience. 11 Federal
accept TS/SCI or TS/SCI with CI Polygraph Desired Experience: Experience with big data technologies like: Hadoop, Accumulo, Ceph, Spark, NiFi, Kafka, PostgreSQL, ElasticSearch, Hive, Drill, Impala, Trino, Presto, etc. Work could possibly require some on-call work. The Swift Group and Subsidiaries are an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive
is preferred. High level of competence in SQL, Python, Spark/Scala, and Unix/Linux scripts. Real-world experience using Hadoop and the related query engines (Hive/Impala) for big data processing. Ability to construct model features utilizing open-banking data, in-house data, and/or third-party data to enhance rules and models. Experience utilizing
RDBMS such as DB2, Oracle. • Exposure to and hands-on experience in Microservices, Distributed Cache (Redis, Couchbase) and Cloud technologies • Good to have knowledge and experience in Big Data: HBase and Impala concepts. • Experienced with XML parsing (including schemas), JSON and third-party libraries like Guava, Lombok. • Well versed with design standards & frameworks; experience in working on multiple technologies. • Quick learner
knowledge of warehousing and ETLs. Extensive knowledge of popular database providers such as SQL Server, PostgreSQL, Teradata and others. • Proficiency in technologies in the Apache Hadoop ecosystem, especially Hive, Impala and Ranger • Experience working with open file and table formats such as Parquet, Avro, ORC, Iceberg and Delta Lake • Extensive knowledge of automation and software development tools and methodologies. • Excellent
/SQL - Must Have. Job Description: Scala/Spark • Good Big Data resource with the below Skillset: § Spark § Scala § Hive/HDFS/HQL • Linux Based Hadoop Ecosystem (HDFS, Impala, Hive, HBase, etc.) • Experience in Big Data technologies; real-time data processing platform (Spark Streaming) experience would be an advantage. • Consistently demonstrates clear and concise written and verbal communication
bash, Python, or Go Must have a DoD 8140/8570 compliance certification (i.e. Security+ certification) Preferred Experience with big data technologies like: Hadoop, Spark, MongoDB, ElasticSearch, Hive, Drill, Impala, Trino, Presto, etc. Experience with containers and Kubernetes More ❯
Science, Statistics, Applied Mathematics, or Engineering - Strong experience with Python and R - A strong understanding of a number of the tools across the Hadoop ecosystem such as Spark, Hive, Impala & Pig - An expertise in at least one specific data science area such as text mining, recommender systems, pattern recognition or regression models - Previous experience in leading a team, ideally
process mentality and ability to create data management, analytics and reporting tools from scratch. Fluency in Tableau, Power BI or similar data visualization software. Advanced proficiency in SQL (Hive, Impala, Databricks). A high level of learning agility - you can pick things up and learn quickly on your own, collaborate with new stakeholders, and drive with determination to
over 12,000 people working across more than 60 offices. The Trafigura Group owns global multi-metals producer Nyrstar; fuel storage and distribution company Puma Energy; and joint ventures Impala Terminals, a port and logistics provider, and Nalo Renewables, investing in wind, solar and battery storage projects.