customer proposals aligned with Analytics Solutions. Experience with one or more relevant tools (Sqoop, Flume, Kafka, Oozie, Hue, Zookeeper, HCatalog, Solr, Avro, Parquet, Iceberg, Hudi). Experience developing software and data engineering code in one or more programming languages (Java, Python, PySpark, Node, etc.). AWS and other Data …
or Python development. Experience with tools like Terraform for provisioning AWS cloud services. Knowledge of AWS Glue, AWS Athena, and AWS S3. Understanding of Apache Parquet and open table formats such as Delta, Iceberg, and Hudi. Experience with Test-Driven Development using JUnit, Mockito, or similar tools. Extensive knowledge …
freeing data from data platform lock-in. We deliver the industry's most interoperable data lakehouse through a cloud-native managed service built on Apache Hudi. Onehouse enables organizations to ingest data at scale with minute-level freshness, centrally store it, and make it available to any downstream query … building the sales strategy around a large, successful open source project. You will be challenged to deeply understand data architecture and the key role Apache Hudi plays in some of the biggest enterprises in the world, then articulate the value proposition of the Onehouse managed service to potential …
core engines and datalake team. Athena and EMR are services that our customers use to run large-scale analytics, leveraging open source engines like Apache Spark and Trino, with datalake open table formats like Apache Iceberg, Hudi, and Delta. The analytics engines organization makes significant modifications to … in a growing and very technical space. We are seeking a passionate and hands-on engineer to collaborate closely with open-source communities like Apache Iceberg and Apache Spark, driving innovations in query engines and table format integrations. In this role, you will focus on performance optimizations, feature … require specialized security solutions for their cloud services. Key job responsibilities • Develop and optimize core components of query engines and open table formats (Iceberg, Hudi, Delta) to enhance performance, scalability, and reliability. • Design and implement innovative solutions and algorithms to improve feature capabilities, stability, and security in table format …