exasol / hadoop-etl-udfsLinks
The Hadoop ETL UDFs are the main way to load data from Hadoop into EXASOL
☆16Updated 2 years ago
Alternatives and similar repositories for hadoop-etl-udfs
Users that are interested in hadoop-etl-udfs are comparing it to the libraries listed below
Sorting:
- Entry point repository for the EXASOL Virtual Schemas☆24Updated last week
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Updated 8 years ago
- Spark Example using Phoenix to interact with HBase☆16Updated 8 years ago
- Test your Hive scripts inside your favorite IDE with HiveQLUnit! Increase your developers productivity by testing on all operating system…☆39Updated 4 years ago
- Spark code to analyze HBase Snapshots☆34Updated 7 years ago
- Serde for Cobol Layout to Hive table☆24Updated 6 years ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- Cloudera CDP SDK for Java☆13Updated last week
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23Updated 7 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 7 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 3 years ago
- Sample Spark Streaming application for secure consumption from Kafka☆33Updated 7 years ago
- Apache Flink as a Cloudera Manager Service☆12Updated 9 years ago
- Toolkit that can bundle any Spring Boot application into an Apache Ambari Service, enabling Ambari to provision, manage and monitor the s…☆13Updated 9 years ago
- Sample processing code using Spark 2.1+ and Scala☆52Updated 4 years ago
- Kafka, Spark Streaming, Kudu integration examples☆17Updated 7 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Updated last year
- Scalable CDC Pattern Implemented using PySpark☆18Updated 5 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Spark Streaming HBase Example☆22Updated 9 years ago
- BigQuery connector for Apache Flink☆31Updated 2 weeks ago
- Quark is a data virtualization engine over analytic databases.☆98Updated 7 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- Simple Spark app that reads and writes Avro data☆31Updated 10 years ago
- Port of TPC-DS dsdgen to Java☆49Updated 10 months ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Updated 8 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago