gwenshap / sqoop2hive
Few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table
☆29Updated 10 years ago
Alternatives and similar repositories for sqoop2hive:
Users that are interested in sqoop2hive are comparing it to the libraries listed below
- Ambari service for Presto☆44Updated 2 months ago
- ☆54Updated 10 years ago
- Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers☆69Updated 2 years ago
- ☆44Updated 7 years ago
- A Maven-based example of using Cloudera Impala's JDBC driver☆118Updated 8 years ago
- Companion Code for Using Flume Book☆32Updated 9 years ago
- Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)☆29Updated 8 years ago
- Kafka Connect to Hbase☆43Updated 4 years ago
- Hive,Pig,Hbase,Sqoop examples☆16Updated 7 years ago
- ☆30Updated 2 years ago
- Spark 3.0.0 Structured Streaming Kafka Avro Demo☆15Updated last year
- spark + drools☆102Updated 2 years ago
- A series of demos using HBase Standalone and Phoenix/HBase☆19Updated 9 years ago
- Custom Service for deploying Apache Alluxio on a running HDP 2.3 / IOP 4.1 Ambari Managed Cluster☆13Updated 8 years ago
- sample oozie workflows☆18Updated 7 years ago
- Spark Streaming HBase Example☆95Updated 8 years ago
- Java library to integrate Flink and Kudu☆54Updated 7 years ago
- Ambari stack for easily installing and managing Redis on HDP cluster☆15Updated 9 years ago
- Star Schema Benchmark using the Hive / Druid Integration☆30Updated 7 years ago
- ☆17Updated 8 years ago
- Demos around Ambari Views, Services, Blueprints☆63Updated 9 years ago
- ☆26Updated 8 years ago
- Ambari service for Apache Flink☆126Updated 3 years ago
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Updated 8 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 7 years ago
- spark summit 2017 SanFrancisco☆97Updated 7 years ago
- A demo repository for "streaming etl" with Apache Flink☆44Updated 8 years ago
- ☆23Updated 6 years ago
- ansible playbook to deploy cloudera hadoop components to the cluster☆52Updated 6 years ago
- spark自学手册,包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。☆18Updated last year