michal-harish / kafka-hadoop-loader
Hadoop Job for schemaless incremental loading of messages from Kafka topics onto hdfs with configurable output partitioning.
☆90Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for kafka-hadoop-loader
- Storm-yarn enables Storm clusters to be deployed into machines managed by Hadoop YARN.☆417Updated last year
- Spark Structured Streaming Kafka 0.8 Source Implementation☆35Updated 7 years ago
- Kafka Connect to Hbase☆43Updated 4 years ago
- ☆76Updated 11 years ago
- Plugins for Azkaban.☆130Updated 6 years ago
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces☆321Updated 2 years ago
- Flink Forward 2017-04-10 &11 ppt☆57Updated 7 years ago
- ☆558Updated 2 years ago
- ☆57Updated 5 years ago
- Remedy small files by combining them into larger ones.☆193Updated 2 years ago
- A Monitor over HBase, including Table,Region,RegionServer,Zookeeper monitoring etc.☆54Updated 5 years ago
- Spark Streaming HBase Example☆96Updated 8 years ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆154Updated last year
- A Maven-based example of using Cloudera Impala's JDBC driver☆117Updated 8 years ago
- loading hdfs data to clickhouse☆73Updated 2 years ago
- Kafka as Hive Storage☆67Updated 10 years ago
- Mirror of Apache Atlas (Incubating)☆95Updated last year
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆61Updated 8 years ago
- Explore the project Tungsten☆1Updated 8 years ago
- ☆122Updated 3 weeks ago
- spark summit 2017 SanFrancisco☆97Updated 7 years ago
- Learning to write Spark examples☆160Updated 10 years ago
- An Apache Flume Sink implementation to publish data to Apache Kafka☆59Updated 9 years ago