cpbaranwal / Avro-SparkStreaming-Kafka
Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)
☆29Updated 8 years ago
Alternatives and similar repositories for Avro-SparkStreaming-Kafka:
Users that are interested in Avro-SparkStreaming-Kafka are comparing it to the libraries listed below
- SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失☆45Updated 7 years ago
- A Spark SQL HBase connector☆29Updated 10 years ago
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Updated 8 years ago
- Spark Streaming HBase Example☆96Updated 9 years ago
- This is a based on playframwork for submit spark app☆60Updated last year
- A sink to save Spark Structured Streaming DataFrame into Hive table☆31Updated 7 years ago
- Ambari service for Presto☆44Updated 3 months ago
- Java library to integrate Flink and Kudu☆54Updated 7 years ago
- spark实例代码☆78Updated 7 years ago
- 一个为spark批量导入数据到hbase的库☆43Updated 8 years ago
- ☆105Updated 5 years ago
- spark + drools☆102Updated 2 years ago
- spark将hdfs数据高性能灌入kafka,然后spark streaming/structured streaming高速消费,关注性能,欢迎提供性能/代码优化建议☆33Updated 6 years ago
- 使用spark streaming 导入kafka数据到hbase☆25Updated 9 years ago
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Updated 8 years ago
- Code repository for the book - Mastering Flink by Tanmay Deshpande☆74Updated 8 years ago
- A web application for submitting spark application☆8Updated 4 years ago
- Kafka stream for Spark with storage of the offsets in ZooKeeper☆60Updated 8 years ago
- A library based on delta for Spark and MLSQL☆61Updated 4 years ago
- Spark Structured Streaming Kafka 0.8 Source Implementation☆35Updated 8 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23Updated 6 years ago
- Kafka Connect to Hbase☆43Updated 4 years ago
- Serviceframework一个简单但灵活的模块引擎☆31Updated 7 years ago
- spark流数据处理,可以从flume-ng,kafka接收数据☆11Updated 9 years ago
- A demo repository for "streaming etl" with Apache Flink☆44Updated 8 years ago
- 使用Spark的MLlib、Hbase作为模型、Hive作数据清洗的核心推荐引擎,在Spark on Yarn测试通过☆30Updated 8 years ago
- ☆29Updated 6 years ago
- spark summit 2017 SanFrancisco☆97Updated 7 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 7 years ago
- Capture changes of HBase to Kafka☆30Updated 9 years ago