dibbhatt / kafka-spark-consumerView external linksLinks
High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper. No Data-loss. No dependency on HDFS and WAL. In-built PID rate controller. Support Message Handler . Offset Lag checker.
☆636Feb 26, 2022Updated 3 years ago
Alternatives and similar repositories for kafka-spark-consumer
Users that are interested in kafka-spark-consumer are comparing it to the libraries listed below
Sorting:
- ☆243Jun 14, 2018Updated 7 years ago
- Low level integration of Spark and Kafka☆130Mar 15, 2018Updated 7 years ago
- [PROJECT IS NO LONGER MAINTAINED] Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streamin…☆723Mar 22, 2022Updated 3 years ago
- SparkOnHBase☆278Mar 30, 2021Updated 4 years ago
- REST job server for Apache Spark☆2,845Jul 8, 2025Updated 7 months ago
- Simple examle for Spark Streaming over Kafka topic☆108Oct 13, 2020Updated 5 years ago
- Connect Spark to HBase for reading and writing data with ease☆295Dec 19, 2017Updated 8 years ago
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces☆315Apr 12, 2022Updated 3 years ago
- Example of use of Spark Streaming with Kafka☆90Jul 11, 2014Updated 11 years ago
- Spark RDD to read, write and delete from HBase☆273Jan 22, 2021Updated 5 years ago
- Apache Spark and Apache Kafka integration example☆124Dec 21, 2017Updated 8 years ago
- 酷玩 Spark: Spark 源代码解析、Spark 类库等☆3,485May 18, 2022Updated 3 years ago
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka.☆199Apr 15, 2018Updated 7 years ago
- Spark, Spark Streaming and Spark SQL unit testing strategies☆216Oct 12, 2016Updated 9 years ago
- Real Time Analytics and Data Pipelines based on Spark Streaming☆531Oct 24, 2019Updated 6 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,260Jan 15, 2026Updated last month
- The Internals of Apache Spark☆1,538Jul 5, 2025Updated 7 months ago
- A library for time series analysis on Apache Spark☆1,195Oct 13, 2020Updated 5 years ago
- Spark Structured Streaming Kafka 0.8 Source Implementation☆35Apr 27, 2017Updated 8 years ago
- Write your Spark data to Kafka seamlessly☆174Jul 10, 2024Updated last year
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆185Feb 7, 2023Updated 3 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆281Aug 3, 2018Updated 7 years ago
- Notes talking about the design and implementation of Apache Spark☆5,357Apr 2, 2024Updated last year
- SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失☆44Aug 2, 2017Updated 8 years ago
- Kafka stream for Spark with storage of the offsets in ZooKeeper☆60Apr 18, 2017Updated 8 years ago
- 一个手动管理spark streaming集成kafka时的偏移量到zookeeper中的小项目☆133Dec 17, 2025Updated last month
- KillrWeather is a reference application (work in progress) showing how to easily integrate streaming and batch data processing with Apach…☆1,183Jan 5, 2017Updated 9 years ago
- Apache Spark to Apache Cassandra connector☆1,949Apr 29, 2025Updated 9 months ago
- CMAK is a tool for managing Apache Kafka clusters☆11,951Aug 2, 2023Updated 2 years ago
- Xinfra Monitor monitors the availability of Kafka clusters by producing synthetic workloads using end-to-end pipelines to obtain derived …☆2,057Mar 9, 2025Updated 11 months ago
- Interactive and Reactive Data Science using Scala and Spark.☆3,151May 16, 2023Updated 2 years ago
- Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)☆29Sep 9, 2016Updated 9 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,007Oct 5, 2022Updated 3 years ago
- A connector for Spark that allows reading and writing to/from Redis cluster☆947Oct 22, 2024Updated last year
- Elasticsearch real-time search and analytics natively integrated with Hadoop☆2,049Updated this week
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,848May 29, 2024Updated last year
- Structured Streaming Machine Learning example with Spark 2.0☆94Apr 24, 2017Updated 8 years ago
- Dropwizard Metrics reporter for Apache Spark☆28Dec 22, 2014Updated 11 years ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.☆550May 10, 2021Updated 4 years ago