dibbhatt/kafka-spark-consumer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dibbhatt/kafka-spark-consumer)

dibbhatt / kafka-spark-consumer

High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper. No Data-loss. No dependency on HDFS and WAL. In-built PID rate controller. Support Message Handler . Offset Lag checker.

☆632

Alternatives and similar repositories for kafka-spark-consumer

Users that are interested in kafka-spark-consumer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

koeninger / kafka-exactly-once
View on GitHub
☆242Jun 14, 2018Updated 8 years ago
tresata / spark-kafka
View on GitHub
Low level integration of Spark and Kafka
☆129Mar 15, 2018Updated 8 years ago
miguno / kafka-storm-starter
View on GitHub
[PROJECT IS NO LONGER MAINTAINED] Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streamin…
☆721Mar 22, 2022Updated 4 years ago
cloudera-labs / SparkOnHBase
View on GitHub
SparkOnHBase
☆278Mar 30, 2021Updated 5 years ago
spark-jobserver / spark-jobserver
View on GitHub
REST job server for Apache Spark
☆2,837Mar 3, 2026Updated 4 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
trK54Ylmz / kafka-spark-streaming-example
View on GitHub
Simple examle for Spark Streaming over Kafka topic
☆106Oct 13, 2020Updated 5 years ago
spirom / spark-streaming-with-kafka
View on GitHub
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.
☆196Apr 15, 2018Updated 8 years ago
chimpler / blog-spark-streaming-log-aggregation
View on GitHub
Example of use of Spark Streaming with Kafka
☆89Jul 11, 2014Updated 12 years ago
databricks / spark-avro
View on GitHub
Avro Data Source for Apache Spark
☆537Dec 19, 2018Updated 7 years ago
Huawei-Spark / Spark-SQL-on-HBase
View on GitHub
Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces
☆316Apr 12, 2022Updated 4 years ago
lw-lin / CoolplaySpark
View on GitHub
酷玩 Spark: Spark 源代码解析、Spark 类库等
☆3,475May 18, 2022Updated 4 years ago
mkuthan / example-spark-kafka
View on GitHub
Apache Spark and Apache Kafka integration example
☆122Dec 21, 2017Updated 8 years ago
ippontech / metrics-spark-reporter
View on GitHub
Dropwizard Metrics reporter for Apache Spark
☆28Dec 22, 2014Updated 11 years ago
databricks / reference-apps
View on GitHub
Spark reference applications
☆649Oct 3, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
nerdammer / spark-hbase-connector
View on GitHub
Connect Spark to HBase for reading and writing data with ease
☆296Dec 19, 2017Updated 8 years ago
confluentinc / kafka-connect-hdfs
View on GitHub
Kafka Connect HDFS connector
☆27Updated this week
hbase-rdd / hbase-rdd
View on GitHub
Spark RDD to read, write and delete from HBase
☆275Jan 22, 2021Updated 5 years ago
Stratio / sparta
View on GitHub
Real Time Analytics and Data Pipelines based on Spark Streaming
☆530Oct 24, 2019Updated 6 years ago
polomarcus / Spark-Structured-Streaming-Examples
View on GitHub
Spark Structured Streaming / Kafka / Cassandra / Elastic
☆186Feb 7, 2023Updated 3 years ago
ippontech / spark-kafka-source
View on GitHub
Kafka stream for Spark with storage of the offsets in ZooKeeper
☆60Apr 18, 2017Updated 9 years ago
jerryshao / spark-kafka-0-8-sql
View on GitHub
Spark Structured Streaming Kafka 0.8 Source Implementation
☆35Apr 27, 2017Updated 9 years ago
apache / gobblin
View on GitHub
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…
☆2,270Jun 24, 2026Updated 3 weeks ago
BenFradet / spark-kafka-writer
View on GitHub
Write your Spark data to Kafka seamlessly
☆172Jul 10, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
gh-gan / SparkStreaming
View on GitHub
Streaming 相关项目
☆15Mar 27, 2017Updated 9 years ago
qindongliang / streaming-offset-to-zk
View on GitHub
一个手动管理spark streaming集成kafka时的偏移量到zookeeper中的小项目
☆133Dec 17, 2025Updated 7 months ago
E-SoulDataGroup / spark_streaming_kafka_offset
View on GitHub
SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失
☆43Aug 2, 2017Updated 8 years ago
sryza / spark-timeseries
View on GitHub
A library for time series analysis on Apache Spark
☆1,197Oct 13, 2020Updated 5 years ago
JerryLead / SparkInternals
View on GitHub
Notes talking about the design and implementation of Apache Spark
☆5,361Apr 2, 2024Updated 2 years ago
mkuthan / example-spark
View on GitHub
Spark, Spark Streaming and Spark SQL unit testing strategies
☆215Oct 12, 2016Updated 9 years ago
japila-books / apache-spark-internals
View on GitHub
The Internals of Apache Spark
☆1,547Updated this week
databricks / spark-csv
View on GitHub
CSV Data Source for Apache Spark 1.x
☆1,057Dec 13, 2018Updated 7 years ago
yahoo / CMAK
View on GitHub
CMAK is a tool for managing Apache Kafka clusters
☆11,927Aug 2, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
wankunde / logcount
View on GitHub
基于spark streaming和kafka，hbase的日志统计分析系统
☆264Sep 5, 2017Updated 8 years ago
apache / cassandra-spark-connector
View on GitHub
Apache Spark to Apache Cassandra connector
☆1,949Apr 29, 2025Updated last year
databricks / spark-perf
View on GitHub
Performance tests for Apache Spark
☆392Jul 9, 2018Updated 8 years ago
killrweather / killrweather
View on GitHub
KillrWeather is a reference application (work in progress) showing how to easily integrate streaming and batch data processing with Apach…
☆1,180Jan 5, 2017Updated 9 years ago
linkedin / kafka-monitor
View on GitHub
Xinfra Monitor monitors the availability of Kafka clusters by producing synthetic workloads using end-to-end pipelines to obtain derived …
☆2,061Mar 9, 2025Updated last year
cpbaranwal / Avro-SparkStreaming-Kafka
View on GitHub
Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)
☆29Sep 9, 2016Updated 9 years ago
cloudera / livy
View on GitHub
Livy is an open source REST interface for interacting with Apache Spark from anywhere
☆1,008Oct 5, 2022Updated 3 years ago