miguno/kafka-storm-starter

[PROJECT IS NO LONGER MAINTAINED] Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.

☆721

Alternatives and similar repositories for kafka-storm-starter

Users who are interested in kafka-storm-starter are comparing it to the libraries listed below.

dibbhatt / kafka-spark-consumer
High Performance Kafka Connector for Spark Streaming. Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…
☆632 · Apr 24, 2026 · Updated 3 months ago
killrweather / killrweather
KillrWeather is a reference application (work in progress) showing how to easily integrate streaming and batch data processing with Apach…
☆1,180 · Jan 5, 2017 · Updated 9 years ago
koeninger / kafka-exactly-once
☆241 · Jun 14, 2018 · Updated 8 years ago
chimpler / blog-spark-streaming-log-aggregation
Example of use of Spark Streaming with Kafka
☆89 · Jul 11, 2014 · Updated 12 years ago
apache / storm
Apache Storm
☆6,695 · Updated this week
miguno / wirbelsturm
[PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a …
☆329 · Feb 21, 2022 · Updated 4 years ago
NetherlandsForensicInstitute / kafka-spout
Kafka consumer emitting messages as storm tuples
☆106 · Jan 8, 2021 · Updated 5 years ago
databricks / reference-apps
Spark reference applications
☆649 · Oct 3, 2024 · Updated last year
otoolep / stormkafkamon
Dumps state of Storm Kafka consumers
☆96 · Jan 15, 2018 · Updated 8 years ago
apache / gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…
☆2,269 · Updated this week
elodina / scala-kafka
Quick up and running using Scala for Apache Kafka
☆327 · Jul 2, 2017 · Updated 9 years ago
tresata / spark-kafka
Low level integration of Spark and Kafka
☆129 · Mar 15, 2018 · Updated 8 years ago
yahoo / CMAK
CMAK is a tool for managing Apache Kafka clusters
☆11,927 · Aug 2, 2023 · Updated 2 years ago
aseigneurin / kafka-sandbox
☆48 · Feb 4, 2018 · Updated 8 years ago
cloudera-labs / SparkOnHBase
SparkOnHBase
☆278 · Mar 30, 2021 · Updated 5 years ago
caroljmcdonald / SparkStreamingHBaseExample
Spark Streaming HBase Example
☆94 · Apr 4, 2016 · Updated 10 years ago
wurstmeister / storm-kafka-0.8-plus-test
Simple test project for storm-kafka-0.8-plus
☆72 · Mar 2, 2018 · Updated 8 years ago
krasserm / akka-analytics
Large-scale event processing with Akka Persistence and Apache Spark
☆271 · Jun 18, 2016 · Updated 10 years ago
spark-jobserver / spark-jobserver
REST job server for Apache Spark
☆2,836 · Mar 3, 2026 · Updated 4 months ago
LinkedInAttic / camus
LinkedIn's previous generation Kafka to HDFS pipeline.
☆879 · Aug 27, 2020 · Updated 5 years ago
databricks / learning-spark
Example code from Learning Spark book
☆3,892 · Jun 30, 2026 · Updated 3 weeks ago
nathanmarz / storm-starter
Learn to use Storm!
☆926 · Mar 9, 2016 · Updated 10 years ago
SinghAsDev / pankh
☆76 · May 19, 2015 · Updated 11 years ago
twitter / bijection
Reversible conversions between types
☆656 · Nov 22, 2024 · Updated last year
Stratio / sparta
Real Time Analytics and Data Pipelines based on Spark Streaming
☆530 · Oct 24, 2019 · Updated 6 years ago
OryxProject / oryx
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
☆1,783 · Aug 16, 2021 · Updated 4 years ago
Huawei-Spark / Spark-SQL-on-HBase
Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces
☆316 · Apr 12, 2022 · Updated 4 years ago
sryza / aas
Code to accompany Advanced Analytics with Spark from O'Reilly Media
☆1,524 · Sep 25, 2024 · Updated last year
apache / cassandra-spark-connector
Apache Spark to Apache Cassandra connector
☆1,950 · Apr 29, 2025 · Updated last year
sryza / spark-timeseries
A library for time series analysis on Apache Spark
☆1,197 · Oct 13, 2020 · Updated 5 years ago
elastic / elasticsearch-hadoop
Elasticsearch real-time search and analytics natively integrated with Hadoop
☆1,975 · Updated this week
JerryLead / SparkInternals
Notes talking about the design and implementation of Apache Spark
☆5,361 · Apr 2, 2024 · Updated 2 years ago
mkuthan / example-spark-kafka
Apache Spark and Apache Kafka integration example
☆122 · Dec 21, 2017 · Updated 8 years ago
databricks / spark-knowledgebase
Spark Knowledge Base
☆333 · Oct 1, 2020 · Updated 5 years ago
hbase-rdd / hbase-rdd
Spark RDD to read, write and delete from HBase
☆275 · Jan 22, 2021 · Updated 5 years ago
spark-notebook / spark-notebook
Interactive and Reactive Data Science using Scala and Spark.
☆3,142 · May 16, 2023 · Updated 3 years ago
cjmamo / kafka-web-console
A web console for Apache Kafka (retired)
☆760 · Apr 11, 2023 · Updated 3 years ago
gwenshap / SparkStreamingExample
☆55 · Aug 21, 2014 · Updated 11 years ago
harlixxy / StudyNotes
Study Notes
☆56 · Sep 23, 2019 · Updated 6 years ago