EthicalML/kafka-spark-streaming-zeppelin-docker

One-click docker-compose deployment of Kafka, Spark Streaming, Zeppelin UI, and monitoring (Grafana + Kafka Manager)

☆120

Alternatives and similar repositories for kafka-spark-streaming-zeppelin-docker

Users who are interested in kafka-spark-streaming-zeppelin-docker are comparing it to the repositories listed below.

colemanja91 / docker-kafka-spark-poc
Quickly set up a POC environment for Kafka+Spark
☆14 · Oct 10, 2017 · Updated 8 years ago
hakanilter / kafka-spark-streaming
An example project for Kafka and Spark Streaming integration
☆11 · Apr 21, 2023 · Updated 3 years ago
DrSnowbird / docker-spark-bde2020-zeppelin
Zeppelin docker
☆16 · Nov 16, 2020 · Updated 5 years ago
enkhalifapro / bigdata-all-in-one
Docker-compose contains the most common big data systems like: Apache Hadoop, Apache Hive, Apache Spark, Jupyter, Flink
☆28 · Oct 9, 2023 · Updated 2 years ago
panovvv / hadoop-hive-spark-docker
Base Docker image with just essentials: Hadoop, Hive and Spark.
☆67 · Feb 3, 2021 · Updated 5 years ago
Wesley-Bos / spark3.0-examples
Basic Spark examples.
☆11 · Jan 12, 2021 · Updated 5 years ago
Shohruh72 / PIPNet
The Facial Landmark Detection
☆16 · Jul 20, 2025 · Updated last year
panovvv / bigdata-docker-compose
Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.
☆168 · Feb 4, 2021 · Updated 5 years ago
dursunkoc / flink-kafka-sample
A Basic Flink Application Consuming & Aggregating Kafka Messages
☆10 · Nov 8, 2019 · Updated 6 years ago
davidcampos / kafka-spark-flink-example
Kafka streaming with Spark and Flink example
☆31 · Jul 16, 2023 · Updated 3 years ago
dsaidgovsg / python-spark
Docker image for a Python installation with Spark, Hadoop and Sqoop binaries
☆15 · Jan 26, 2018 · Updated 8 years ago
lenaxia / home-ops-dev
☆18 · Updated this week
zhujun98 / data-engineering
Spark, Airflow, Kafka
☆24 · Apr 30, 2023 · Updated 3 years ago
pixipanda / FraudDetection
Real-time Credit card Fraud detection using Spark Streaming, Spark ML, Spark SQL, Kafka, Cassandra and Airflow
☆11 · Jul 1, 2022 · Updated 4 years ago
shabie / streaming_nd
Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions
☆17 · Aug 14, 2023 · Updated 2 years ago
crunchy-devops / jenkins-pic
CI/CD platform using Jenkins, docker, Sonar, Nexus, Jmeter, Selenium, Ansible, AWX, Grafana, Prometheus, Zabbix, Stress-ng
☆21 · May 25, 2026 · Updated 2 months ago
PacktPublishing / Kubernetes-for-Developers
Kubernetes for Developers, published by Packt
☆15 · Jan 30, 2023 · Updated 3 years ago
mjhea0 / flask-spark-docker
Just a boilerplate for PySpark and Flask
☆36 · Aug 2, 2018 · Updated 7 years ago
dmatrix / mlflow-workshop-part-3
Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p…
☆35 · Jun 22, 2020 · Updated 6 years ago
Wittline / apache-spark-docker
Dockerizing an Apache Spark Standalone Cluster
☆42 · Jun 29, 2022 · Updated 4 years ago
astrojuanlu / desalkila
☆17 · Updated this week
ritchie46 / serverless-model-aws
Deploy any Machine Learning model serverless in AWS.
☆23 · Oct 17, 2018 · Updated 7 years ago
ahujaraman / live_log_analyzer_spark
Spark Application for analysis of Apache Access logs and detect anomalies! Along with Medium Article.
☆21 · Jan 30, 2019 · Updated 7 years ago
bryanyang0528 / docker-spark-hive-ipython
Spark + Jupyter + Hive
☆16 · Sep 22, 2015 · Updated 10 years ago
dmatrix / mlflow-workshop-part-2
Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p…
☆39 · Apr 6, 2021 · Updated 5 years ago
marcelmittelstaedt / BigData
Lecture: Big Data
☆14 · Oct 27, 2025 · Updated 9 months ago
eneskemalergin / MachineLearning_Beyond
Repository to store machine learning, artificial intelligence, and deep learning implementations with explanations
☆10 · Apr 17, 2018 · Updated 8 years ago
big-data-europe / docker-hadoop-spark-workbench
[EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook a…
☆701 · Oct 1, 2020 · Updated 5 years ago
big-data-europe / docker-spark
Apache Spark docker image
☆2,050 · Apr 20, 2026 · Updated 3 months ago
noahgift / or
Operations Research Algorithms
☆18 · Mar 20, 2024 · Updated 2 years ago
redpanda-data-blog / 2022-redpanda-duckdb
☆12 · Jan 20, 2023 · Updated 3 years ago
mvalleavila / Kafka-Spark-Hbase-Example
☆40 · Aug 19, 2015 · Updated 10 years ago
san089 / Optimizing-Public-Transportation
A real-time event pipeline around Kafka Ecosystem for Chicago Transit Authority.
☆33 · Aug 14, 2023 · Updated 2 years ago
cordon-thiago / airflow-spark
Docker with Airflow and Spark standalone cluster
☆264 · Aug 5, 2023 · Updated 2 years ago
sonarsushant / Loan-Defaulter-Prediction
Classification problem to predict loan defaulters using Lending Club Dataset
☆11 · Jan 26, 2019 · Updated 7 years ago
javicacheiro / pyspark_course
Material for the PySpark course
☆15 · May 12, 2026 · Updated 2 months ago
omarmhaimdat / quickner
Quickner is a new tool to quickly annotate texts for NER (Named Entity Recognition). It is written in Rust and accessible through a Pytho…
☆22 · Feb 24, 2024 · Updated 2 years ago
yennanliu / spark-etl-pipeline
Various data stream/batch process demo with Apache Scala Spark 🚀
☆12 · Feb 28, 2020 · Updated 6 years ago
antlypls / spark-kafka-docker-demo
A sample project showing how to run a Spark Streaming app with Kafka in Docker
☆35 · Oct 25, 2017 · Updated 8 years ago