dbusteed / kafka-spark-streaming-exampleLinks

☆36

Alternatives and similar repositories for kafka-spark-streaming-example

Users that are interested in kafka-spark-streaming-example are comparing it to the libraries listed below

Sorting:

yennanliu / NYC_Taxi_Pipeline
Stream/batch system with Hadoop, Spark on NYC taxi data | #DE
☆26Updated 2 months ago
indiacloudtv / structuredstreamingkafkapyspark
Apche Spark Structured Streaming with Kafka using Python(PySpark)
☆40Updated 6 years ago
indiacloudtv / pyspark_on_google_colab
PySpark Tutorial for Beginners on Google Colab: Hands-On Guide
☆17Updated 5 years ago
itversity / data-engineering-spark
☆88Updated 3 years ago
shravan-kuchkula / udacity-data-eng-proj2
A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…
☆24Updated 4 years ago
LearningJournal / Spark-Streaming-In-Python
Apache Spark 3 - Structured Streaming Course Material
☆125Updated 2 years ago
shravan-kuchkula / udacity-data-eng-proj4
Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …
☆16Updated 6 years ago
shravan-kuchkula / udacity-data-eng-proj-1
Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…
☆89Updated 4 years ago
jleetutorial / python-spark-streaming
☆151Updated 7 years ago
vim89 / datapipelines-essentials-python
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…
☆55Updated 2 years ago
sankamuk / PysparkCheatsheet
PySpark Cheatsheet
☆36Updated 2 years ago
JoseRFJuniorLLMs / PySpark-ETL
PySpark-ETL
☆22Updated 5 years ago
BenSchr / Udacity-Data-Engineering-Projects
My solutions for the Udacity Data Engineering Nanodegree
☆34Updated 6 years ago
rodrigo-arenas / kafkaml-anomaly-detection
Project for real-time anomaly detection using Kafka and python
☆58Updated 2 years ago
PacktPublishing / Data-Engineering-with-Apache-Spark-Delta-Lake-and-Lakehouse
Data Engineering with Spark and Delta Lake
☆105Updated 2 years ago
immu0001 / Udacity-Data-Engineer-nanodegree
Classwork projects and home works done through Udacity data engineering nano degree
☆74Updated last year
kadnan / Calories-Alert-Kafka
Simple alert system implemented in Kafka and Python
☆95Updated 7 years ago
MBKraus / incremental_training
Repo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow' and MLFlow'
☆121Updated 2 years ago
dogukannulu / airflow_kafka_cassandra_mongodb
Produce Kafka messages, consume them and upload into Cassandra, MongoDB.
☆42Updated 2 years ago
pran4ajith / spark-twitter-streaming
A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…
☆29Updated 5 years ago
PacktPublishing / Mastering-Big-Data-Analytics-with-PySpark
Mastering Big Data Analytics with PySpark, Published by Packt
☆163Updated last year
yugokato / Spark-and-Kafka_IoT-Data-Processing-and-Analytics
Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time
☆71Updated 9 years ago
mahmoudparsian / data-algorithms-with-spark
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
☆223Updated 2 years ago
harjeet88 / spark-streaming
☆13Updated 5 years ago
airscholar / FlinkCommerce
This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…
☆45Updated last year
hyunjoonbok / PySpark
PySpark functions and utilities with examples. Assists ETL process of data modeling
☆104Updated 4 years ago
tirthajyoti / Spark-with-Python
Fundamentals of Spark with Python (using PySpark), code examples
☆355Updated 3 years ago
SatadruMukherjee / Data-Preprocessing-Models
☆70Updated 3 weeks ago
josephmachado / beginner_de_project_stream
Simple stream processing pipeline
☆110Updated last year
vivek-bombatkar / Spark-with-Python---My-learning-notes-
ETL pipeline using pyspark (Spark - Python)
☆116Updated 5 years ago