dogukannulu / kafka_spark_structured_streaming

Get data from an API, run a scheduled script with Airflow, send the data to Kafka, consume it with Spark, then write it to Cassandra

☆143

Alternatives and similar repositories for kafka_spark_structured_streaming

Users that are interested in kafka_spark_structured_streaming are comparing it to the libraries listed below

coder2j / airflow-docker
Source code of the Apache Airflow Tutorial for Beginners on YouTube Channel Coder2j (https://www.youtube.com/c/coder2j)
☆323 Updated last year
sidharth1805 / Spotify_etl
☆142 Updated 2 years ago
HamzaG737 / data-engineering-project
End to end data engineering project with kafka, airflow, spark, postgres and docker.
☆102 Updated 7 months ago
airscholar / e2e-data-engineering
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…
☆285 Updated 8 months ago
RSKriegs / finnhub-streaming-data-pipeline
Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more
☆365 Updated last year
darshilparmar / twitter-airflow-data-engineering-project
YouTube tutorial project
☆105 Updated 2 years ago
josephmachado / data_engineering_project_template
A template repository to create a data project with IAC, CI/CD, Data migrations, & testing
☆279 Updated last year
uhussain / WebCrawlerForInflation
Price Crawler - Tracking Price Inflation
☆188 Updated 5 years ago
josephmachado / data_engineering_best_practices
Sample project to demonstrate data engineering best practices
☆197 Updated last year
alanchn31 / Movalytics-Data-Warehouse
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
☆157 Updated 5 years ago
dogukannulu / streaming_data_processing
Create streaming data, send it to Kafka, transform it with PySpark, and write it to Elasticsearch and MinIO
☆63 Updated 2 years ago
josephmachado / bitcoinMonitor
Near real time ETL to populate a dashboard.
☆72 Updated last month
shafiab / HashtagCashtag
My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on lambda architecture, that aggrega…
☆505 Updated 3 years ago
andrem8 / surf_dash
☆161 Updated 3 years ago
cordon-thiago / airflow-spark
Docker with Airflow and Spark standalone cluster
☆261 Updated 2 years ago
josephmachado / efficient_data_processing_spark
Code for "Efficient Data Processing in Spark" Course
☆345 Updated last week
kroudir / Data-Engineer-Nanodegree-Projects-Udacity
Projects done in the Data Engineer Nanodegree Program by Udacity.com
☆164 Updated 2 years ago
abdkumar / spotify-stream-analytics
Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consu…
☆69 Updated last year
dogukannulu / airflow_kafka_cassandra_mongodb
Produce Kafka messages, consume them and upload into Cassandra, MongoDB.
☆42 Updated 2 years ago
ankurchavda / streamify
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
☆758 Updated 3 years ago
josephmachado / beginner_de_project_stream
Simple stream processing pipeline
☆110 Updated last year
ABZ-Aaron / reddit-api-pipeline
☆375 Updated 9 months ago
hnawaz007 / pythondataanalysis
Python data repo, jupyter notebook, python scripts and data.
☆535 Updated 10 months ago
airscholar / RedditDataEngineering
This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…
☆159 Updated 2 years ago
mage-ai / mage-zoomcamp
This repository will contain all of the resources for the Mage component of the Data Engineering Zoomcamp: https://github.com/DataTalksCl…
☆101 Updated last year
PacktPublishing / Data-Engineering-with-AWS
Data Engineering with AWS, Published by Packt
☆332 Updated 2 years ago
ris-tlp / audiophile-e2e-pipeline
Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…
☆242 Updated 2 years ago
darshilparmar / dataengineering-youtube-analysis-project
Data Engineering YouTube Analysis Project by Darshil Parmar
☆204 Updated last year
darshilparmar / stock-market-kafka-data-engineering-project
☆205 Updated 2 years ago
josephmachado / beginner_de_project
Beginner data engineering project - batch edition
☆548 Updated 9 months ago