Joshua-omolewa / Stock_streaming_pipeline_projectLinks

Built a real-time streaming pipeline to extract stock data, using Apache Nifi, Debezium, Kafka, and Spark Streaming. Loaded the transformed data into Glue database and created real-time dashboards using Power BI and Tableau with Athena. The pipeline is orchestrated using Airflow.

☆27

Alternatives and similar repositories for Stock_streaming_pipeline_project

Users that are interested in Stock_streaming_pipeline_project are comparing it to the libraries listed below

Sorting:

RSKriegs / finnhub-streaming-data-pipeline
Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more
☆357Updated last year
ris-tlp / audiophile-e2e-pipeline
Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…
☆237Updated 2 years ago
josephmachado / data_engineering_best_practices
Sample project to demonstrate data engineering best practices
☆196Updated last year
HamzaG737 / data-engineering-project
End to end data engineering project with kafka, airflow, spark, postgres and docker.
☆102Updated 5 months ago
josephmachado / efficient_data_processing_spark
Code for "Efficient Data Processing in Spark" Course
☆338Updated 3 months ago
dogukannulu / kafka_spark_structured_streaming
Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra
☆143Updated 2 years ago
sidharth1805 / Spotify_etl
☆142Updated 2 years ago
josephmachado / bitcoinMonitor
Near real time ETL to populate a dashboard.
☆72Updated last year
josephmachado / data_engineering_project_template
A template repository to create a data project with IAC, CI/CD, Data migrations, & testing
☆275Updated last year
cordon-thiago / airflow-spark
Docker with Airflow and Spark standalone cluster
☆261Updated 2 years ago
ABZ-Aaron / reddit-api-pipeline
☆364Updated 7 months ago
andrem8 / surf_dash
☆154Updated 3 years ago
dominikhei / Local-Data-LakeHouse
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…
☆74Updated 2 years ago
uhussain / WebCrawlerForInflation
Price Crawler - Tracking Price Inflation
☆187Updated 5 years ago
davidzajac1 / zillacode
Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake
☆206Updated 2 months ago
abdkumar / spotify-stream-analytics
Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consu…
☆69Updated last year
airscholar / e2e-data-engineering
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…
☆273Updated 7 months ago
ankurchavda / streamify
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
☆743Updated 3 years ago
shafiab / HashtagCashtag
My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on lambda architecture, that aggrega…
☆505Updated 3 years ago
josephmachado / online_store
End to end data engineering project
☆57Updated 2 years ago
alanchn31 / Movalytics-Data-Warehouse
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
☆155Updated 5 years ago
ssp-data / practical-data-engineering
Practical Data Engineering: A Hands-On Real-Estate Project Guide
☆699Updated last year
josephmachado / beginner_de_project
Beginner data engineering project - batch edition
☆539Updated 7 months ago
digitalghost-dev / premier-league
A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.
☆246Updated last year
coder2j / airflow-docker
Source code of the Apache Airflow Tutorial for Beginners on YouTube Channel Coder2j (https://www.youtube.com/c/coder2j)
☆315Updated last year
Amrit-Hub / Databricks-Certified-Data-Engineer-Professional-Questions
This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.
☆106Updated last year
afaqueahmad7117 / spark-experiments
Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews
☆164Updated this week
data-engineering-community / data-engineering-project-template
This is a template you can use for your next data engineering portfolio project.
☆181Updated 4 years ago
EcZachly / little-book-of-pipelines
This repository goes over how to handle massive variety in data engineering
☆298Updated 2 years ago
ANelson82 / de_zoomcamp_2022_earthquake_capstone
☆18Updated 2 years ago