Built a real-time streaming pipeline to extract stock data, using Apache Nifi, Debezium, Kafka, and Spark Streaming. Loaded the transformed data into Glue database and created real-time dashboards using Power BI and Tableau with Athena. The pipeline is orchestrated using Airflow.
☆28Oct 13, 2023Updated 2 years ago
Alternatives and similar repositories for Stock_streaming_pipeline_project
Users that are interested in Stock_streaming_pipeline_project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr…☆12May 25, 2023Updated 2 years ago
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform☆14Oct 12, 2022Updated 3 years ago
- Real World Project on Formula1 Racing using Azure Databricks, Delta Lake and Azure Data Factory☆13Jul 24, 2023Updated 2 years ago
- This project aims to move the data from a Relational database system (RDBMS) to a Hadoop file system (HDFS)☆11Apr 29, 2022Updated 4 years ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).☆15Jun 3, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Dec 20, 2022Updated 3 years ago
- Scripts to convert tables from SQL Server to Snowflake☆13Jun 27, 2019Updated 6 years ago
- In this project I used apache airflow to scrape website periodically. This is for the tutorials I do on youtube.☆10Nov 21, 2022Updated 3 years ago
- ☆27Aug 30, 2024Updated last year
- dbt module for myBI connect☆13Jan 31, 2023Updated 3 years ago
- Parses a json file of reddit comments and dump it to a MySQL database☆11Mar 19, 2018Updated 8 years ago
- ☆10May 3, 2021Updated 5 years ago
- End to End Sales Streaming Pipeline (FastAPI, Kafka, Spark, Cassandra, MySQL, Superset)☆10May 26, 2023Updated 2 years ago
- Here, I include all the required resources for the courses I teach, either online or in academia☆35Jun 18, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a compl…☆18Dec 26, 2023Updated 2 years ago
- Self-improving AI agents using Agentic Context Engineering - A starter implementation with Google ADK☆21Oct 23, 2025Updated 6 months ago
- Building event-driven data ingestion pipelines in Azure☆16Apr 27, 2023Updated 3 years ago
- Collect orderbook data from crypto exchanges and publish as GRPC☆13Jun 19, 2022Updated 3 years ago
- ☆11Nov 18, 2022Updated 3 years ago
- API/Data Platform for Ingesting, Storing, and Serving Data through Postgres, and Litestar☆11Apr 25, 2026Updated 2 weeks ago
- Reviewing and statistically testing trading strategy ideas implemented in QuantCT app.☆13Jun 22, 2021Updated 4 years ago
- Docktor is a Web App that deploys an easy-to-use kit of analysis and scanning tools.☆13Nov 1, 2023Updated 2 years ago
- A concise and comprehensive cheat sheet covering time complexities of Python's built-in data structures like Lists, Dictionaries, Sets, T…☆15Jan 24, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- C++ PyTorch Examples☆10Aug 18, 2019Updated 6 years ago
- End to end data pipeline to extract and analyze submissions from any subreddit using Pushshift, python, dbt and BigQuery.☆12Jul 17, 2023Updated 2 years ago
- ☆11Oct 6, 2020Updated 5 years ago
- ☆17Oct 28, 2022Updated 3 years ago
- ☆21Nov 20, 2023Updated 2 years ago
- LS증권 OpenApi 샘플☆17May 9, 2025Updated last year
- Implementation of the paper <Model-based Reinforcement Learning for Predictions and Control for Limit Order Books (Wei et al., J.P. Morga…☆11Aug 22, 2023Updated 2 years ago
- For this project I am creating an ETL (Extract, Transform, and Load) pipeline using Python, RegEx, and SQL Database. The goal is to retri…☆26Feb 9, 2021Updated 5 years ago
- ☆14May 1, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This repository contains a Docker Compose configuration for running ScyllaDB, a highly scalable NoSQL database for learning and testing.☆14Sep 19, 2024Updated last year
- Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"☆12May 9, 2023Updated 3 years ago
- Nifi 1.9.0☆24Apr 17, 2019Updated 7 years ago
- Deep Learning the Sorting Algorithm☆12Dec 11, 2016Updated 9 years ago
- A news based stock scalper using LLM and quant approach☆15Jan 16, 2025Updated last year
- Quickly set up a basic data warehouse with Terraform's Snowflake provider☆28Nov 17, 2021Updated 4 years ago
- Price Crawler - Tracking Price Inflation☆205Jun 23, 2020Updated 5 years ago