Built a real-time streaming pipeline to extract stock data, using Apache Nifi, Debezium, Kafka, and Spark Streaming. Loaded the transformed data into Glue database and created real-time dashboards using Power BI and Tableau with Athena. The pipeline is orchestrated using Airflow.
☆28Oct 13, 2023Updated 2 years ago
Alternatives and similar repositories for Stock_streaming_pipeline_project
Users that are interested in Stock_streaming_pipeline_project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr…☆13May 25, 2023Updated 3 years ago
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform☆14Oct 12, 2022Updated 3 years ago
- These are my personal data analysis projects. I mainly used R/Python programming for my data analysis. And also used BI tools such as Tab…☆15Dec 12, 2025Updated 5 months ago
- Real World Project on Formula1 Racing using Azure Databricks, Delta Lake and Azure Data Factory☆13Jul 24, 2023Updated 2 years ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).☆15Jun 3, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Dec 20, 2022Updated 3 years ago
- Links to Azure resources☆13Oct 16, 2020Updated 5 years ago
- ☆27Aug 30, 2024Updated last year
- dbt module for myBI connect☆13Jan 31, 2023Updated 3 years ago
- Get started setting up infrastructure as code on Google Cloud Platform☆11Jun 13, 2021Updated 4 years ago
- ☆20Feb 18, 2024Updated 2 years ago
- ☆10May 3, 2021Updated 5 years ago
- End to End Sales Streaming Pipeline (FastAPI, Kafka, Spark, Cassandra, MySQL, Superset)☆10May 26, 2023Updated 3 years ago
- Deployed an kafka instance in AWS EC2 Instance to streamline the data into Databricks☆10Aug 15, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implement a complete data warehouse etl using spark SQL☆14Sep 8, 2022Updated 3 years ago
- dbtVault + Greenplum demo☆11Feb 19, 2024Updated 2 years ago
- AI Object Detector with Next js 14, Tailwind CSS, Tenserflow, React☆31Mar 17, 2024Updated 2 years ago
- The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a compl…☆18Dec 26, 2023Updated 2 years ago
- Self-improving AI agents using Agentic Context Engineering - A starter implementation with Google ADK☆21Oct 23, 2025Updated 7 months ago
- ☆16Mar 15, 2024Updated 2 years ago
- Building event-driven data ingestion pipelines in Azure☆16Apr 27, 2023Updated 3 years ago
- ☆12Aug 11, 2021Updated 4 years ago
- ☆11Nov 18, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- API/Data Platform for Ingesting, Storing, and Serving Data through Postgres, and Litestar☆11Apr 25, 2026Updated last month
- ☆13Sep 5, 2025Updated 8 months ago
- A concise and comprehensive cheat sheet covering time complexities of Python's built-in data structures like Lists, Dictionaries, Sets, T…☆15Jan 24, 2025Updated last year
- C++ PyTorch Examples☆10Aug 18, 2019Updated 6 years ago
- Multi-Agent AI Application(Python) that uses Semantic-Kernel along with Azure AI Agent Service in Azure Ai Foundry☆15Mar 6, 2025Updated last year
- Python scraper for fbref.com☆16Jul 2, 2024Updated last year
- End to end data pipeline to extract and analyze submissions from any subreddit using Pushshift, python, dbt and BigQuery.☆12Jul 17, 2023Updated 2 years ago
- A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse.☆11Sep 4, 2025Updated 8 months ago
- ☆11Oct 6, 2020Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- LS증권 OpenApi 샘플☆18May 9, 2025Updated last year
- ☆20Nov 8, 2024Updated last year
- Implementation of the paper <Model-based Reinforcement Learning for Predictions and Control for Limit Order Books (Wei et al., J.P. Morga…☆11Aug 22, 2023Updated 2 years ago
- ☆22May 1, 2023Updated 3 years ago
- ☆14May 1, 2024Updated 2 years ago
- A fully serverless, event-driven data pipeline that ingests, enriches, validates, and visualizes real-time news data using AWS services. …☆25Aug 10, 2025Updated 9 months ago
- Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"☆12May 9, 2023Updated 3 years ago