DucAnhNTT/bigdata-ETL-pipeline

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DucAnhNTT/bigdata-ETL-pipeline)

DucAnhNTT / bigdata-ETL-pipeline

The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a complete data pipeline with all components seamlessly set up and ready to use

☆18

Alternatives and similar repositories for bigdata-ETL-pipeline

Users that are interested in bigdata-ETL-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kaoutaar / end-to-end-etl-pipeline-jcdecaux-API
View on GitHub
velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…
☆21Aug 12, 2025Updated 11 months ago
sergio11 / traffic_sentinel_architecture
View on GitHub
📚🧪 Traffic Sentinel is a learning-focused POC that explores a scalable IoT architecture using Fog nodes and Apache Flink to process 📷 …
☆28Dec 29, 2025Updated 6 months ago
japerry911 / crypto-data-pipeline
View on GitHub
Data Pipeline that utilizes GCP, Python 3.10, Prefect, and more.
☆10Jan 23, 2023Updated 3 years ago
faizeraza / dataengineering-github-data-pipelineline
View on GitHub
In this project I have built etl pipline which scraps the trending repository based on month,week and day LIVE extract other related info…
☆12Sep 9, 2023Updated 2 years ago
hant121 / shortrangeradar
View on GitHub
Short Range Ultrasonic Radar - A simple radar using the ultrasonic sensor, this radar works by measuring a range from 3cm to 40 cm as non…
☆19Nov 11, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
airscholar / changecapture-e2e
View on GitHub
This project shows how to capture changes from postgres database and stream them into kafka
☆41May 17, 2024Updated 2 years ago
Siddhartha80 / AI-Powered-Predictive-Maintenance-System-for-Vehicles-with-Real-Time-Data-Visualization-and-Analysis
View on GitHub
Gradient Boosting Models on Real-Time Sensor Data for AI-Enhanced Vehicle Predictive Maintenance. By using a web-based interface to forec…
☆19Nov 17, 2024Updated last year
vv4t / mangadex-webhook
View on GitHub
A Google Script which sends discord webhook notifications on manga updates.
☆10Feb 13, 2024Updated 2 years ago
CR-DSiL / predictive-incident-management
View on GitHub
Predictive Incident Management analyses large data sets to identify risk patterns, predict outcomes, and guide teams on effective decisio…
☆16Nov 25, 2022Updated 3 years ago
c-hebert / MecaDRIL
View on GitHub
Jupyter notebooks for the teaching of mechanics
☆11Oct 8, 2024Updated last year
edumucelli / spotify-worldwide-ranking
View on GitHub
Automate data collection from Spotify's worldwide ranking in 50+ countries
☆25May 3, 2020Updated 6 years ago
multimedia-berkeley / OriSet
View on GitHub
Dataset with images of origami.
☆17Jan 15, 2021Updated 5 years ago
dominikhei / Local-Data-LakeHouse
View on GitHub
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…
☆82Sep 2, 2023Updated 2 years ago
Thomas-George-T / Ecommerce-Data-MLOps
View on GitHub
End-to-End deployment of E-commerce customers segmentation using Clustering Machine learning algorithms in Google Cloud Platform and MLOp…
☆21Jun 5, 2024Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
nsidn98 / PyThor
View on GitHub
Template for projects in PyTorch powered with PyTorch Lightning, Telegrad and MLflow. Get updates on mobile and streamline PyTorch code f…
☆10May 1, 2023Updated 3 years ago
fullstorydev / solr-bench
View on GitHub
Solr benchmarking and load testing harness
☆16Jan 7, 2025Updated last year
mrn-aglic / apache-iceberg-data-exploration
View on GitHub
☆23Feb 5, 2024Updated 2 years ago
nhat2008 / vietnam-ecommerce-crawler
View on GitHub
Crawling the data from lazada, websosanh, compare.vn, cdiscount and cungmua with flexible configs
☆30Jul 7, 2016Updated 10 years ago
Marlowess / spark-exercises
View on GitHub
Some exercises to learn Spark. Solved in Python.
☆21Oct 15, 2024Updated last year
tmtuan04 / auto-fill-hust
View on GitHub
Extension giúp ta tự động điền form Quy chế + Pháp Luật - HUST
☆23May 26, 2026Updated last month
fatchur / Simple-Tensor
View on GitHub
A simplification of Tensorflow Tensor Operations
☆21Dec 2, 2022Updated 3 years ago
keeganhines / computationalStatistics
View on GitHub
☆14Jan 14, 2017Updated 9 years ago
KeithGalli / lego-analysis
View on GitHub
☆19Feb 28, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
nowke / megamart
View on GitHub
A Retail store management system - DBMS project (Sep 2015) written in Django (Python)
☆11Aug 11, 2020Updated 5 years ago
acoiman / wildfire_modeling
View on GitHub
Wildfire Modeling in Yosemite National Park
☆19Apr 14, 2025Updated last year
ccampo133 / maxheap
View on GitHub
Python implementation of binary max-heaps.
☆11Mar 22, 2020Updated 6 years ago
Cocola6s6 / Chat2DB_rust
View on GitHub
参考 Chat2DB 的效果，使用 chatgpt 进行自然语言翻译，然后对数据库进行操作，使用 rust 语言实现的 web 应用。
☆10Jan 13, 2025Updated last year
anilkulkarni87 / airflow-docker
View on GitHub
This is my Apache Airflow Local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and …
☆34Feb 9, 2024Updated 2 years ago
syscalldev / FutureGPT
View on GitHub
⚡ FutureGPT - Application development framework that connects GPT-4 with external data, the internet, other applications and language mod…
☆13May 14, 2023Updated 3 years ago
annaymj / Python-Code
View on GitHub
☆13Jun 21, 2021Updated 5 years ago
ABZ-Aaron / coincap-api-pipeline
View on GitHub
A simple pipeline utilising cron, Postgres, AWS EC2, and Metabase
☆13Jul 9, 2024Updated 2 years ago
jw-ng / airflow-dynamic-dags
View on GitHub
☆12Mar 17, 2022Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
binodsuman / kafka-spark-integration
View on GitHub
Kafka and Spark Integration. Alll code in maven project.
☆14Jun 30, 2026Updated 3 weeks ago
caltechlibrary / sidewall
View on GitHub
Sidewall is a Python library for interacting with the Dimensions search API.
☆17Sep 11, 2024Updated last year
fjmatrix / gpt_assistant
View on GitHub
Web app designed to enhance your interaction with OpenAI's language models
☆12Jun 14, 2023Updated 3 years ago
davidlkl / Insurance-Pricing-Game
View on GitHub
1st place solution
☆23Apr 8, 2021Updated 5 years ago
mage-ai / machine_learning
View on GitHub
The definitive end-to-end machine learning (ML lifecycle) guide and tutorial for data engineers.
☆24Nov 14, 2024Updated last year
luatnc87 / modern-data-warehouse-modeling-and-data-quality-with-dbt-openmetadata
View on GitHub
This repository serves as a comprehensive guide to effective data modeling and robust data quality assurance using popular open-source to…
☆42Sep 20, 2023Updated 2 years ago
abyaadrafid / Deep-Reinforcement-Learning
View on GitHub
To keep track and showcase
☆16Aug 27, 2021Updated 4 years ago