Đồ án tốt nghiệp | Data Lakehouse
☆42Feb 9, 2026Updated 2 months ago
Alternatives and similar repositories for DataLakeHouse
Users that are interested in DataLakeHouse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆40Dec 15, 2025Updated 4 months ago
- This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spar…☆43Apr 22, 2023Updated 3 years ago
- End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API…☆21Jul 26, 2024Updated last year
- This Repo contains Jupyter Notebooks to recap on RDD, DataFrame, Spark Streaming and ML operations using Pyspark☆11Nov 3, 2024Updated last year
- End-to-end data engineering pipeline with various technologies to ingest real time data.☆26Nov 3, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This GitHub repository contains all the code and Jupyter notebooks accompanying the book "Building Medallion Architectures," offering pra…☆42Sep 20, 2025Updated 7 months ago
- This is a demo project to compare two web scrapping frameworks, Playwright and Selenium and using the new Pipelining tool Dagster☆15Sep 9, 2021Updated 4 years ago
- On-premises ELT Pipeline☆31Jul 10, 2025Updated 9 months ago
- Airflow helm chart for AWS EKS☆20Jan 18, 2021Updated 5 years ago
- TrafficAdvisor: a Real-Time Traffic Monitoring System☆14Sep 10, 2018Updated 7 years ago
- MLOps Implementation for Disaster Tweets Classifier Application☆24Mar 24, 2024Updated 2 years ago
- the full pipeline for model retraining with fastapi and github actions☆16Jul 5, 2024Updated last year
- open source data lake☆32Jan 17, 2025Updated last year
- A testing ground for Plotly Dash app development including app features and experimenting with dashboard visualizations.☆10Oct 15, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- NSCollectionView sample for OS X 10.11 ElCapitan☆12Nov 24, 2017Updated 8 years ago
- ☆10Feb 2, 2024Updated 2 years ago
- Nyc_Taxi_Data_Pipeline - DE Project☆140Oct 21, 2024Updated last year
- ☆13Sep 23, 2023Updated 2 years ago
- ☆10Sep 29, 2022Updated 3 years ago
- um its my portfolio?☆16Feb 10, 2026Updated 2 months ago
- Jupyter notebook with the code of a probabilistic neural network in PyTorch☆12Jan 17, 2020Updated 6 years ago
- ☆16Feb 11, 2026Updated 2 months ago
- ☆16Nov 27, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore,…☆48Oct 14, 2024Updated last year
- ☆11Jan 31, 2019Updated 7 years ago
- 구글의 지식 그래프 서비스 아키텍쳐를 직접 구현해보는 것을 목표로 한다.☆10Jan 10, 2019Updated 7 years ago
- Minimum Energy Resource Allocation Strategy with partial offloading☆10Jan 17, 2022Updated 4 years ago
- Matlab scripts for the paper "Machine Learning meets Stochastic Geometry: Determinantal Subset Selection for Wireless Networks"☆12May 4, 2019Updated 6 years ago
- Stock Advisor☆12Jun 13, 2025Updated 10 months ago
- ☆29Oct 24, 2024Updated last year
- Chroma maintenance CLI☆16Aug 15, 2025Updated 8 months ago
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Alpaca Brokerage Implementation☆13Updated this week
- This project demonstrates how to integrate DuckLake, SQLMesh, and Neon PostgreSQL to create a modern data lakehouse architecture with ver…☆28Jun 3, 2025Updated 10 months ago
- Julia package for finding robust shortest paths☆19Feb 21, 2021Updated 5 years ago
- Official Code of Decoupled Graph Convolution (DGC)☆16Jan 31, 2026Updated 3 months ago
- A jupyter kernel for GoPlus☆23Mar 4, 2021Updated 5 years ago
- A fully featured banking API built with FastAPI,Docker,Celery,Redis,RabbitMQ with an AI/ML transaction analysis and fraud detection syste…☆21Sep 4, 2025Updated 7 months ago
- A custom end-to-end analytics platform for customer churn☆11May 15, 2025Updated 11 months ago