DucAnhNTT / bigdata-ETL-pipelineLinks
The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a complete data pipeline with all components seamlessly set up and ready to use
☆17Updated last year
Alternatives and similar repositories for bigdata-ETL-pipeline
Users that are interested in bigdata-ETL-pipeline are comparing it to the libraries listed below
Sorting:
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset☆46Updated last year
- A Postgres data warehouse for processing synthetic data using IAC principles☆19Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- ☆44Updated last year
- Cost Efficient Data Pipelines with DuckDB☆60Updated 7 months ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆43Updated last year
- ETL to scrape a real estate website, process house prices and data, and build an ML model of the house prices.☆16Updated 3 years ago
- Delta-Lake, ETL, Spark, Airflow☆48Updated 3 years ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.