hnawaz007 / datalakeLinks
open source data lake
☆23Updated 7 months ago
Alternatives and similar repositories for datalake
Users that are interested in datalake are comparing it to the libraries listed below
Sorting:
- A Postgres data warehouse for processing synthetic data using IAC principles☆18Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆56Updated 4 years ago
- Data engineering with dbt, published by Packt☆85Updated last year
- ☆88Updated 2 years ago
- ☆21Updated 2 years ago
- ☆44Updated last year
- Cloned by the `dbt init` task☆61Updated last year
- Python ETL demo for Hackforge☆32Updated last year
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago
- ☆49Updated this week
- A guide to show you how to import data for ETL☆21Updated 2 years ago
- Simple samples for writing ETL transform scripts in Python☆24Updated 3 weeks ago
- Challenge Data Engineer☆25Updated 3 years ago
- ☆10Updated 3 years ago
- Code for "Advanced data transformations in SQL" free live workshop☆83Updated 3 months ago
- This is my Apache Airflow Local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and …☆32Updated last year
- New generation opensource data stack☆71Updated 3 years ago
- ☆12Updated 3 years ago
- DataTalks Workshop Materials☆19Updated last year
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆28Updated 3 years ago
- A Series of Notebooks on how to start with Kafka and Python☆152Updated 6 months ago
- A few end to end examples that use data-describe☆16Updated 2 years ago
- PySpark Cheatsheet☆36Updated 2 years ago
- ☆21Updated 2 years ago
- Repository for Apache Spark course at Team Data Science☆16Updated 4 years ago
- Recohut - Learn data engineering, data science☆99Updated 2 years ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆90Updated 3 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆42Updated last year
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers☆24Updated 3 years ago
- Delta Lake Documentation☆49Updated last year