hnawaz007 / datalakeLinks
open source data lake
☆25Updated 9 months ago
Alternatives and similar repositories for datalake
Users that are interested in datalake are comparing it to the libraries listed below
Sorting:
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- Data engineering with dbt, published by Packt☆87Updated last month
- A Postgres data warehouse for processing synthetic data using IAC principles☆19Updated 2 years ago
- Cloned by the `dbt init` task☆62Updated last year
- ☆72Updated this week
- ☆88Updated 3 years ago
- Data Engineering with Spark and Delta Lake☆104Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB☆57Updated 5 months ago
- Data Engineering with Scala, published by Packt☆26Updated last year
- ☆10Updated 3 years ago
- ☆20Updated last year
- Code for my "Efficient Data Processing in SQL" book.☆59Updated last year
- build dw with dbt☆47Updated last year
- Repo for CDC with debezium blog post☆29Updated last year
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers☆24Updated 3 years ago
- Code snippets for Data Engineering Design Patterns book☆249Updated 7 months ago
- Delta Lake Documentation☆50Updated last year
- Challenge Data Engineer☆25Updated 3 years ago
- ☆21Updated 2 years ago
- The Ultimate Guide to Snowpark, published by Packt☆14Updated last year
- Execution of DBT models using Apache Airflow through Docker Compose☆121Updated 2 years ago
- Data Engineering with Google Cloud Platform, published by Packt☆119Updated 2 years ago
- Code for "Advanced data transformations in SQL" free live workshop☆85Updated 5 months ago
- Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand☆56Updated 2 years ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆41Updated last year
- Apache Airflow Best Practices, published by Packt☆49Updated 11 months ago
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 3 years ago
- Resources for video demonstrations and blog posts related to DataOps on AWS☆182Updated 3 years ago
- PipeRider dbt workshop for DataTalksClub DE Zoomcamp☆18Updated last year
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆90Updated 2 years ago