omidvd79 / Big_Data_Demystified
Big Data Demystified meetup and blog examples
☆31 Updated last year
Alternatives and similar repositories for Big_Data_Demystified
Users who are interested in Big_Data_Demystified are comparing it to the repositories listed below
- Cloned by the `dbt init` task ☆62 Updated last year
- A repository of sample code to show data quality checking best practices using Airflow. ☆78 Updated 2 years ago
- Rules based grant management for Snowflake ☆41 Updated 6 years ago
- Full stack data engineering tools and infrastructure set-up ☆57 Updated 4 years ago
- Data lake, data warehouse on GCP ☆58 Updated 4 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆126 Updated 4 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆168 Updated 2 years ago
- The go to demo for public and private dbt Learn ☆81 Updated 9 months ago
- A bunch of hacks developed around dbt ☆48 Updated 6 years ago
- Data-aware orchestration with dagster, dbt, and airbyte ☆31 Updated 2 years ago
- A _simple_ starter template for Snowflake Cloud Data Platform ☆39 Updated 3 years ago
- Weekly Data Engineering Newsletter ☆96 Updated last year
- Data validation library for PySpark 3.0.0 ☆33 Updated 3 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup ☆90 Updated 4 years ago
- Execution of DBT models using Apache Airflow through Docker Compose ☆126 Updated 3 years ago
- Airflow training for the crunch conf ☆104 Updated 7 years ago
- ☆23 Updated 4 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python. ☆115 Updated last week
- Code snippets and tools published on the blog at lifearounddata.com ☆12 Updated 6 years ago
- Basic tutorial of using Apache Airflow ☆36 Updated 7 years ago
- Machine Learning in Snowflake ☆23 Updated 6 years ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and… ☆28 Updated 2 years ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ☆57 Updated 3 years ago
- ☆48 Updated 4 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online course ☆18 Updated 5 years ago
- Cost Efficient Data Pipelines with DuckDB ☆61 Updated 8 months ago
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da… ☆113 Updated 2 years ago
- ☆82 Updated 4 months ago
- Superset Quick Start Guide, published by Packt ☆56 Updated last year
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ☆91 Updated 2 years ago