omidvd79 / Big_Data_DemystifiedLinks
Big Data Demystified meetup and blog examples
☆31Updated 11 months ago
Alternatives and similar repositories for Big_Data_Demystified
Users that are interested in Big_Data_Demystified are comparing it to the libraries listed below
Sorting:
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 4 years ago
- ☆48Updated 3 years ago
- A _simple_ starter template for Snowflake Cloud Data Platform☆39Updated 3 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆169Updated last year
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and…☆28Updated 2 years ago
- Data-aware orchestration with dagster, dbt, and airbyte☆30Updated 2 years ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆86Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆54Updated 4 years ago
- The go to demo for public and private dbt Learn☆80Updated 4 months ago
- Rules based grant management for Snowflake☆40Updated 6 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- Airflow workflow management platform chef cookbook.☆71Updated 6 years ago
- Cloned by the `dbt init` task☆61Updated last year
- A bunch of hacks developed around dbt☆48Updated 5 years ago
- ☆23Updated 4 years ago
- Execution of DBT models using Apache Airflow through Docker Compose☆118Updated 2 years ago
- scaffold of Apache Airflow executing Docker containers☆85Updated 2 years ago
- ☆78Updated this week
- Composable filesystem hooks and operators for Apache Airflow.☆17Updated 4 years ago
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- A guide for leading a data (engineering) team☆64Updated last year
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Updated 2 years ago
- ☆96Updated 2 years ago
- An example mini data warehouse for python project stats, template for new projects☆179Updated 5 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆110Updated this week
- ☆111Updated 7 months ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- Official dbt adapter for Vertica☆26Updated last month