polakowo / yelp-3nf
3NF-normalize Yelp data on S3 with Spark and load it into Redshift - automate the whole thing with Apache Airflow
☆12 · Updated 5 years ago
Alternatives and similar repositories for yelp-3nf:
Users who are interested in yelp-3nf are comparing it to the repositories listed below:
- Blog post on ETL pipelines with Airflow ☆23 · Updated 4 years ago
- A Scalable Data Cleaning Library for PySpark. ☆27 · Updated 6 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL" ☆38 · Updated 9 months ago
- ☆16 · Updated last year
- ☆23 · Updated 6 years ago
- Machine Learning in Snowflake ☆24 · Updated 5 years ago
- Data lake, data warehouse on GCP ☆56 · Updated 3 years ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in … ☆21 · Updated 2 years ago
- AWS Big Data Certification ☆25 · Updated 3 months ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster ☆27 · Updated 2 years ago
- ELT Code for your Data Warehouse ☆26 · Updated last year
- ☆16 · Updated 3 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered… ☆16 · Updated 5 years ago
- Code snippets and tools published on the blog at lifearounddata.com ☆12 · Updated 5 years ago
- From the medium article about Customer Retention ☆11 · Updated 5 years ago
- A Python API for Asynchronously Loading Data into Snowflake DB ☆63 · Updated 5 months ago
- Public source code for the Batch Processing with Apache Beam (Python) online course ☆18 · Updated 4 years ago
- Spark NLP for Streamlit ☆15 · Updated 3 years ago
- Fivetran data models for QuickBooks using dbt. ☆28 · Updated 3 months ago
- Big Data Demystified meetup and blog examples ☆31 · Updated 8 months ago
- dbt data models for facebook ads ☆40 · Updated 4 months ago
- Customer 360 analytics powered by MapR ☆23 · Updated 2 years ago
- Finance 🏦 Data Builder 🛠️ @ postgres 🐘 ☆21 · Updated 4 years ago
- Full stack data engineering tools and infrastructure set-up ☆51 · Updated 4 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation. ☆37 · Updated 5 years ago
- A few end to end examples that use data-describe ☆16 · Updated last year
- A _simple_ starter template for Snowflake Cloud Data Platform ☆39 · Updated 3 years ago
- #DataPipeLine #ETL - A Facebook data extraction utility that extracts the publicly available data on Facebook. Used Facebook Grap… ☆14 · Updated 6 years ago
- ☆33 · Updated last year
- A repo to track data engineering projects ☆13 · Updated 2 years ago