gabriel-garciae / one_billion_row_challenge_python
☆17 Updated 7 months ago
Alternatives and similar repositories for one_billion_row_challenge_python:
Users who are interested in one_billion_row_challenge_python are comparing it to the repositories listed below.
- End to end data engineering project ☆53 Updated 2 years ago
- ☆74 Updated 4 months ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆209 Updated last week
- In this repository we store all materials for dlt workshops, courses, etc. ☆113 Updated last month
- Code for dbt tutorial ☆151 Updated 8 months ago
- Code snippets for Data Engineering Design Patterns book ☆69 Updated 2 weeks ago
- ☆111 Updated 6 months ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆64 Updated 4 months ago
- build dw with dbt ☆36 Updated 3 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt ☆126 Updated 7 months ago
- Daily updated fake data for DBT learning and projects ☆32 Updated last year
- Code for my "Efficient Data Processing in SQL" book. ☆56 Updated 6 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset ☆47 Updated 3 months ago
- Sample project to demonstrate data engineering best practices ☆179 Updated 11 months ago
- Project for "Data pipeline design patterns" blog. ☆43 Updated 6 months ago
- ☆31 Updated last month
- Code to demonstrate data engineering metadata & logging best practices ☆16 Updated 11 months ago
- Example repo to create end to end tests for data pipeline. ☆22 Updated 8 months ago
- ☆119 Updated last week
- Code for "Advanced data transformations in SQL" free live workshop ☆71 Updated 3 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆49 Updated last year
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke… ☆18 Updated 5 months ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster ☆26 Updated 2 years ago
- an ephemeral project repo for the DU Dagster project ☆66 Updated this week
- Delta Lake examples ☆217 Updated 4 months ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase ☆124 Updated 2 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ☆75 Updated last year
- Just starting your DE journey or along the way already? I will be sharing a short list of DATA-ENGINEERING-CENTRED books that covers the… ☆34 Updated 2 years ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces. ☆63 Updated last year