h3xagn / blog-build-etl-pipeline
Data pipeline from device to cloud
☆12 Updated 3 years ago
Alternatives and similar repositories for blog-build-etl-pipeline
Users who are interested in blog-build-etl-pipeline are comparing it to the libraries listed below
- ☆11 Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset ☆54 Updated 3 weeks ago
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has… ☆100 Updated last year
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆225 Updated last week
- An example repository showing how to leverage Kafka to stream your data ☆21 Updated last year
- ☆103 Updated last week
- End-to-end data engineering project to get insights from PyPI using Python, DuckDB, MotherDuck & Evidence ☆226 Updated 3 weeks ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset ☆251 Updated last month
- Slow & local data allows you to move fast and deliver business value for the 99.9% of the data challenges. ☆315 Updated last month
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆50 Updated 2 years ago
- Swiple enables you to easily observe, understand, validate and improve the quality of your data ☆84 Updated this week
- ☆20 Updated last year
- Project for "Data pipeline design patterns" blog. ☆46 Updated last year
- Example app using FastAPI, asyncio, SQLModel, Celery, Alembic and Supertokens ☆104 Updated last year
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle… ☆119 Updated 7 months ago
- The Open-Source Enterprise Data Platform in a single Portal ☆259 Updated this week
- ☆38 Updated 7 months ago
- Deploy multiple Dagster data pipelines on Docker environment ☆23 Updated last year
- A Series of Notebooks on how to start with Kafka and Python ☆152 Updated 8 months ago
- New generation open-source data stack ☆74 Updated 3 years ago
- This project is a backend template for a FastAPI-based application that uses the repository pattern approach to provide an abstraction la… ☆47 Updated 2 years ago
- This is a demo streaming project simulating a music streaming service. ☆34 Updated last year
- ☆211 Updated 9 months ago
- An extendable async API using FastAPI, SQLModel, PostgreSQL and Redis. ☆225 Updated 2 months ago
- A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation an… ☆23 Updated last year
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev ☆36 Updated 5 months ago
- A fully functional FastAPI application that acts as a marketplace for cleaners and potential cleaning jobs. ☆223 Updated 2 years ago
- Data warehouse tech stack with PostgreSQL, DBT and Airflow ☆18 Updated 3 years ago
- All things awesome related to Dagster! ☆129 Updated 3 weeks ago
- fastapi-crons is a FastAPI extension for running cron jobs and background tasks in a clean, reliable way with async support and syntax ju… ☆74 Updated last week