hnawaz007 / datalake
open source data lake
☆12Updated 3 months ago
Alternatives and similar repositories for datalake:
Users that are interested in datalake are comparing it to the libraries listed below
- A Postgres data warehouse for processing synthetic data using IAC principles☆17Updated 2 years ago
- ☆16Updated last year
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- demo examples how to load data from different sources to different destinations☆20Updated 2 months ago
- A monorepo of many Rill example projects☆36Updated this week
- ☆42Updated last month
- Challenge Data Engineer☆25Updated 2 years ago
- I will be adding different kind of opensource data extraction tools code using python☆10Updated 5 months ago
- ☆11Updated 5 months ago
- A demo of the Mito Streamlit Spreadsheet☆18Updated last year
- Repo for CDC with debezium blog post☆28Updated 7 months ago
- ☆17Updated 8 months ago
- Guide for running a custom API Powered by Snowflake in Python☆21Updated 8 months ago
- Cloned by the `dbt init` task☆61Updated last year
- Analytics engineering with dbt - projects and developer environment☆18Updated 7 months ago
- build dw with dbt☆44Updated 6 months ago
- ☆10Updated 2 years ago
- ☆21Updated 2 years ago
- dbt / Amazon Redshift Demonstration Project☆34Updated 2 years ago
- Quick overview of duckdb, pandas and polars through a simple data pipeline.☆14Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset☆41Updated 5 months ago
- ☆17Updated 8 months ago
- Using Polars and Pandas on AWS Lambda to process data.☆9Updated last year
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.☆11Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB☆52Updated 8 months ago
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B…☆28Updated last month
- This repository contains code to build an MVP search engine with google like interface.☆15Updated 4 years ago
- Examples of using Evidently to evaluate, test and monitor ML models.☆23Updated 2 weeks ago
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflow…☆11Updated 2 years ago
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb☆20Updated last year