fraibacas / lakehouse-poc
Run an open-source data LakeHouse locally using Docker Compose
☆11Updated 11 months ago
Alternatives and similar repositories for lakehouse-poc
Users that are interested in lakehouse-poc are comparing it to the libraries listed below
Sorting:
- dlt-dagster-demo☆11Updated last year
- Full stack data engineering tools and infrastructure set-up☆53Updated 4 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆147Updated this week
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆11Updated 11 months ago
- duckdb-etl-framework☆11Updated 4 months ago
- Automate and streamline the alerting & notification process for dbt test results🐞🚀☆17Updated 3 weeks ago
- learning-by-doing data model built with dbt-core☆13Updated 5 months ago
- Cost Efficient Data Pipelines with DuckDB☆52Updated this week
- API for distributing Data Lake Data☆11Updated last month
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆69Updated last year
- build dw with dbt☆44Updated 6 months ago
- A template DBT project for BigQuery on Google Cloud☆12Updated 4 years ago
- Discover the simplicity and strength of Duckdb, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution …☆34Updated last year
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- ☆18Updated 9 months ago
- Contribute to dlt verified sources 🔥☆85Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆50Updated 6 months ago
- analyse your electricity usage data from Belgian smart meters with dbt, duckdb and evidence☆20Updated 2 weeks ago
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆74Updated last year
- Data models for Hubspot built using dbt.☆35Updated last month
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit…☆56Updated this week
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb☆20Updated last year
- lakefs-samples repository☆81Updated 3 weeks ago
- Utility functions for dbt projects running on Spark☆34Updated 3 months ago
- ☆11Updated 6 months ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in …☆21Updated 2 years ago
- Repo for orienting dbt users to the Dagster asset framework☆54Updated 2 years ago
- SQL query executor on remote DuckDB instance using Apache Arrow Flight RPC through Streamlit Web interface.☆14Updated 6 months ago
- Repo for CDC with debezium blog post☆28Updated 8 months ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 9 months ago