mattmartin14 / dream_machine
☆80Jan 28, 2026Updated 2 weeks ago
Alternatives and similar repositories for dream_machine
Users that are interested in dream_machine are comparing it to the libraries listed below
- SQLMesh example projects☆39Jul 2, 2025Updated 7 months ago
- A Rust based data/CSV/Parquet file generator☆64Mar 3, 2025Updated 11 months ago
- Manage Unity Catalog tables with Pydantic Models☆10Mar 5, 2025Updated 11 months ago
- The Data Product Specification☆11Jan 28, 2025Updated last year
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆27Mar 25, 2024Updated last year
- ☆14Jul 26, 2022Updated 3 years ago
- an end-to-end data pipeline extracting music listening habits and producing an insightful dashboard☆17Mar 31, 2024Updated last year
- Example project for building scalable data pipelines with Kedro and Ibis.☆13Dec 10, 2025Updated 2 months ago
- Data Agents are intelligent assistants built by data engineers to help non-data professionals navigate the organization’s data infrastruc…☆19Apr 14, 2025Updated 10 months ago
- Demo of the DuckDB Spark API implementation: same PySpark code, but DuckDB under the hood☆15Nov 16, 2023Updated 2 years ago
- This repository contains coding interviews that I have encountered in company interviews☆12Oct 2, 2020Updated 5 years ago
- A platform to manage the data product life cycle☆22Updated this week
- Converts a Zeppelin notebook written in a single programming language to the corresponding script☆18Feb 16, 2020Updated 5 years ago
- FUSE-based DuckDB file system 🦆☆49Jun 16, 2025Updated 7 months ago
- ☆156Feb 6, 2026Updated last week
- various wrappers and functions over dictionary to add functionalities.☆13Apr 14, 2025Updated 10 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆55Oct 13, 2025Updated 4 months ago
- Filter lines from standard input according to some probability, with a given delay, and for a certain duration.☆26Feb 17, 2023Updated 2 years ago
- This Repository contains the material for the tutorial "Introduction to MLOps with MLflow" held at pyData/pyCon Berlin 2022.☆21Apr 8, 2022Updated 3 years ago
- A flake8 plugin that detects usage of withColumn in a loop or inside reduce☆28Jun 20, 2025Updated 7 months ago
- DuckDB DuckLake Demos☆37Jun 1, 2025Updated 8 months ago
- Delta Lake examples☆238Oct 8, 2024Updated last year
- OpsCenter for Snowflake makes it easy to understand and manage your Snowflake consumption☆24May 15, 2024Updated last year
- Full stack data engineering tools and infrastructure set-up☆57Feb 13, 2021Updated 5 years ago
- ☆13Feb 15, 2025Updated last year
- Python package for querying iceberg data through duckdb.☆73Feb 12, 2024Updated 2 years ago
- Flood Mapping Intercomparison☆15Nov 5, 2025Updated 3 months ago
- My tutorial on SQLAlchemy for Pydata London 2022 Conference☆25Jun 21, 2022Updated 3 years ago
- A GitHub Action for running Elementary.☆30Feb 4, 2024Updated 2 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆29Dec 9, 2024Updated last year
- ☆178May 21, 2025Updated 8 months ago
- python library for iceberg lake house on your local☆14Jan 8, 2026Updated last month
- ☆11Oct 6, 2023Updated 2 years ago
- Demonstration of using Files in Repos with Databricks Delta Live Tables☆36Jul 9, 2024Updated last year
- ☆30Jul 2, 2024Updated last year
- A set of decks and notebooks with exercises for use in a hands-on causal inference tutorial session☆32Jul 15, 2022Updated 3 years ago
- Delta Lake helper methods in PySpark☆327Jan 19, 2026Updated 3 weeks ago
- 🥪💾 A sample of data from the `jaffle-shop-generator` that powers the Jaffle Shop spanning one year.☆14Jan 23, 2025Updated last year
- Go based Open Source Scheduler Service☆16Aug 26, 2025Updated 5 months ago