danielbeach / datahobbit
A Rust based data/CSV/Parquet file generator
☆22Updated this week
Related projects ⓘ
Alternatives and complementary repositories for datahobbit
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆22Updated 7 months ago
- csv and flat-file sniffer built in Rust.☆42Updated 9 months ago
- Utility functions for dbt projects running on Spark☆31Updated last year
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆60Updated last year
- A simple and easy to use Data Quality (DQ) tool built with Python.☆48Updated last year
- The shared semantic layer definitions that dbt-core and MetricFlow use.☆74Updated last week
- Unity Catalog UI☆39Updated 2 months ago
- Delta Lake Documentation☆46Updated 5 months ago
- The dbt adapter for Firebolt☆29Updated 2 months ago
- Delta reader for the Ray open-source toolkit for building ML applications☆43Updated 9 months ago
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆55Updated last year
- Delta Lake helper methods. No Spark dependency.☆22Updated 2 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆37Updated 2 weeks ago
- Data-aware orchestration with dagster, dbt, and airbyte☆30Updated last year
- Code snippets for Data Engineering Design Patterns book☆40Updated last week
- dbt starter code for enterprise Snowflake usage data artifacts☆22Updated 2 years ago
- PySpark schema generator☆38Updated last year
- ☆18Updated last year
- Pytest plugin for dbt core☆58Updated 5 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆174Updated this week
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆37Updated this week
- ☆66Updated last month
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence☆149Updated last week
- Write your dbt models using Ibis☆53Updated last month
- Fake Pandas / PySpark DataFrame creator☆42Updated 8 months ago
- Pythonic Iceberg REST Catalog☆67Updated 2 months ago
- A write-audit-publish implementation on a data lake without the JVM☆41Updated 3 months ago
- The smallest DuckDB SQL orchestrator on Earth.☆178Updated 2 months ago
- ✨ A Pydantic to PySpark schema library☆57Updated this week
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆62Updated last month