danielbeach / sniffer
CSV and flat-file sniffer built in Rust.
☆40 · Updated 7 months ago
Related projects:
- A Python package to help Databricks Unity Catalog users read and query Delta Lake tables with Polars, DuckDB, or PyArrow. ☆22 · Updated 5 months ago
- Code snippets for the Data Engineering Design Patterns book ☆27 · Updated this week
- A simple and easy-to-use Data Quality (DQ) tool built with Python. ☆45 · Updated last year
- Cost Efficient Data Pipelines with DuckDB ☆42 · Updated last month
- rust-for-data ☆42 · Updated last year
- ☆22 · Updated 2 months ago
- Code for my "Efficient Data Processing in SQL" book. ☆47 · Updated last month
- Full stack data engineering tools and infrastructure set-up ☆38 · Updated 3 years ago
- Fake Pandas / PySpark DataFrame creator ☆35 · Updated 6 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆156 · Updated this week
- Delta Lake Documentation ☆45 · Updated 3 months ago
- Utility functions for dbt projects running on Spark ☆30 · Updated 10 months ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ☆68 · Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆52 · Updated 5 months ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol. ☆21 · Updated 3 months ago
- ☆45 · Updated last month
- Sample project that uses Dagster, dbt, DuckDB, and Dash to visualize the Spanish car and motorcycle market ☆52 · Updated last year
- End-to-end data engineering project to get insights from PyPI using Python, DuckDB, MotherDuck & Evidence ☆122 · Updated last week
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows ☆41 · Updated 2 months ago
- ☆36 · Updated 2 weeks ago
- This is a public repository to go over all the LLM-driven data engineering concepts. ☆67 · Updated 11 months ago
- ☆15 · Updated 4 months ago
- Contribute to dlt verified sources 🔥 ☆64 · Updated this week
- A Table format agnostic data sharing framework ☆36 · Updated 7 months ago
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆72 · Updated 2 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆185 · Updated this week
- Delta Acceptance Testing ☆19 · Updated last month
- Delta Lake examples ☆201 · Updated 3 months ago
- Yet Another (Spark) ETL Framework ☆18 · Updated 10 months ago
- Data-aware orchestration with dagster, dbt, and airbyte ☆29 · Updated last year