data-catering / data-caterer
Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.
☆47Updated 3 weeks ago
Alternatives and similar repositories for data-caterer:
Users that are interested in data-caterer are comparing it to the libraries listed below
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.☆79Updated this week
- Unity Catalog UI☆39Updated 5 months ago
- A DataOps framework for building a lakehouse.☆42Updated this week
- A Table format agnostic data sharing framework☆38Updated last year
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Updated 8 months ago
- A write-audit-publish implementation on a data lake without the JVM☆46Updated 6 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated last week
- dbt (data build tool) adapter for the Dremio☆49Updated last week
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆92Updated this week
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10Updated last year
- Delta lake and filesystem helper methods☆50Updated 11 months ago
- Delta Lake Documentation☆48Updated 8 months ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆131Updated last month
- A Minimalistic Rust Implementation of Delta Sharing Server.☆83Updated this week
- The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆☆157Updated this week
- The Data Product Descriptor Specification (DPDS) Repository☆77Updated last month
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆81Updated last month
- Sample configuration to deploy a modern data platform.☆87Updated 3 years ago
- Data Tools Subjective List☆83Updated last year
- The Modern Data Stack in a Python package☆49Updated last year
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆199Updated last week
- A Python Library to support running data quality rules while the spark job is running⚡☆172Updated this week
- ☆82Updated last year
- Discover the simplicity and strength of Duckdb, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution …☆33Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆213Updated this week
- Quick Guides from Dremio on Several topics☆67Updated last month
- dbt's adapter for dremio☆48Updated 2 years ago