data-catering / data-catererLinks
Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.
☆69Updated last week
Alternatives and similar repositories for data-caterer
Users that are interested in data-caterer are comparing it to the libraries listed below
Sorting:
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.☆135Updated last week
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team …☆127Updated this week
- Unity Catalog UI☆43Updated last year
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆167Updated last month
- Python package for querying iceberg data through duckdb.☆70Updated last year
- Open Control Plane for Tables in Data Lakehouse☆370Updated last week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated 2 weeks ago
- The Open-Source Enterprise Data Platform in a single Portal☆260Updated this week
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit…☆64Updated this week
- The Data Product Descriptor Specification (DPDS) Repository☆80Updated 8 months ago
- ODD Specification is a universal open standard for collecting metadata.☆143Updated 11 months ago
- ☆39Updated 5 months ago
- A write-audit-publish implementation on a data lake without the JVM☆46Updated last year
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆165Updated 3 weeks ago
- ☆59Updated 5 months ago
- A DuckDB-powered command line interface for Snowflake security, governance, operations, and cost optimization.☆40Updated last year
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆76Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data.☆259Updated last year
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆59Updated this week
- Pushdown compute from Snowflake to DuckDB running on your infrastructure☆192Updated this week
- Data product portal created by Dataminded☆190Updated this week
- A platform to manage the data product life cycle☆19Updated this week
- A Table format agnostic data sharing framework☆39Updated last year
- Quickstart for any service☆161Updated this week
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- A curated list of dagster code snippets for data engineers☆56Updated last year
- dbt's adapter for dremio☆48Updated 2 years ago
- Enables Python developers to leverage Debezium's CDC capabilities with custom event handlers and seamless integration.☆35Updated 3 weeks ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10Updated 2 years ago
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀☆105Updated last week