MaterializeInc / datagen
Generate authentic-looking mock data based on a SQL, JSON, or Avro schema and produce it to Kafka in JSON or Avro format.
☆154 · Updated 2 months ago
Alternatives and similar repositories for datagen:
Users who are interested in datagen are comparing it to the repositories listed below:
- Schema modelling framework for decentralised domain-driven ownership of data. ☆250 · Updated last year
- Data Tools Subjective List ☆83 · Updated last year
- Multi-hop declarative data pipelines ☆109 · Updated this week
- A DuckDB-powered command line interface for Snowflake security, governance, operations, and cost optimization. ☆39 · Updated 5 months ago
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆318 · Updated last year
- Work with your web service, database, and streaming schemas in a single format. ☆337 · Updated 10 months ago
- Example Dagster Cloud code for the Hooli Data Engineering organization. ☆80 · Updated last month
- Open Control Plane for Tables in Data Lakehouse ☆323 · Updated this week
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com) ☆226 · Updated last month
- The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆 ☆157 · Updated this week
- Delta Lake Documentation ☆48 · Updated 7 months ago
- A Table format agnostic data sharing framework ☆38 · Updated last year
- In-Memory Analytics for Kafka using DuckDB ☆89 · Updated this week
- Adapter for dbt that executes dbt pipelines on Apache Flink ☆90 · Updated 10 months ago
- Pythonic Iceberg REST Catalog ☆72 · Updated 5 months ago
- CLI tool to bulk migrate the tables from one catalog to another without a data copy ☆75 · Updated this week
- Make dbt docs and Apache Superset talk to one another ☆137 · Updated last month
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆76 · Updated 4 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset ☆213 · Updated 3 weeks ago
- ☆209 · Updated this week
- A curated list of awesome blogs, videos, tools and resources about Data Contracts ☆171 · Updated 6 months ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake ☆226 · Updated this week
- Sample configuration to deploy a modern data platform. ☆87 · Updated 3 years ago
- A playground for running duckdb as a stateless query engine over a data lake ☆184 · Updated last year
- A highly efficient daemon for streaming data from Kafka into Delta Lake ☆388 · Updated 3 weeks ago
- ☆79 · Updated last year
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines. ☆79 · Updated this week
- Apache Hive Metastore as a Standalone server in Docker ☆68 · Updated 5 months ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started! ☆62 · Updated 2 years ago
- The Open-Source Enterprise Data Platform in a single Portal ☆231 · Updated this week