Generate authentic-looking mock data based on a SQL, JSON, or Avro schema and produce it to Kafka in JSON or Avro format.
☆169 Sep 13, 2025 Updated 5 months ago
Alternatives and similar repositories for datagen
Users who are interested in datagen are comparing it to the libraries listed below.
- ☆41 Mar 18, 2024 Updated last year
- The DAMN (Data Assets Metric Navigation) tool extracts and reports metrics about your data assets ☆11 Dec 27, 2024 Updated last year
- Mockingbird is a mock streaming data generator ☆135 Feb 6, 2025 Updated last year
- A Terraform provider for Materialize ☆14 Feb 24, 2026 Updated last week
- ☆26 Nov 28, 2022 Updated 3 years ago
- Demos of Materialize, the operational data warehouse. ☆52 Mar 5, 2025 Updated 11 months ago
- Multi-hop declarative data pipelines ☆124 Feb 25, 2026 Updated last week
- Schema Registry Statistics Tool ☆24 Updated this week
- ☆19 Jun 19, 2024 Updated last year
- Serverless multi-protocol + multi-destination event collection system. ☆210 Nov 24, 2024 Updated last year
- Kafka Connect JSONata Transform ☆12 Feb 24, 2025 Updated last year
- titan: a package manager for Snowflake DB ☆23 Oct 3, 2022 Updated 3 years ago
- A list of all awesome open-source contributions for the Apache Kafka project ☆110 Jul 10, 2023 Updated 2 years ago
- Supporting materials/code examples for my course in data engineering for machine learning. ☆39 Nov 15, 2022 Updated 3 years ago
- Minimalistic package for handling Go errors in an easy way ☆13 Oct 31, 2024 Updated last year
- Server to view ClickHouse profiler data in speedscope.app ☆11 Dec 24, 2019 Updated 6 years ago
- Benchmarks to read parquet to arrow ☆11 Dec 25, 2022 Updated 3 years ago
- This is a simple Python daemon to monitor your Impala nodes. ☆10 Apr 13, 2021 Updated 4 years ago
- Maven extension to find slow tests ☆25 Sep 19, 2022 Updated 3 years ago
- Work with your web service, database, and streaming schemas in a single format. ☆351 Dec 30, 2025 Updated 2 months ago
- Memorable Unique Identifier ☆13 Sep 29, 2022 Updated 3 years ago
- Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect ☆13 Oct 9, 2024 Updated last year
- ☆18 Feb 13, 2026 Updated 2 weeks ago
- Extracts TypeScript types from GraphQL string literals ☆12 Mar 25, 2022 Updated 3 years ago
- 🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow. 🌊 ☆885 Updated this week
- ☆81 Apr 23, 2025 Updated 10 months ago
- This is a collection of Amazon CDK projects to show how to directly ingest streaming data from Amazon Managed Service for Apache Kafka (M… ☆15 Sep 10, 2024 Updated last year
- A dbt adapter for Decodable ☆12 Sep 4, 2025 Updated 6 months ago
- Ultra-high-performance local IPC framework with Zipkin tracing to conduct a beautiful symphony of (brotherhood) build tooling. ☆10 Jan 8, 2021 Updated 5 years ago
- A simple playground for dbt with the sqlite connector ☆12 May 22, 2022 Updated 3 years ago
- ☆13 Aug 14, 2025 Updated 6 months ago
- Cloud Resource and Infrastructure-as-Code Generator (or CRAIG) allows users to generate Terraform to create a fully customizable environm… ☆20 Updated this week
- Example GitHub Actions for Apache Kafka client application development for local and Confluent Cloud ☆15 Aug 1, 2022 Updated 3 years ago
- Conduit streams data between data stores. Kafka Connect replacement. No JVM required. ☆580 Updated this week
- Package to assert rows in-line with dbt macros. ☆70 Nov 25, 2025 Updated 3 months ago
- The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL ☆6,239 Updated this week
- Karapace - Your Apache Kafka® essentials in one tool ☆595 Updated this week
- Vision analytics solution using Dataflow and Vision AI ☆16 May 4, 2024 Updated last year
- Configuration for sql.clickhouse.com ☆19 Jan 5, 2026 Updated last month