MaterializeInc/datagen

Generate authentic-looking mock data based on a SQL, JSON, or Avro schema and produce it to Kafka in JSON or Avro format.

☆169

Alternatives and similar repositories for datagen

Users that are interested in datagen are comparing it to the libraries listed below.

cloudhut / owl-shop
☆41 · Mar 18, 2024 · Updated 2 years ago
conduktor / conduktor-gateway-demos
Demos using Conduktor Gateway
☆18 · Apr 11, 2024 · Updated 2 years ago
joacoc / antennas-manhattan
☆19 · Jun 19, 2024 · Updated 2 years ago
republicofdata-io / damn
The DAMN (Data Assets Metric Navigation) tool extracts and reports metrics about your data assets
☆11 · Dec 27, 2024 · Updated last year
david-streamlio / pulsar-nifi-bundle
NiFi Processor for Apache Pulsar
☆21 · Jul 15, 2026 · Updated last week
MaterializeInc / demos
Demos of Materialize, the operational data warehouse.
☆52 · Mar 5, 2025 · Updated last year
MaterializeInc / terraform-provider-materialize
A Terraform provider for Materialize
☆14 · Updated this week
linkedin / Hoptimator
Multi-hop declarative data pipelines
☆126 · Updated this week
confluentinc / kafka-connect-datagen
Connector that generates data for demos
☆50 · Updated this week
MaxHalford / taxi-demo-rp-mz-rv-rd-st
🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations
☆44 · Mar 6, 2023 · Updated 3 years ago
gabledata / recap
Work with your web service, database, and streaming schemas in a single format.
☆350 · Dec 30, 2025 · Updated 6 months ago
EladLeev / schema-registry-statistics
Schema Registry Statistics Tool
☆24 · Updated this week
streamthoughts / awesome-opensource-contribs-kafka
A list of all awesome open-source contributions for the Apache Kafka project
☆111 · Jul 10, 2023 · Updated 3 years ago
gordonmurray / cloudfloe
The Switzerland of Iceberg queries: neutral, easy entry across S3, R2, MinIO
☆22 · Apr 15, 2026 · Updated 3 months ago
kbastani / climate-change-analysis
This repository contains a recipe for bootstrapping a climate analysis application using Apache Pinot and Superset
☆20 · Sep 14, 2020 · Updated 5 years ago
leiysky / recall
☆15 · Feb 4, 2026 · Updated 5 months ago
10xfuturetechnologies / kafka-connect-iceberg
Kafka Connector for Iceberg tables
☆16 · Jul 24, 2023 · Updated 3 years ago
velascoluis / bigpato
☆26 · Nov 28, 2022 · Updated 3 years ago
silverton-io / buz
Serverless multi-protocol + multi-destination event collection system.
☆210 · Nov 24, 2024 · Updated last year
decodableco / dbt-decodable
A dbt adapter for Decodable
☆12 · Sep 4, 2025 · Updated 10 months ago
redpanda-data / redpanda-edge-agent
Lightweight internet of things agent that forwards events from the edge
☆31 · Mar 11, 2026 · Updated 4 months ago
ShadowTraffic / shadowtraffic-ai-context
AI artifacts to improve Claude, Cursor, etc's knowledge of ShadowTraffic.
☆21 · Updated this week
mthmulders / clocky
Time-related test utilities for Java
☆23 · Jul 1, 2026 · Updated 3 weeks ago
MaterializeInc / materialize
The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL
☆6,341 · Updated this week
MaterializeInc / rust-protobuf-native
Rust build system integration for protobuf, Google's data interchange format.
☆22 · Jan 9, 2026 · Updated 6 months ago
rayokota / kafka-connect-jsonata
Kafka Connect JSONata Transform
☆12 · Jun 12, 2026 · Updated last month
jwills / de4ml
Supporting materials/code examples for my course in data engineering for machine learning.
☆39 · Nov 15, 2022 · Updated 3 years ago
estuary / flow
🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, by managing your data flows wi…
☆953 · Updated this week
aws / aws-kinesisanalytics-runtime
This library contains the Kinesis Analytics stream processing runtime configuration classes.
☆11 · Jan 26, 2026 · Updated 6 months ago
implydata / druid-datagenerator
A data generator for Apache Druid
☆12 · Mar 26, 2025 · Updated last year
DataEngineeringLabs / parquet-benchmark
Benchmarks to read parquet to arrow
☆11 · Dec 25, 2022 · Updated 3 years ago
dmcquay / katas
☆13 · Aug 14, 2025 · Updated 11 months ago
ConduitIO / conduit
Conduit streams data between data stores. Kafka Connect replacement. No JVM required.
☆604 · Updated this week
filmaj / oss-contributors
How do tech companies rank amongst themselves when it comes to github.com activity?
☆17 · May 2, 2021 · Updated 5 years ago
hortonworks / data_analytics_studio
☆17 · Dec 7, 2022 · Updated 3 years ago
artifactable / cli
Send customized alerts for your dbt project with simple tags
☆10 · Jul 27, 2021 · Updated 4 years ago
hellofresh / impala-monitor
This is a simple Python daemon to monitor your Impala nodes.
☆10 · Apr 13, 2021 · Updated 5 years ago
ybyzek / kafka-github-actions
Example GitHub Actions for Apache Kafka client application development for local and Confluent Cloud
☆15 · Aug 1, 2022 · Updated 3 years ago
Aiven-Open / karapace
Karapace - Your Apache Kafka® essentials in one tool
☆623 · Updated this week