Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.
☆169Sep 13, 2025Updated 9 months ago
Alternatives and similar repositories for datagen
Users that are interested in datagen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆40Mar 18, 2024Updated 2 years ago
- Demos using Conduktor Gateway☆18Apr 11, 2024Updated 2 years ago
- ☆19Jun 19, 2024Updated 2 years ago
- The DAMN (Data Assets Metric Navigation) tool extracts and reports metrics about your data assets☆11Dec 27, 2024Updated last year
- CLI for scraping, querying and visualizing Prometheus metrics.☆18Jun 18, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Terraform provider for Materialize☆14Updated this week
- Multi-hop declarative data pipelines☆127Updated this week
- Schema Registry Statistics Tool☆24Jun 20, 2026Updated 2 weeks ago
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations☆44Mar 6, 2023Updated 3 years ago
- This repository contains a recipe for bootstrapping a climate analysis application using Apache Pinot and Superset☆20Sep 14, 2020Updated 5 years ago
- A list of all awesome open-source contributions for the Apache Kafka project☆111Jul 10, 2023Updated 2 years ago
- ☆26Nov 28, 2022Updated 3 years ago
- Kafka Connector for Iceberg tables☆16Jul 24, 2023Updated 2 years ago
- Serverless multi-protocol + multi-destination event collection system.☆210Nov 24, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Work with your web service, database, and streaming schemas in a single format.☆349Dec 30, 2025Updated 6 months ago
- Lightweight internet of things agent that forwards events from the edge☆31Mar 11, 2026Updated 3 months ago
- A dbt adapter for Decodable☆12Sep 4, 2025Updated 10 months ago
- The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL☆6,324Updated this week
- Rust build system integration for protobuf, Google's data interchange format.☆22Jan 9, 2026Updated 5 months ago
- Kafka Connect JSONata Transform☆12Jun 12, 2026Updated 3 weeks ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆39Nov 15, 2022Updated 3 years ago
- This library contains the Kinesis Analytics stream processing runtime configuration classes.☆11Jan 26, 2026Updated 5 months ago
- 🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, by managing your data flows wi…☆940Jun 28, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Example GitHub Actions for Apache Kafka client application development for local and Confluent Cloud☆15Aug 1, 2022Updated 3 years ago
- How do tech companies rank amongst themselves when it comes to github.com activity?☆17May 2, 2021Updated 5 years ago
- A data generator for Apache Druid☆12Mar 26, 2025Updated last year
- 🚀 Example configuration files to help you get started.☆49Jun 22, 2026Updated last week
- Conduit streams data between data stores. Kafka Connect replacement. No JVM required.☆601Updated this week
- Benchmarks to read parquet to arrow☆11Dec 25, 2022Updated 3 years ago
- This a simple Python daemon to monitor your Impala nodes.☆10Apr 13, 2021Updated 5 years ago
- Karapace - Your Apache Kafka® essentials in one tool☆620Jun 23, 2026Updated last week
- A Chainlit App Used to Showcase: Async, Caching, Additional Chainlit Methods, and more!☆11Oct 1, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Package to assert rows in-line with dbt macros.☆72Nov 25, 2025Updated 7 months ago
- ZSH plugin to have Kafka automatic completion for most CLI tools☆68Aug 19, 2022Updated 3 years ago
- A python library bakeoff for medium sized datasets☆24Aug 25, 2023Updated 2 years ago
- The Apache Kafka Operations Platform. 🚀☆156Updated this week
- ☆81Apr 23, 2025Updated last year
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Mar 26, 2025Updated last year
- Net::Kafka - High-performant Perl client for Apache Kafka☆13Sep 8, 2023Updated 2 years ago