arunma / datagen
An easy to use tool to generate fake/dummy data in bulk and export it as JSON, CSV, Avro or directly into your database as tables. Written in Rust.
☆9Updated 5 years ago
Alternatives and similar repositories for datagen:
Users that are interested in datagen are comparing it to the libraries listed below
- Mock streaming data generator☆16Updated 7 months ago
- An example source connector for Kafka Connect, ingesting data from etcd☆11Updated 2 years ago
- Connect DBVisualizer to Hortonwork HiveServer2☆9Updated 9 years ago
- FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...☆18Updated this week
- This is a basic Apache Pinot example for ingesting real-time MySQL change logs using Debezium☆27Updated 4 years ago
- ☆22Updated 5 years ago
- minio as local storage and DynamoDB as catalog☆13Updated 8 months ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Docker image for Apache Hive running on Tez☆7Updated 10 years ago
- Set of tools for creating backups, compaction and restoration of Apache Kafka® Clusters☆19Updated last week
- An interactive CLI tool for managing Kafka topics☆28Updated 5 years ago
- Paper: A Zero-rename committer for object stores☆20Updated 3 years ago
- Is using KoP (Kafka-On-Pulsar) a good idea? Use the scenarios implemented in this repository to check whether Pulsar with KoP enabled is …☆10Updated 2 years ago
- A kubernetes operator for the Open Policy Agent☆16Updated this week
- A command line client for consuming Postgres logical decoding events in the pgoutput format☆11Updated 6 months ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆17Updated 3 years ago
- Testing Scala code with scalatest☆12Updated 2 years ago
- Use SQL to transform your avro schema/records☆28Updated 7 years ago
- Collection of AWS Lambdas for creating and managing Delta tables☆24Updated this week
- Data Catalog for Databases and Data Warehouses☆31Updated last year
- Parquet Command-line Tools☆18Updated 8 years ago
- Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect☆14Updated 3 months ago
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Kubernetes deployments and examples for various streaming SQL implementations☆10Updated 2 years ago
- Stackable Operator for Apache Airflow☆22Updated this week
- ☆13Updated last year
- Dione - a Spark and HDFS indexing library☆50Updated 9 months ago
- Code for the fictitious food delivery company GottaEat used in the Pulsar In Action book☆17Updated 2 years ago