arunma / datagen
An easy to use tool to generate fake/dummy data in bulk and export it as JSON, CSV, Avro or directly into your database as tables. Written in Rust.
☆9Updated 4 years ago
Related projects: ⓘ
- Mock streaming data generator☆14Updated 3 months ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆28Updated 4 years ago
- ☆22Updated 5 years ago
- This repository contains recipes for Apache Pinot.☆23Updated 3 weeks ago
- Connect DBVisualizer to Hortonwork HiveServer2☆9Updated 9 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated last year
- Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect☆12Updated 6 months ago
- A Java connector for delta.io/sharing/ that allows you to easily ingest data on any JVM.☆13Updated 5 months ago
- Data Catalog for Databases and Data Warehouses☆31Updated 8 months ago
- KSQL Syntax Highlighting for VSCode☆16Updated last year
- Paper: A Zero-rename committer for object stores☆20Updated 3 years ago
- Extensible streaming ingestion pipeline on top of Apache Spark☆43Updated 5 months ago
- Kubernetes deployments and examples for various streaming SQL implementations☆10Updated 2 years ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Updated last year
- Use SQL to transform your avro schema/records☆28Updated 6 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 4 years ago
- Events about the open source data stack☆13Updated 2 years ago
- This is a basic Apache Pinot example for ingesting real-time MySQL change logs using Debezium☆27Updated 3 years ago
- 💻 CLI for reporting events to Faros platform☆14Updated 2 weeks ago
- Parquet Command-line Tools☆18Updated 7 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 3 years ago
- FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...☆15Updated this week
- Testing Scala code with scalatest☆11Updated last year
- The sane way of building a data layer in Airflow☆24Updated 4 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆49Updated 8 months ago
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 9 years ago
- Apache Airflow CI pipeline☆18Updated 5 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆53Updated last year
- An example source connector for Kafka Connect, ingesting data from etcd☆11Updated 2 years ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆16Updated 3 years ago