startreedata / pinot-recipesLinks
This repository contains recipes for Apache Pinot.
☆30Updated 3 months ago
Alternatives and similar repositories for pinot-recipes
Users that are interested in pinot-recipes are comparing it to the libraries listed below
Sorting:
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated 2 weeks ago
- ☆58Updated 10 months ago
- ☆13Updated last year
- A Table format agnostic data sharing framework☆38Updated last year
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆9Updated last year
- Code snippets used in demos recorded for the blog.☆37Updated last week
- The Internals of Spark on Kubernetes☆71Updated 3 years ago
- Docker envinroment to stream data from Kafka to Iceberg tables☆29Updated last year
- Minimal example to run Trino, Minio, and Hive standalone metastore on docker☆52Updated 3 years ago
- Yet Another (Spark) ETL Framework☆21Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- Spark on Kubernetes using Helm☆34Updated 5 years ago
- ☆17Updated 3 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆28Updated 10 months ago
- resources for trying out a nessie-flink-iceberg setup☆11Updated last year
- Presto Trino with Apache Hive Postgres metastore☆42Updated 9 months ago
- ☆40Updated 2 years ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated last year
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆98Updated 2 years ago
- minio as local storage and DynamoDB as catalog☆15Updated last year
- Unity Catalog UI☆40Updated 9 months ago
- Code for the fictitious food delivery company GottaEat used in the Pulsar In Action book☆18Updated 3 years ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆20Updated 3 years ago
- ☆89Updated 5 months ago
- ☆18Updated last year
- pulsar lakehouse connector☆34Updated 2 months ago
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- This is a basic Apache Pinot example for ingesting real-time MySQL change logs using Debezium☆27Updated 4 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 2 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆59Updated last year