startreedata / pinot-recipes
This repository contains recipes for Apache Pinot.
☆30Updated last month
Alternatives and similar repositories for pinot-recipes:
Users who are interested in pinot-recipes are comparing it to the repositories listed below:
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated 3 weeks ago
- ☆53Updated 8 months ago
- Yet Another (Spark) ETL Framework☆20Updated last year
- Sample code to collect Apache Iceberg metrics for table monitoring☆26Updated 7 months ago
- A testing framework for Trino☆26Updated 3 weeks ago
- ☆28Updated last year
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆43Updated 2 years ago
- ☆17Updated 2 years ago
- 🌟 Examples of use cases that utilize Decodable, as well as demos for related open-source projects such as Apache Flink, Debezium, and Po…☆71Updated this week
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆62Updated last year
- Presto Trino with Apache Hive Postgres metastore☆41Updated 7 months ago
- ☆39Updated last year
- A Table format agnostic data sharing framework☆38Updated last year
- Spark on Kubernetes using Helm☆34Updated 4 years ago
- Docker environment to stream data from Kafka to Iceberg tables☆26Updated last year
- Unity Catalog UI☆40Updated 7 months ago
- This repository contains a recipe for bootstrapping a climate analysis application using Apache Pinot and Superset☆20Updated 4 years ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆19Updated 2 years ago
- A curated list of awesome lakehouse frameworks, applications, etc.☆24Updated last month
- The Internals of Spark on Kubernetes☆71Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆9Updated last year
- ☆73Updated 3 months ago
- ☆16Updated last year
- Aiven's S3 Sink Connector for Apache Kafka®☆69Updated 7 months ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆62Updated 2 years ago
- Apache Spark-based Data Flow (ETL) framework that supports multiple read and write destinations of different types and also supports multiple…☆26Updated 3 years ago
- ☆25Updated last year
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Updated last year
- ☆40Updated last year