startreedata / pinot-recipes
This repository contains recipes for Apache Pinot.
☆30Updated 7 months ago
Alternatives and similar repositories for pinot-recipes
Users who are interested in pinot-recipes are comparing it to the libraries listed below.
- ☆59Updated last year
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated last week
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated 2 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆76Updated this week
- A Table format agnostic data sharing framework☆39Updated last year
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Updated last year
- Materials (slides and code) for Kafka and Kafka Streams Workshops☆62Updated last year
- Spark on Kubernetes using Helm☆34Updated 5 years ago
- The Internals of Spark on Kubernetes☆71Updated 3 years ago
- ☆97Updated 8 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆100Updated 2 years ago
- Docker environment to stream data from Kafka to Iceberg tables☆30Updated last year
- ☆43Updated last year
- Magic to help Spark pipelines upgrade☆34Updated last year
- Apache Flink Guide☆58Updated 3 years ago
- Aiven's S3 Sink Connector for Apache Kafka®☆71Updated last year
- ❤for real-time DataOps - where the application and data fabric blends - Lenses☆159Updated 2 weeks ago
- Presto Trino with Apache Hive Postgres metastore☆43Updated last year
- Cloud Spanner Connector for Apache Spark☆17Updated 8 months ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆67Updated 3 years ago
- Multi-hop declarative data pipelines☆118Updated last week
- type-class based data cleansing library for Apache Spark SQL☆78Updated 6 years ago
- An implementation of the DatasourceV2 interface of Apache Spark™ for writing Spark Datasets to Apache Druid™.☆43Updated last week
- 📚 Tech blogs & talks by companies that run Apache Flink in production☆174Updated last month
- Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!☆233Updated 8 months ago
- The official repository for the Rock the JVM Spark Optimization 2 course☆40Updated last year
- 🌟 Examples of use cases that utilize Decodable, as well as demos for related open-source projects such as Apache Flink, Debezium, and Po…☆84Updated 3 months ago
- A Kafka Serde that reads and writes records from and to Blob storage (S3, Azure, Google) transparently.☆62Updated 3 weeks ago
- ☆80Updated 5 months ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆96Updated 2 weeks ago