ExpediaGroup / hiveberg
Demonstration of a Hive Input Format for Iceberg
☆26 · Updated 4 years ago
Alternatives and similar repositories for hiveberg
Users who are interested in hiveberg are comparing it to the libraries listed below.
- Spark Connector to read and write with Pulsar ☆117 · Updated 3 weeks ago
- A home for LinkedIn's changes to Apache Iceberg ☆63 · Updated 2 weeks ago
- Example of a tested Apache Flink application. ☆43 · Updated 6 years ago
- Schema Registry integration for Apache Spark ☆40 · Updated 3 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆91 · Updated last year
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol ☆34 · Updated 3 years ago
- ACID Data Source for Apache Spark based on Hive ACID ☆96 · Updated 4 years ago
- An example of using Flink for Fault-Tolerant Stream Processing ☆12 · Updated 7 years ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines ☆17 · Updated 6 years ago
- StreamLine - Streaming Analytics ☆166 · Updated 2 years ago
- Spark Structured Streaming State Tools ☆34 · Updated 5 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table ☆23 · Updated 7 years ago
- Java event logs collector for hadoop and frameworks ☆41 · Updated 10 months ago
- Developing Spark External Data Sources using the V2 API ☆48 · Updated 7 years ago
- Jupyter Integration for Flink SQL via Ververica Platform ☆43 · Updated 2 years ago
- A library for querying Druid data sources with Apache Spark ☆23 · Updated 5 years ago
- A tool to get better debug info on spark's memory usage ☆42 · Updated 6 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines ☆122 · Updated this week
- Utilities for processing Flink checkpoints/savepoints ☆75 · Updated 6 years ago
- Kubernetes Operator for the Ververica Platform ☆35 · Updated 3 years ago
- Extensible streaming ingestion pipeline on top of Apache Spark ☆46 · Updated 6 months ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer. ☆37 · Updated 3 years ago
- Schema Registry ☆17 · Updated last year
- Thoughts on things I find interesting. ☆17 · Updated last year
- Quark is a data virtualization engine over analytic databases. ☆100 · Updated 8 years ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are in… ☆94 · Updated 8 months ago
- Custom state store providers for Apache Spark ☆92 · Updated 11 months ago
- Lab for testing different Flink job latency optimization techniques covered in a Flink Forward 2021 talk ☆27 · Updated 4 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures. ☆73 · Updated 4 years ago
- Kafka Connect FileSystem Connector ☆112 · Updated 3 years ago