Demonstration of a Hive Input Format for Iceberg
☆26Mar 12, 2021Updated 4 years ago
Alternatives and similar repositories for hiveberg
Users who are interested in hiveberg are comparing it to the libraries listed below.
- Hortonworks Data Platform Data Generation Tool☆13Nov 30, 2017Updated 8 years ago
- Building custom data sources for Apache Spark, in Java.☆12Oct 12, 2020Updated 5 years ago
- Fybrik platform - Arrow/Flight module☆15Aug 10, 2024Updated last year
- Recipes and examples for Apache Spark☆13Jan 21, 2015Updated 11 years ago
- Hadoop utility to compact small files☆18Feb 16, 2026Updated last week
- Db2 JDBC connector for Trino☆19Jan 6, 2023Updated 3 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Sep 26, 2017Updated 8 years ago
- Flink SQL in Practice - Chinese-language blog column☆16Jun 17, 2022Updated 3 years ago
- Example of a tested Apache Flink application.☆43Jul 10, 2019Updated 6 years ago
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …☆22Feb 6, 2017Updated 9 years ago
- Parquet file generator☆22Apr 17, 2018Updated 7 years ago
- Highly configurable Helm Presto Chart☆24Nov 13, 2019Updated 6 years ago
- Data sets and Vagrant script to provision a virtual machine for Apache Calcite development☆30Mar 24, 2023Updated 2 years ago
- Hive + Avro. Serde for working with Avro in Hive☆59Dec 16, 2023Updated 2 years ago
- Java client for managing Apache Flink via REST API☆57Aug 23, 2025Updated 6 months ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆130Dec 19, 2024Updated last year
- Data sets and ML models versioning example from DVC get started☆10Jun 4, 2024Updated last year
- Clear, practical guidance on applying Akka☆31Jan 17, 2022Updated 4 years ago
- Apache Pig plugin for Eclipse☆12Feb 28, 2017Updated 9 years ago
- ☆19Oct 23, 2025Updated 4 months ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Sep 8, 2022Updated 3 years ago
- Kubernetes Operator for the Ververica Platform☆35Jan 19, 2023Updated 3 years ago
- Fast I/O plugins for Spark☆41Dec 14, 2020Updated 5 years ago
- A stateful distributed balance service with persistent entities via sharding in persistence mode☆13Oct 26, 2018Updated 7 years ago
- Code Samples for my Ververica Webinar "99 Ways to Enrich Streaming Data with Apache Flink"☆41Jan 4, 2022Updated 4 years ago
- ☆11Dec 10, 2015Updated 10 years ago
- ☆10Aug 13, 2021Updated 4 years ago
- ☆15Nov 26, 2024Updated last year
- A K8s operator for managing the lifecycle of Kafka Connect connectors☆10May 21, 2024Updated last year
- An exploration of Flink and change-data-capture via flink-cdc-connectors☆11Jul 7, 2021Updated 4 years ago
- Case study using dotfurther's Open Discover Platform with the RavenDB document store to rapidly create a full-text search/eDiscovery/info…☆12May 28, 2024Updated last year
- ☆13Feb 12, 2026Updated 2 weeks ago
- ☆18Jul 26, 2022Updated 3 years ago
- FIO Load Testing framework - for Openshift☆11Jun 8, 2023Updated 2 years ago
- seckill (flash sale) project [PRC]☆10Apr 13, 2019Updated 6 years ago
- The official NodeJS driver for the Cyton board over Serial.☆16Feb 18, 2019Updated 7 years ago
- openEHR Clinical modelling tooling setup☆10Jun 24, 2018Updated 7 years ago
- Integration of Iceberg table management into Spark SQL☆11Jan 21, 2020Updated 6 years ago
- A timer module for Redis☆11Oct 16, 2019Updated 6 years ago