Parquet / parquet-compatibility
Compatibility tests to make sure C and Java implementations can read each other
☆70 Updated 3 years ago
Alternatives and similar repositories for parquet-compatibility
Users that are interested in parquet-compatibility are comparing it to the libraries listed below
- Example programs and scripts for accessing parquet files ☆30 Updated 7 years ago
- Kite SDK ☆393 Updated 3 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses. ☆279 Updated 6 years ago
- Mirror of Apache Apex core ☆350 Updated 4 years ago
- Quark is a data virtualization engine over analytic databases. ☆100 Updated 8 years ago
- Druid indexing plugin for using Spark in batch jobs ☆101 Updated 4 years ago
- The SpliceSQL Engine ☆170 Updated 2 years ago
- An Open Source unit test framework for Hive queries based on JUnit 4 and 5 ☆260 Updated 10 months ago
- Cache File System optimized for columnar formats and object stores ☆185 Updated 3 years ago
- Mirror of Apache Slider ☆77 Updated 6 years ago
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode. ☆134 Updated last year
- Cask Hydrator Plugins Repository ☆68 Updated 2 weeks ago
- Mirror of Apache Atlas (Incubating) ☆95 Updated 2 years ago
- StreamLine - Streaming Analytics ☆165 Updated 2 years ago
- Schema Registry ☆17 Updated last year
- Code style for Airlift projects ☆66 Updated last year
- Hive SerDe for CSV ☆140 Updated 4 years ago
- PostgreSQL protocol gateway for Presto distributed SQL query engine ☆292 Updated 2 years ago
- Hannibal is a tool to help monitor and maintain HBase clusters that are configured for manual splitting. ☆172 Updated 7 years ago
- Mirror of Apache Knox ☆207 Updated 3 weeks ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆91 Updated last year
- A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid. ☆129 Updated 10 months ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap… ☆302 Updated 3 weeks ago
- Mirror of Apache DataFu ☆120 Updated 6 months ago
- Apache Avro RPC Quick Start. ☆411 Updated last year
- hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE ☆295 Updated 2 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit… ☆281 Updated 7 years ago
- Mirror of Apache Falcon ☆104 Updated 6 years ago
- Complex Event Processing on top of Kafka Streams ☆312 Updated last year
- Visualize your HDFS cluster usage ☆228 Updated 5 years ago