Parquet / parquet-compatibilityLinks
compatibility tests to make sur C and Java implementations can read each other
☆70Updated 3 years ago
Alternatives and similar repositories for parquet-compatibility
Users that are interested in parquet-compatibility are comparing it to the libraries listed below
Sorting:
- Example programs and scripts for accessing parquet files☆30Updated 7 years ago
- An Open Source unit test framework for Hive queries based on JUnit 4 and 5☆261Updated 11 months ago
- Kite SDK☆393Updated 3 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆279Updated 6 years ago
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode.☆134Updated last year
- Hive SerDe for CSV☆140Updated 4 years ago
- Mirror of Apache Apex core☆350Updated 4 years ago
- Real²time Exploratory Analytics on Large Datasets☆121Updated 5 years ago
- PostgreSQL protocol gateway for Presto distributed SQL query engine☆293Updated 2 years ago
- hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE☆295Updated 2 years ago
- Mirror of Apache Atlas (Incubating)☆95Updated 2 years ago
- Apache Avro RPC Quick Start.☆412Updated last year
- Quark is a data virtualization engine over analytic databases.☆100Updated 8 years ago
- Remedy small files by combining them into larger ones.☆194Updated 3 years ago
- Mirror of Apache Knox☆207Updated this week
- Mirror of Apache Slider☆77Updated 7 years ago
- Visualize your HDFS cluster usage☆228Updated 5 years ago
- Cache File System optimized for columnar formats and object stores☆186Updated 3 years ago
- A tool to install, configure and manage Presto installations☆171Updated 2 years ago
- Iceberg is a table format for large, slow-moving tabular data☆484Updated 2 years ago
- Hannibal is tool to help monitor and maintain HBase-Clusters that are configured for manual splitting.☆172Updated 7 years ago
- Airlift framework for building REST services☆626Updated this week
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆91Updated last year
- Druid indexing plugin for using Spark in batch jobs☆101Updated 4 years ago
- Schema Registry☆17Updated last year
- StreamLine - Streaming Analytics☆164Updated 2 years ago
- Easily make RESTful web services for time series reporting with Big Data analytics engines like Druid and SQL Databases.☆175Updated 2 years ago
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover…☆516Updated 5 years ago
- The SpliceSQL Engine☆170Updated 2 years ago
- Example code for Kudu☆77Updated 6 years ago