Parquet / parquet-compatibilityLinks
compatibility tests to make sur C and Java implementations can read each other
☆69Updated 3 years ago
Alternatives and similar repositories for parquet-compatibility
Users that are interested in parquet-compatibility are comparing it to the libraries listed below
Sorting:
- Kite SDK☆394Updated 2 years ago
- Example programs and scripts for accessing parquet files☆30Updated 7 years ago
- Real²time Exploratory Analytics on Large Datasets☆122Updated 5 years ago
- The SpliceSQL Engine☆169Updated 2 years ago
- Hannibal is tool to help monitor and maintain HBase-Clusters that are configured for manual splitting.☆172Updated 7 years ago
- Mirror of Apache Apex core☆349Updated 4 years ago
- Hive SerDe for CSV☆140Updated 4 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆279Updated 6 years ago
- Mirror of Apache Atlas (Incubating)☆94Updated 2 years ago
- Quark is a data virtualization engine over analytic databases.☆98Updated 8 years ago
- Druid indexing plugin for using Spark in batch jobs☆101Updated 3 years ago
- Schema Registry☆16Updated last year
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover…☆516Updated 5 years ago
- An Open Source unit test framework for Hive queries based on JUnit 4 and 5☆256Updated 6 months ago
- Mirror of Apache Lens☆60Updated 5 years ago
- Mirror of Apache Crunch (Incubating)☆105Updated 4 years ago
- PostgreSQL protocol gateway for Presto distributed SQL query engine☆292Updated 2 years ago
- Mirror of Apache DataFu☆119Updated last month
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆282Updated 6 years ago
- Mirror of Apache Knox☆201Updated this week
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode.☆131Updated last year
- Examples on how to use the command line tools in Avro Tools to read and write Avro files☆154Updated last year
- Cache File System optimized for columnar formats and object stores☆182Updated 2 years ago
- Sample UDF and UDAs for Impala.☆63Updated 5 years ago
- Mirror of Apache Apex malhar☆132Updated 5 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- Mirror of Apache Tajo☆134Updated 5 years ago
- Mirror of Apache Slider☆77Updated 6 years ago
- StreamLine - Streaming Analytics☆164Updated last year
- Apache Avro RPC Quick Start.☆411Updated 9 months ago