Apache Parquet
☆447May 7, 2024Updated last year
Alternatives and similar repositories for parquet-cpp
Users that are interested in parquet-cpp are comparing it to the libraries listed below
Sorting:
- Apache Parquet Format☆2,296Mar 11, 2026Updated last week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆16,597Updated this week
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆766Mar 12, 2026Updated last week
- Apache Parquet Java☆3,036Updated this week
- Fletcher: A framework to integrate FPGA accelerators with Apache Arrow☆229Aug 11, 2025Updated 7 months ago
- Apache Parquet implementation in Rust☆149Dec 21, 2018Updated 7 years ago
- python implementation of the parquet columnar file format.☆890Updated this week
- Real-time Query for Hadoop; mirror of Apache Impala☆34Dec 27, 2022Updated 3 years ago
- python implementation of the parquet columnar file format.☆359Oct 26, 2021Updated 4 years ago
- C++11 library for fast fuzzy searching☆15Jun 9, 2015Updated 10 years ago
- Vectorized processing for Apache Arrow☆484Feb 14, 2022Updated 4 years ago
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow☆2,753Dec 8, 2025Updated 3 months ago
- Thrill - An EXPERIMENTAL Algorithmic Distributed Big Data Batch Processing Framework in C++☆583Aug 30, 2023Updated 2 years ago
- Mirror of Apache Kudu☆1,898Updated this week
- Apache Impala☆1,270Updated this week
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆53Jul 3, 2018Updated 7 years ago
- HeavyDB (formerly MapD/OmniSciDB)☆3,060Jan 6, 2026Updated 2 months ago
- C++ native client for Impala and Hive, with Python / pandas bindings☆72Aug 15, 2018Updated 7 years ago
- ☆12Oct 15, 2023Updated 2 years ago
- High performance server-side application framework☆9,159Updated this week
- Apache Quickstep Incubator - This project is retired☆94Dec 5, 2018Updated 7 years ago
- cuDF - GPU DataFrame Library☆9,558Updated this week
- A composable and fully extensible C++ execution engine library for data management systems.☆4,079Updated this week
- Apache HAWQ☆696May 16, 2024Updated last year
- Skyhook Data Management: Storage and management of tabular data in Ceph.☆13Oct 19, 2020Updated 5 years ago
- A C++ header-only library for run-time dimensional analysis and unit/quantity manipulation and conversion☆12Dec 4, 2022Updated 3 years ago
- Fast differential coding functions (using SIMD instructions)☆55Dec 8, 2017Updated 8 years ago
- High-performance runtime for data analytics applications☆3,003Jun 22, 2022Updated 3 years ago
- Interactive performance benchmarking in Jupyter☆33Dec 2, 2024Updated last year
- High-performance dictionary coding☆110Apr 5, 2017Updated 8 years ago
- BaikalDB, A Distributed HTAP Database.☆1,236Feb 26, 2026Updated 3 weeks ago
- Wangle is a framework providing a set of common client/server abstractions for building services in a consistent, modular, and composable…☆3,094Updated this week
- This repository has moved:☆10Mar 17, 2016Updated 10 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆137Feb 9, 2021Updated 5 years ago
- a small C++ lattice library☆15Jan 9, 2020Updated 6 years ago
- A fast compressor/decompressor☆6,548Mar 6, 2026Updated 2 weeks ago
- Feature-complete typeclasses for C++☆11Jul 5, 2020Updated 5 years ago
- Supersonic is an ultra-fast, column oriented query engine library written in C++☆206Oct 2, 2020Updated 5 years ago
- A C++ library to compress and intersect sorted lists of integers using SIMD instructions☆445Jul 7, 2025Updated 8 months ago