Apache Parquet
☆447May 7, 2024Updated last year
Alternatives and similar repositories for parquet-cpp
Users that are interested in parquet-cpp are comparing it to the libraries listed below
Sorting:
- Apache Parquet Format☆2,250Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆16,529Updated this week
- Apache Parquet Java☆3,025Updated this week
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆765Updated this week
- python implementation of the parquet columnar file format.☆358Oct 26, 2021Updated 4 years ago
- python implementation of the parquet columnar file format.☆889Jan 6, 2026Updated last month
- Apache Parquet implementation in Rust☆149Dec 21, 2018Updated 7 years ago
- Real-time Query for Hadoop; mirror of Apache Impala☆34Dec 27, 2022Updated 3 years ago
- Fletcher: A framework to integrate FPGA accelerators with Apache Arrow☆228Aug 11, 2025Updated 6 months ago
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow☆2,753Dec 8, 2025Updated 2 months ago
- Vectorized processing for Apache Arrow☆484Feb 14, 2022Updated 4 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆53Jul 3, 2018Updated 7 years ago
- ☆12Oct 15, 2023Updated 2 years ago
- Mirror of Apache Kudu☆1,898Updated this week
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆653Feb 4, 2026Updated 3 weeks ago
- Thrill - An EXPERIMENTAL Algorithmic Distributed Big Data Batch Processing Framework in C++☆582Aug 30, 2023Updated 2 years ago
- C++11 library for fast fuzzy searching☆15Jun 9, 2015Updated 10 years ago
- HeavyDB (formerly MapD/OmniSciDB)☆3,060Jan 6, 2026Updated last month
- High performance server-side application framework☆9,129Updated this week
- Apache Impala☆1,268Updated this week
- Apache HAWQ☆697May 16, 2024Updated last year
- cuDF - GPU DataFrame Library☆9,498Updated this week
- C++20 idiomatic APIs for the Apache Arrow Columnar Format☆135Feb 9, 2026Updated 2 weeks ago
- Wangle is a framework providing a set of common client/server abstractions for building services in a consistent, modular, and composable…☆3,094Updated this week
- A C++ header-only library for run-time dimensional analysis and unit/quantity manipulation and conversion☆12Dec 4, 2022Updated 3 years ago
- Extremely low-level wrapper to the MediaWiki API☆27Mar 15, 2017Updated 8 years ago
- A composable and fully extensible C++ execution engine library for data management systems.☆4,065Updated this week
- BaikalDB, A Distributed HTAP Database.☆1,233Oct 15, 2025Updated 4 months ago
- High-performance runtime for data analytics applications☆3,007Jun 22, 2022Updated 3 years ago
- Apache Quickstep Incubator - This project is retired☆94Dec 5, 2018Updated 7 years ago
- Supersonic is an ultra-fast, column oriented query engine library written in C++☆206Oct 2, 2020Updated 5 years ago
- Header-only C++ library for writing PCP PMDAs☆16Feb 5, 2019Updated 7 years ago
- Mirror of Apache MADlib☆469Oct 29, 2025Updated 4 months ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆137Feb 9, 2021Updated 5 years ago
- Efficient storage of same-type, uneven-size arrays☆12Aug 5, 2018Updated 7 years ago
- Fast differential coding functions (using SIMD instructions)☆55Dec 8, 2017Updated 8 years ago
- Transmute-free Rust library to work with the Arrow format☆1,069Feb 27, 2024Updated 2 years ago
- C++ native client for Impala and Hive, with Python / pandas bindings☆72Aug 15, 2018Updated 7 years ago
- The x template library☆232Feb 12, 2026Updated 2 weeks ago