Command line tool for inspecting Parquet files
☆396Aug 19, 2024Updated last year
Alternatives and similar repositories for pqrs
Users that are interested in pqrs are comparing it to the libraries listed below
Sorting:
- Transmute-free Rust library to work with the Arrow format☆1,069Feb 27, 2024Updated 2 years ago
- Quickly view your data☆347Updated this week
- A collection of handy CLI tools to convert CSV and JSON to Apache Arrow and Parquet☆204Mar 1, 2026Updated last week
- Boring Data Tool☆241Mar 21, 2024Updated last year
- Apache DataFusion Ballista Distributed Query Engine☆1,988Updated this week
- Zero-copy reading of Arrow data from WebAssembly☆125Aug 28, 2025Updated 6 months ago
- Apache DataFusion SQL Query Engine☆8,476Updated this week
- An Apache Arrow-backed file format for pre-projected, pre-triangulated maps, including dot density algorithms and regl visualization.☆18Feb 10, 2023Updated 3 years ago
- Rust-based WebAssembly bindings to read and write Apache Parquet data☆641Feb 3, 2026Updated last month
- A native Rust library for Delta Lake, with bindings into Python☆3,160Mar 2, 2026Updated last week
- A trait-based system for creating async Wakers. Because RawWakers are healthier after they've been cooked.☆21Dec 23, 2020Updated 5 years ago
- Playing with volumetric rendering in rust☆19Mar 17, 2021Updated 4 years ago
- easy install parquet-tools☆184Jul 9, 2024Updated last year
- Official Rust implementation of Apache Arrow☆3,388Updated this week
- Command-line utility which poll on remote addresses in order to perform status checks periodically☆13Mar 4, 2021Updated 5 years ago
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,151Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,478Updated this week
- A standard Bloom Filter implementation☆24Aug 26, 2020Updated 5 years ago
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…☆1,719Updated this week
- Do select-group-by on csv and other text files☆14Apr 23, 2022Updated 3 years ago
- An MPMC journaled broadcast channel☆13Sep 9, 2020Updated 5 years ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆191Feb 23, 2026Updated 2 weeks ago
- Apache Iceberg☆1,234Updated this week
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆25Sep 8, 2023Updated 2 years ago
- an open source data platform, developed in the Korcsmaros Group to store, analyze and integrate bioinformatics data☆12Feb 19, 2026Updated 2 weeks ago
- A pushdown automaton low memory JSON bytes stream checker☆13Dec 24, 2021Updated 4 years ago
- A command line program for large scale buffering between piped programs☆16Nov 19, 2021Updated 4 years ago
- A playground for running duckdb as a stateless query engine over a data lake☆218Jan 10, 2024Updated 2 years ago
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive AI workloads.☆1,181Updated this week
- general functions for your data .pipe()-lines.☆17Nov 8, 2023Updated 2 years ago
- ClaMSA (Classify Multiple Sequence Alignments).☆13Nov 21, 2024Updated last year
- Rust implementation of Stream VByte decompression algorithm☆15May 23, 2023Updated 2 years ago
- ☆15Feb 16, 2026Updated 3 weeks ago
- Build and publish crates with pyo3, cffi and uniffi bindings as well as rust binaries as python packages☆5,448Updated this week
- PRQL as a DuckDB extension☆318Sep 22, 2025Updated 5 months ago
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆428May 5, 2025Updated 10 months ago
- DuckDB extension to read files within zip archives.☆57Updated this week
- DuckDB extension to read and write to SQLite databases☆265Feb 17, 2026Updated 3 weeks ago
- Apache arrow examples in golang☆16Apr 27, 2021Updated 4 years ago