wjones127 / arrow-ipc-bench
Testing various methods of moving Arrow data between processes
☆16 · Updated 2 years ago
Alternatives and similar repositories for arrow-ipc-bench
Users who are interested in arrow-ipc-bench are comparing it to the libraries listed below.
- Serverless Python with Ray ☆58 · Updated 2 years ago
- Unified Distributed Execution ☆56 · Updated 11 months ago
- Distributed persistent Task Queue running on Dask ☆38 · Updated 2 years ago
- Ibis Substrait Compiler ☆105 · Updated last week
- Arrow, pydantic style ☆84 · Updated 2 years ago
- A robust DAG implementation for parallel execution ☆70 · Updated last year
- Ray-based Apache Beam runner ☆41 · Updated 2 years ago
- RFC document, tooling and other content related to the dataframe API standard ☆108 · Updated last year
- Flat files, flat land. ☆26 · Updated this week
- ☆38 · Updated this week
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous. ☆18 · Updated 3 years ago
- Python stream processing for analytics ☆40 · Updated 2 months ago
- ☆107 · Updated last week
- A Python-to-SQL transpiler as replacement for Python Pandas ☆48 · Updated 2 years ago
- Pygloo provides Python bindings for Gloo. ☆21 · Updated 2 months ago
- The open-source Useful SDK. One python decorator in the Useful library allows for full observability of Python functions within an ETL. ☆19 · Updated last year
- A cli for spinning up and managing Ray clusters for the Daft Query Engine. ☆13 · Updated 7 months ago
- Coming soon ☆62 · Updated last year
- ☆48 · Updated 2 months ago
- KvikIO - High Performance File IO ☆224 · Updated last week
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks ☆21 · Updated 3 years ago
- Python bindings for UCX ☆140 · Updated last week
- IbisML is a library for building scalable ML pipelines using Ibis. ☆115 · Updated last month
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB ☆35 · Updated 2 years ago
- Python bindings and arrow integration for the rust object_store crate. ☆63 · Updated last year
- Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort. ☆50 · Updated 2 years ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame. ☆302 · Updated last year
- In-Memory Analytics with Apache Arrow, published by Packt ☆104 · Updated 3 weeks ago
- ☆17 · Updated 2 years ago
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver. ☆25 · Updated 2 years ago