cylondata / cylon
Cylon is a fast, scalable, distributed-memory parallel runtime with a Pandas-like DataFrame.
☆301 Updated 11 months ago
Alternatives and similar repositories for cylon
Users who are interested in cylon are comparing it to the libraries listed below.
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆218 Updated this week
- Distributed SQL Engine in Python using Dask ☆405 Updated 9 months ago
- RAPIDS GPU-BDB ☆108 Updated last year
- Vectorized processing for Apache Arrow ☆485 Updated 3 years ago
- Pandas ExtensionDType/Array backed by Apache Arrow ☆230 Updated 2 years ago
- Ibis Substrait Compiler ☆102 Updated this week
- Core C++ Sketch Library ☆233 Updated last week
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries. ☆337 Updated last month
- ☆105 Updated last year
- Python bindings for UCX ☆135 Updated this week
- Flow with FlorDB 🌻 ☆155 Updated last month
- Apache Parquet ☆444 Updated last year
- Distributed XGBoost on Ray ☆148 Updated 11 months ago
- Unified Distributed Execution ☆53 Updated 7 months ago
- Distributed SQL Query Engine in Python using Ray ☆243 Updated 8 months ago
- [ARCHIVED] C GPU DataFrame Library ☆138 Updated 6 years ago
- Point-in-Time optimizations for Apache Spark ☆30 Updated last year
- Utilities for Dask and CUDA interactions ☆306 Updated this week
- [ARCHIVED] Dask support for distributed GDF object --> Moved to cudf ☆136 Updated 5 years ago
- An Aspiring Drop-In Replacement for Pandas at Scale ☆75 Updated 3 years ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.