andygrove / ballista
Distributed compute platform implemented in Rust, using Apache Arrow memory model.
☆19 Updated 4 years ago
Alternatives and similar repositories for ballista
Users who are interested in ballista are comparing it to the libraries listed below.
- Apache Spark Connect Client for Rust ☆112 Updated 3 months ago
- Apache DataFusion Python Bindings ☆502 Updated this week
- Unofficial Rust implementation of Apache Iceberg with integration for DataFusion ☆216 Updated this week
- Quickly view your data ☆327 Updated this week
- A highly efficient daemon for streaming data from Kafka into Delta Lake ☆413 Updated 4 months ago
- A native Delta implementation for integration with any query engine ☆261 Updated this week
- Apache DataFusion Ballista Distributed Query Engine ☆1,845 Updated last week
- Next-generation decentralized data lakehouse and a multi-party stream processing network ☆326 Updated this week
- Boring Data Tool ☆232 Updated last year
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆82 Updated 11 months ago
- DataFusion TableProviders for reading data from other systems ☆144 Updated this week
- Apache Iceberg ☆1,086 Updated this week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible. ☆144 Updated 3 weeks ago
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings. ☆252 Updated last week
- A Rust DataFrame implementation, built on Apache Arrow ☆280 Updated 4 years ago
- Apache DataFusion Ray ☆219 Updated last month
- Batteries included CLI, TUI, and server implementations for DataFusion. ☆164 Updated 2 months ago
- A command line tool to query an ODBC data source and write the result into a parquet file. ☆241 Updated this week
- Transmute-free Rust library to work with the Arrow format ☆1,063 Updated last year
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆268 Updated 11 months ago
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive AI workloads. ☆948 Updated this week
- Fill Apache Arrow record batches from an ODBC data source in Rust. ☆72 Updated 2 weeks ago
- Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow ☆369 Updated last year
- Distributed SQL Query Engine in Python using Ray ☆244 Updated 11 months ago
- A Python library to run analytics workloads with the performance of Rust, the flexibility of Python and O(1) cost in moving data between … ☆61 Updated 4 years ago
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust. ☆896 Updated this week
- Python binding for DataFusion ☆59 Updated 3 years ago
- A Minimalistic Rust Implementation of Delta Sharing Server. ☆92 Updated 6 months ago
- ☆70 Updated 8 months ago
- Harmonious distributed data analysis in Rust. ☆482 Updated 4 years ago