datafusion-contrib / datafusion-bigtableLinks
Bigtable data source for Apache Arrow DataFusion
☆23Updated 2 years ago
Alternatives and similar repositories for datafusion-bigtable
Users that are interested in datafusion-bigtable are comparing it to the libraries listed below
Sorting:
- HDFS based on Java implementation as a remote ObjectStore for DataFusion☆10Updated last year
- Postgres protocol frontend for DataFusion☆63Updated last week
- ☆22Updated 3 years ago
- Experimental support for serializing DataFusion plans using substrait☆45Updated 2 years ago
- JSON support for DataFusion (unofficial)☆42Updated last week
- A purely experimental DuckDB Deltalake extension☆95Updated this week
- Python binding for DataFusion☆59Updated 2 years ago
- DataFusion TableProviders for reading data from other systems☆125Updated last week
- S3 as an ObjectStore for DataFusion☆63Updated 2 years ago
- Incremental view maintenance & query rewriting for materialized views in DataFusion☆38Updated this week
- Batteries included CLI, TUI, and server implementations for DataFusion.☆156Updated this week
- ☆55Updated last year
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆135Updated this week
- Pure Rust Iceberg Implementation☆163Updated 10 months ago
- Embeddable Aggregate Management System for Streams and Queries.☆92Updated 2 months ago
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆201Updated this week
- ☆33Updated last month
- Data pipeline example written in Rust with Polars and DataFusion DataFrame package☆41Updated 2 years ago
- Apache DataFusion Ray☆203Updated 2 months ago
- ☆28Updated this week
- Tantivy directory implementation backed by object_store☆33Updated last year
- A Minimalistic Rust Implementation of Delta Sharing Server.☆92Updated 3 months ago
- In a nutshell, EinsteinDB is a persistent indexing scheme based off of LSH-KVX that exploits the distinct merits of hash index and B+-Tre…☆26Updated 2 years ago
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆54Updated last year
- A User-Defined Function Framework for Apache Arrow.☆95Updated 3 weeks ago
- SQLBench Runners☆13Updated last year
- Serverless query engine☆140Updated 2 years ago
- An open-source, community-driven REST catalog for Apache Iceberg!☆28Updated last year
- ☆21Updated last year
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆12Updated 4 months ago