datastax / sstable-to-arrow
☆36 Updated last year
Alternatives and similar repositories for sstable-to-arrow
Users who are interested in sstable-to-arrow are comparing it to the libraries listed below.
- ☆105 Updated last year
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of … ☆13 Updated last year
- A dual write proxy for Apache Cassandra ☆25 Updated 2 years ago
- Distributed System Testing as a Service ☆51 Updated 3 months ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients ☆37 Updated 4 years ago
- ☆22 Updated 3 weeks ago
- Demonstration of a Hive Input Format for Iceberg ☆26 Updated 4 years ago
- This repository provides Scotty, a framework for efficient window aggregations for out-of-order Stream Processing. ☆78 Updated last year
- A composable framework for fast and scalable data analytics ☆57 Updated 2 years ago
- ☆85 Updated this week
- Spark* shuffle plugin to support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis… ☆21 Updated last year
- Apache datasketches ☆96 Updated 2 years ago
- Apache Pulsar - distributed pub-sub messaging system ☆13 Updated this week
- Example program that writes Parquet formatted data to plain files (i.e., not Hadoop hdfs); Parquet is a columnar storage format. ☆38 Updated 2 years ago
- The open source, pluggable, nosql benchmarking suite. ☆178 Updated this week
- Apache Calcite Adapter for Apache Kudu ☆28 Updated 8 months ago
- Harry for Apache Cassandra® ☆54 Updated 10 months ago
- Collection of utilities to allow writing java code that operates across a wide range of avro versions. ☆79 Updated last month
- BlobIt - a Distributed Large Object Storage ☆37 Updated last year
- LinkedIn's version of Apache Calcite ☆23 Updated last month
- Distributed Operations and Data Organizer built on Apache BookKeeper ☆28 Updated 3 months ago
- A streaming key-value store implementation using native Flink Streaming operators ☆23 Updated 9 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆29 Updated this week
- Pinterest's simplified and efficient Tiered Storage implementation for Kafka ☆22 Updated last week
- Idempotent query executor ☆52 Updated last month
- Peel is a framework that helps you to define, execute, analyze, and share experiments for distributed systems and algorithms. ☆27 Updated 2 years ago
- JVM integration for Weld ☆16 Updated 6 years ago
- Java binding to Apache DataFusion ☆82 Updated 2 months ago
- Sidecar for Apache Cassandra® ☆46 Updated this week
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL ☆40 Updated 9 months ago