datastax / sstable-to-arrow
☆36Updated last year
Alternatives and similar repositories for sstable-to-arrow
Users that are interested in sstable-to-arrow are comparing it to the libraries listed below
Sorting:
- ☆84Updated this week
- Java binding to Apache DataFusion☆80Updated last month
- Apache datasketches☆95Updated 2 years ago
- ☆105Updated last year
- An arrow flight extension to support ticking datasets via IPC☆23Updated 6 months ago
- Example program that writes Parquet formatted data to plain files (i.e., not Hadoop hdfs); Parquet is a columnar storage format.☆38Updated 2 years ago
- Measuring the performance of popular streaming engines with Yahoo's Streaming Benchmark☆53Updated 6 years ago
- Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis…☆21Updated last year
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆36Updated 4 years ago
- A composable framework for fast and scalable data analytics☆57Updated 2 years ago
- Point-in-Time optimizations for Apache Spark☆30Updated last year
- Distributed System Testing as a Service☆51Updated last month
- Harry for Apache Cassandra®☆54Updated 8 months ago
- A highly available and infinitely scalable, drop-in replacement for Kafka Streams☆16Updated this week
- Collection of utilities to allow writing java code that operates across a wide range of avro versions.☆78Updated last week
- Friendly ML feature store☆45Updated 2 years ago
- Lakehouse storage system benchmark☆73Updated 2 years ago
- ☆19Updated this week
- DDSketch: A Fast and Fully-Mergeable Quantile Sketch with Relative-Error Guarantees.☆120Updated 2 weeks ago
- This repository provides Scotty, a framework for efficient window aggregations for out-of-order Stream Processing.☆77Updated last year
- RAPIDS Accelerator JNI For Apache Spark☆48Updated this week
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17☆54Updated 11 months ago
- LinkedIn's version of Apache Calcite☆22Updated 6 months ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 4 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 3 years ago
- Apache Calcite Adapter for Apache Kudu☆28Updated 7 months ago
- A dual write proxy for Apache Cassandra☆25Updated 2 years ago
- Thoughts on things I find interesting.☆17Updated 4 months ago
- Accord library for Apache Cassandra®☆71Updated this week
- Mirror of Apache Calcite☆12Updated 3 weeks ago