datastax / sstable-to-arrow
☆36Updated last year
Alternatives and similar repositories for sstable-to-arrow:
Users that are interested in sstable-to-arrow are comparing it to the libraries listed below
- ☆105Updated last year
- An arrow flight extension to support ticking datasets via IPC☆23Updated 5 months ago
- Distributed System Testing as a Service☆51Updated last month
- ☆84Updated this week
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆36Updated 4 years ago
- Idempotent query executor☆51Updated last month
- Harry for Apache Cassandra®☆54Updated 8 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated 3 weeks ago
- A dual write proxy for Apache Cassandra☆25Updated 2 years ago
- Apache datasketches☆95Updated 2 years ago
- Point-in-Time optimizations for Apache Spark☆29Updated last year
- TLA+ specs for table formats☆19Updated 6 months ago
- ☆13Updated 2 years ago
- Albis: High-Performance File Format for Big Data Systems☆21Updated 6 years ago
- Collection of utilities to allow writing java code that operates across a wide range of avro versions.☆78Updated last month
- DDSketch: A Fast and Fully-Mergeable Quantile Sketch with Relative-Error Guarantees.☆119Updated 10 months ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 4 years ago
- An Apache Pulsar® sink for transferring events/messages from Pulsar topics to Apache Cassandra®, DataStax Astra or DataStax Enterprise (D…☆14Updated 2 months ago
- Friendly ML feature store☆45Updated 2 years ago
- A highly available and infinitely scalable, drop-in replacement for Kafka Streams☆17Updated this week
- Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis…☆21Updated last year
- This repository provides Scotty, a framework for efficient window aggregations for out-of-order Stream Processing.☆77Updated last year
- Union, intersection, and set cardinality in loglog space☆56Updated last year
- The open source, pluggable, nosql benchmarking suite.☆176Updated this week
- Distributed Operations and Data Organizer built on Apache BookKeeper☆28Updated last month
- Apache Pulsar - distributed pub-sub messaging system☆15Updated last week
- Multi-hop declarative data pipelines☆112Updated last week
- The Internals of Apache Kafka☆54Updated last year
- BlobIt - a Distributed Large Object Storage☆37Updated last year
- Java binding to Apache DataFusion☆76Updated last week