datastax / sstable-to-arrow
☆35Updated last year
Alternatives and similar repositories for sstable-to-arrow:
Users that are interested in sstable-to-arrow are comparing it to the libraries listed below
- ☆104Updated last year
- ☆79Updated 3 weeks ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆36Updated 3 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 3 years ago
- A composable framework for fast and scalable data analytics☆57Updated 2 years ago
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of …☆13Updated 7 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated last week
- Dione - a Spark and HDFS indexing library☆51Updated 10 months ago
- Distributed System Testing as a Service☆51Updated last week
- Harry for Apache Cassandra®☆54Updated 5 months ago
- Delta reader for the Ray open-source toolkit for building ML applications☆43Updated last year
- Apache datasketches☆93Updated 2 years ago
- This repository provides Scotty, a framework for efficient window aggregations for out-of-order Stream Processing.☆76Updated last year
- ☆22Updated 5 years ago
- Point-in-Time optimizations for Apache Spark☆29Updated last year
- In-Memory Analytics with Apache Arrow, published by Packt☆94Updated last year
- Collection of utilities to allow writing java code that operates across a wide range of avro versions.☆77Updated this week
- Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis…☆21Updated 10 months ago
- Apache Pulsar - distributed pub-sub messaging system☆15Updated last week
- an anagram☆134Updated 3 years ago
- Ibis Substrait Compiler☆98Updated this week
- Albis: High-Performance File Format for Big Data Systems☆21Updated 6 years ago
- An arrow flight extension to support ticking datasets via IPC☆21Updated 2 months ago
- ☆30Updated this week
- Java binding to Apache DataFusion☆74Updated this week
- The open source, pluggable, nosql benchmarking suite.☆172Updated this week
- An Apache Pulsar® sink for transferring events/messages from Pulsar topics to Apache Cassandra®, DataStax Astra or DataStax Enterprise (D…☆14Updated 3 months ago
- Apache Arrow Cookbook☆100Updated this week
- Parquet file generator☆22Updated 6 years ago
- Measuring the performance of popular streaming engines with Yahoo's Streaming Benchmark☆53Updated 5 years ago