datastax / sstable-to-arrow
☆37 Updated last year
Alternatives and similar repositories for sstable-to-arrow:
Users who are interested in sstable-to-arrow are comparing it to the libraries listed below.
- ☆82 Updated this week
- Demonstration of a Hive Input Format for Iceberg ☆26 Updated 4 years ago
- ☆105 Updated last year
- A dual write proxy for Apache Cassandra ☆25 Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆28 Updated 2 weeks ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients ☆36 Updated 4 years ago
- Apache datasketches ☆94 Updated 2 years ago
- Measuring the performance of popular streaming engines with Yahoo's Streaming Benchmark ☆53 Updated 5 years ago
- The open source, pluggable, nosql benchmarking suite. ☆175 Updated this week
- Spark-Cassandra Bulk Reader CASSANDRA-16222 ☆22 Updated last year
- ☆18 Updated last week
- Data Sketches for Apache Spark ☆22 Updated 2 years ago
- Apache Pulsar - distributed pub-sub messaging system ☆15 Updated last week
- Albis: High-Performance File Format for Big Data Systems ☆21 Updated 6 years ago
- PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp ☆86 Updated last month
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of … ☆13 Updated 8 months ago
- ☆36 Updated this week
- Harry for Apache Cassandra® ☆54 Updated 6 months ago
- Create Apache Cassandra lab environments in AWS ☆16 Updated last week
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL ☆39 Updated 5 months ago
- Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis… ☆21 Updated 11 months ago
- Point-in-Time optimizations for Apache Spark ☆29 Updated last year
- Idempotent query executor ☆51 Updated last week
- Distributed tests for Apache Cassandra® ☆54 Updated last month
- A composable framework for fast and scalable data analytics ☆57 Updated 2 years ago
- Distributed System Testing as a Service ☆51 Updated 3 weeks ago
- Multi-hop declarative data pipelines ☆111 Updated last week
- Dione - a Spark and HDFS indexing library ☆52 Updated 11 months ago
- Java/Scala library for easily authoring Flyte tasks and workflows ☆43 Updated last month
- An Apache Pulsar® sink for transferring events/messages from Pulsar topics to Apache Cassandra®, DataStax Astra or DataStax Enterprise (D… ☆14 Updated 3 weeks ago