datastax / sstable-to-arrowLinks
☆36Updated last year
Alternatives and similar repositories for sstable-to-arrow
Users that are interested in sstable-to-arrow are comparing it to the libraries listed below
Sorting:
- ☆106Updated 2 years ago
- Apache datasketches☆97Updated 2 years ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Updated 4 years ago
- A dual write proxy for Apache Cassandra☆25Updated 2 years ago
- Java binding to Apache DataFusion☆82Updated 3 months ago
- ☆86Updated this week
- A composable framework for fast and scalable data analytics☆57Updated 2 years ago
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of …☆13Updated last year
- Measuring the performance of popular streaming engines with Yahoo's Streaming Benchmark☆53Updated 6 years ago
- PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp☆86Updated 2 weeks ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 4 years ago
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17☆59Updated last year
- Distributed System Testing as a Service☆51Updated 3 months ago
- Graph Analytics with Apache Kafka☆104Updated last week
- Idempotent query executor☆52Updated 2 months ago
- This repository provides Scotty, a framework for efficient window aggregations for out-of-order Stream Processing.☆78Updated last year
- Amundsen Gremlin☆21Updated 2 years ago
- Harry for Apache Cassandra®☆54Updated 10 months ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- Albis: High-Performance File Format for Big Data Systems☆21Updated 7 years ago
- Multi-hop declarative data pipelines☆117Updated last month
- Example program that writes Parquet formatted data to plain files (i.e., not Hadoop hdfs); Parquet is a columnar storage format.☆38Updated 2 years ago
- The Internals of PySpark☆26Updated 6 months ago
- Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.☆50Updated 2 years ago
- Point-in-Time optimizations for Apache Spark☆30Updated last year
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Updated 6 months ago
- LinkedIn's version of Apache Calcite☆23Updated this week
- an anagram☆136Updated 3 years ago
- The open source, pluggable, nosql benchmarking suite.☆178Updated this week
- ☆45Updated 2 weeks ago