datastax / sstable-to-arrowLinks
☆36Updated 2 years ago
Alternatives and similar repositories for sstable-to-arrow
Users that are interested in sstable-to-arrow are comparing it to the libraries listed below
Sorting:
- ☆107Updated 2 years ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Updated 4 years ago
- A composable framework for fast and scalable data analytics☆57Updated 3 years ago
- ☆95Updated this week
- Apache datasketches☆102Updated 3 weeks ago
- The open source, pluggable, nosql benchmarking suite.☆187Updated last week
- This repository provides Scotty, a framework for efficient window aggregations for out-of-order Stream Processing.☆79Updated 2 years ago
- Multi-hop declarative data pipelines☆122Updated last week
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆88Updated 2 months ago
- Java binding to Apache DataFusion☆83Updated 8 months ago
- Idempotent query executor☆53Updated 8 months ago
- Graph Analytics with Apache Kafka☆106Updated this week
- A dual write proxy for Apache Cassandra☆26Updated 3 years ago
- Distributed System Testing as a Service☆52Updated 9 months ago
- A home for LinkedIn's changes to Apache Iceberg☆63Updated this week
- an anagram☆136Updated 4 years ago
- Apache Pinot Documentation☆27Updated last week
- Albis: High-Performance File Format for Big Data Systems☆21Updated 7 years ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆301Updated last year
- Measuring the performance of popular streaming engines with Yahoo's Streaming Benchmark☆53Updated 6 years ago
- An arrow flight extension to support ticking datasets via IPC☆28Updated 3 weeks ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆65Updated 2 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 4 years ago
- Distributed tests for Apache Cassandra®☆55Updated last week
- A library for Spark DataFrame using MinIO Select API☆99Updated 6 years ago
- Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.☆51Updated 3 years ago
- DDSketch: A Fast and Fully-Mergeable Quantile Sketch with Relative-Error Guarantees.☆126Updated 3 months ago
- Cache File System optimized for columnar formats and object stores☆187Updated 3 years ago
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of …☆13Updated last year
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Updated last year