TileDB-Inc / TileDB-Spark
Spark interface to the TileDB storage manager
☆15Updated this week
Related projects: ⓘ
- ☆104Updated last year
- JVM integration for Weld☆16Updated 5 years ago
- calcite-arrow-sample(WIP)☆13Updated 6 years ago
- A composable framework for fast and scalable data analytics☆57Updated last year
- Java JNI interface to the TileDB storage engine☆26Updated last week
- Java read and write example for Apache Arrow☆33Updated 6 years ago
- Splittable Gzip codec for Hadoop☆68Updated last week
- Miscellaneous functionality for manipulating Apache Spark RDDs.☆22Updated 5 years ago
- TileDB integrations for machine learning data and model i/o (PyTorch, TensorFlow, Scikit-Learn)☆23Updated 3 weeks ago
- Idempotent query executor☆48Updated 8 months ago
- Mirror of Apache MRQL (Incubating)☆17Updated 7 years ago
- Drizzle integration with Apache Spark☆120Updated 6 years ago
- This repository provides Scotty, a framework for efficient window aggregations for out-of-order Stream Processing.☆75Updated last year
- Rheem - a cross-platform data processing system☆5Updated 2 years ago
- Library for organizing batch processing pipelines in Apache Spark☆41Updated 7 years ago
- ☆39Updated 5 years ago
- Temporal_Graph_library☆25Updated 5 years ago
- Cascading on Apache Flink®☆54Updated 7 months ago
- Peel is a framework that helps you to define, execute, analyze, and share experiments for distributed systems and algorithms.☆27Updated last year
- Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis…☆20Updated 6 months ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…☆14Updated 6 months ago
- ☆42Updated this week
- ☆75Updated 2 weeks ago
- Serializable ACID transactions on streaming data☆22Updated last year
- Apache datasketches☆85Updated last year
- A series of Jupyter notebooks to demonstrate the functionality of Apache Calcite☆52Updated 4 years ago
- A tool to get better debug info on spark's memory usage☆42Updated 5 years ago
- Sketch adaptors for Hive.☆48Updated 2 months ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 3 years ago