TileDB-Inc / TileDB-SparkLinks
Spark interface to the TileDB storage manager [please see README]
☆17Updated last year
Alternatives and similar repositories for TileDB-Spark
Users that are interested in TileDB-Spark are comparing it to the libraries listed below
Sorting:
- Provides GPU awareness to Spark, Contact: @kmadhugit and @kiszk☆171Updated 7 years ago
- Drizzle integration with Apache Spark☆120Updated 7 years ago
- Enabling queries on compressed data.☆280Updated 2 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 7 years ago
- Java read and write example for Apache Arrow☆34Updated 8 years ago
- JVM integration for Weld☆16Updated 7 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆30Updated 7 years ago
- A tool and library for easily deploying applications on Apache YARN☆145Updated last year
- Interactive-Speed Analytics: 200x Faster, 200x Fewer Cluster Resources, Approximate Query Processing☆250Updated 4 years ago
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆355Updated this week
- ☆107Updated 2 years ago
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆258Updated 2 years ago
- Use the TPC-DS benchmark to test Spark SQL performance☆183Updated 5 years ago
- Miscellaneous functionality for manipulating Apache Spark RDDs.☆22Updated 6 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆71Updated 5 years ago
- Spatial In-Memory Big data Analytics☆123Updated 6 years ago
- Spark SQL index for Parquet tables☆134Updated 4 years ago
- All the things about TPC-DS in Apache Spark☆108Updated 2 years ago
- Spark ML Lib serving library☆48Updated 7 years ago
- Common library for serving TensorFlow, XGBoost and scikit-learn models in production.☆143Updated 2 years ago
- An experimental Graph Streaming API for Apache Flink☆141Updated 5 years ago
- ☆107Updated 3 years ago
- A tool to get better debug info on spark's memory usage☆42Updated 6 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆130Updated last year
- Spark Shuffle Optimization with RDMA+AEP☆30Updated 2 years ago
- Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.☆320Updated 7 months ago
- Alchemist: an Apache Spark<->MPI interface☆26Updated 7 years ago
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- Distributed Temporal Graph Analytics with Apache Flink☆250Updated this week