TileDB-Inc / TileDB-SparkLinks
Spark interface to the TileDB storage manager [please see README]
☆17Updated last year
Alternatives and similar repositories for TileDB-Spark
Users that are interested in TileDB-Spark are comparing it to the libraries listed below
Sorting:
- ☆108Updated 2 years ago
- Provides GPU awareness to Spark, Contact: @kmadhugit and @kiszk☆172Updated 7 years ago
- Point-in-Time optimizations for Apache Spark☆30Updated 2 years ago
- Interactive-Speed Analytics: 200x Faster, 200x Fewer Cluster Resources, Approximate Query Processing☆253Updated 5 years ago
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆362Updated last week
- Java read and write example for Apache Arrow☆34Updated 8 years ago
- JVM integration for Weld☆16Updated 7 years ago
- Drizzle integration with Apache Spark☆120Updated 7 years ago
- Spark SQL index for Parquet tables☆134Updated 4 years ago
- A tool to get better debug info on spark's memory usage☆42Updated 6 years ago
- Spark ML Lib serving library☆48Updated 7 years ago
- Enabling queries on compressed data.☆282Updated 2 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆72Updated 5 years ago
- Spark Shuffle Optimization with RDMA+AEP☆30Updated 2 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 7 years ago
- All the things about TPC-DS in Apache Spark☆110Updated 2 years ago
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆258Updated 2 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆30Updated 7 years ago
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆130Updated last year
- Use the TPC-DS benchmark to test Spark SQL performance☆184Updated 5 years ago
- Spatial In-Memory Big data Analytics☆123Updated 6 years ago
- A library that provides useful extensions to Apache Spark and PySpark.☆232Updated 3 weeks ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆186Updated 3 months ago
- A tool and library for easily deploying applications on Apache YARN☆146Updated last year
- An experimental Graph Streaming API for Apache Flink☆141Updated 5 years ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆432Updated 4 years ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆94Updated 9 months ago
- An extension of Yahoo's Benchmarks☆109Updated 2 years ago
- Cache File System optimized for columnar formats and object stores☆187Updated 3 years ago