tudorv91 / SparkJNILinks
A heterogeneous Apache Spark framework.
☆19Updated 8 years ago
Alternatives and similar repositories for SparkJNI
Users that are interested in SparkJNI are comparing it to the libraries listed below
Sorting:
- Spark GPU and SIMD Support☆61Updated 5 years ago
- Fast I/O plugins for Spark☆41Updated 4 years ago
- Provides GPU awareness to Spark, Contact: @kmadhugit and @kiszk☆171Updated 7 years ago
- Enabling queries on compressed data.☆281Updated last year
- Apache Parquet☆445Updated last year
- Albis: High-Performance File Format for Big Data Systems☆21Updated 7 years ago
- Persistent Adaptive Radix Trees in Java☆82Updated 5 years ago
- 4mc - splittable lz4 and zstd in hadoop/spark/flink☆109Updated 2 years ago
- JVM integration for Weld☆16Updated 7 years ago
- Query processing for an extremely simple, in-memory, columnar database using Apache Arrow to represent tables☆22Updated 4 years ago
- Spark SQL index for Parquet tables☆134Updated 4 years ago
- Spark Shuffle Optimization with RDMA+AEP☆30Updated 2 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Updated 9 years ago
- Large scale query engine benchmark☆99Updated 9 years ago
- Apache Quickstep Incubator - This project is retired☆94Updated 7 years ago
- Performance Analysis Tool☆78Updated last week
- Filter for improving compression of typed binary data.☆231Updated 11 months ago
- [Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O☆74Updated 7 years ago
- This is the official mirror of the MonetDB Mercurial repository. Please note that we do not accept pull requests on github. The regressio…☆309Updated 5 years ago
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- Mirror of Apache crail (Incubating)☆150Updated 3 years ago
- General purpose C++ library for iZENECloud☆43Updated 10 years ago
- The main Project☆20Updated 9 years ago
- Drizzle integration with Apache Spark☆120Updated 7 years ago
- A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer☆51Updated 2 years ago
- Java read and write example for Apache Arrow☆34Updated 7 years ago
- libhdfs++ is a modern implementation of HDFS client in C++11 that is designed for the Massive Parallel Processing (MPP) applications.☆28Updated 10 years ago
- Library for organizing batch processing pipelines in Apache Spark☆42Updated 8 years ago
- GPU* or SPARK* branches are used for generating GPU code in Tungsten/concact:@kiszk, MLlib branch is used for CUDA-MLlib project/concact:…☆47Updated 8 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆130Updated 11 months ago