tudorv91 / SparkJNILinks
A heterogeneous Apache Spark framework.
☆19Updated 8 years ago
Alternatives and similar repositories for SparkJNI
Users that are interested in SparkJNI are comparing it to the libraries listed below
Sorting:
- Spark GPU and SIMD Support☆61Updated 5 years ago
- Albis: High-Performance File Format for Big Data Systems☆21Updated 7 years ago
- Enabling queries on compressed data.☆281Updated last year
- Provides GPU awareness to Spark, Contact: @kmadhugit and @kiszk☆171Updated 7 years ago
- Apache Parquet☆446Updated last year
- Query processing for an extremely simple, in-memory, columnar database using Apache Arrow to represent tables☆22Updated 4 years ago
- JVM integration for Weld☆16Updated 7 years ago
- Fast I/O plugins for Spark☆41Updated 4 years ago
- Spark Shuffle Optimization with RDMA+AEP☆30Updated 2 years ago
- Java read and write example for Apache Arrow☆33Updated 7 years ago
- Spark SQL index for Parquet tables☆134Updated 4 years ago
- Persistent Adaptive Radix Trees in Java☆82Updated 5 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Updated 9 years ago
- Filter for improving compression of typed binary data.☆231Updated 10 months ago
- Performance Analysis Tool☆77Updated 5 months ago
- Library for organizing batch processing pipelines in Apache Spark☆42Updated 8 years ago
- 4mc - splittable lz4 and zstd in hadoop/spark/flink☆109Updated 2 years ago
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- Drizzle integration with Apache Spark☆120Updated 7 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆130Updated 10 months ago
- Cache File System optimized for columnar formats and object stores☆185Updated 3 years ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆301Updated last year
- libhdfs++ is a modern implementation of HDFS client in C++11 that is designed for the Massive Parallel Processing (MPP) applications.☆28Updated 10 years ago
- Apache Quickstep Incubator - This project is retired☆95Updated 6 years ago
- Mirror of Apache crail (Incubating)☆150Updated 3 years ago
- ☆107Updated 2 years ago
- Parquet file generator☆22Updated 7 years ago
- All the things about TPC-DS in Apache Spark☆108Updated 2 years ago
- A tool to get better debug info on spark's memory usage☆42Updated 6 years ago
- Core C++ Sketch Library☆243Updated this week