fishjoy / spark-alluxio-blockstore
Use AlluxioBlockManager instead of TachyonBlockManager for Spark's off_heap storage.
☆14 · Updated 9 years ago
Alternatives and similar repositories for spark-alluxio-blockstore
Users interested in spark-alluxio-blockstore are comparing it to the libraries listed below.
- Time series and energy data analysis API for Spark. ☆19 · Updated 13 years ago
- A Distributed Matrix Operations Library Built on Top of Spark ☆107 · Updated 9 years ago
- Spark MLlib code optimized to efficiently support sparse data ☆51 · Updated 9 years ago
- Spark GPU and SIMD Support ☆61 · Updated 5 years ago
- C++ APIs for Alluxio (formerly Tachyon) ☆18 · Updated 9 years ago
- JVM integration for Weld ☆16 · Updated 7 years ago
- Fast JVM collection ☆60 · Updated 10 years ago
- My blogs ☆47 · Updated 9 years ago
- X-Trace is a tool that provides fine-grained visibility into large, complex distributed systems. It can be used by application developers… ☆28 · Updated 11 years ago
- [Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O ☆74 · Updated 7 years ago
- Fast I/O plugins for Spark ☆41 · Updated 5 years ago
- Drizzle integration with Apache Spark ☆120 · Updated 7 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark ☆51 · Updated 11 years ago
- A fork of cascading patterns, but implemented for trident ☆71 · Updated 2 years ago
- Running MPICH2 on Yarn ☆116 · Updated 8 years ago
- Enabling queries on compressed data. ☆281 · Updated 2 years ago
- A streaming / online query processing / analytics engine based on Apache Storm ☆273 · Updated 8 years ago
- DWRF file format for Hive ☆77 · Updated 7 years ago
- Spark Terasort ☆121 · Updated 2 years ago
- An extension of Yahoo's Benchmarks ☆108 · Updated 2 years ago
- Simple Spark Application ☆76 · Updated 2 years ago
- Joins for skewed datasets in Spark ☆57 · Updated 8 years ago
- *Experimental* GraphChi-DB graph database with computational capabilities ☆79 · Updated 10 years ago
- ScalaIO 2014 Workshop ☆25 · Updated 11 years ago
- Persistent Adaptive Radix Trees in Java ☆82 · Updated 5 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing ☆57 · Updated 8 years ago
- Large scale query engine benchmark ☆99 · Updated 9 years ago
- A library for financial and time series calculations on Apache Spark ☆28 · Updated 9 years ago
- A library to support distributed matrix computation for machine learning and data analysis ☆53 · Updated 7 years ago
- TeraSort for Spark and Flink which uses a range partitioner based on sampling ☆22 · Updated 9 years ago