fishjoy / spark-alluxio-blockstore
Use AlluxioBlockManager instead of TachyonBlockManager as Spark's off_heap storage.
☆14Updated 8 years ago
Alternatives and similar repositories for spark-alluxio-blockstore
Users that are interested in spark-alluxio-blockstore are comparing it to the libraries listed below
- A Distributed Matrix Operations Library Built on Top of Spark☆107Updated 8 years ago
- Time series and energy data analysis API for Spark.☆19Updated 13 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- C++ APIs for Alluxio (formerly Tachyon)☆18Updated 8 years ago
- Spark MLlib code optimized to efficiently support sparse data☆51Updated 8 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆63Updated last year
- Spark GPU and SIMD Support☆61Updated 5 years ago
- Drizzle integration with Apache Spark☆120Updated 7 years ago
- ScalaIO 2014 Workshop☆25Updated 10 years ago
- Running MPICH2 on Yarn☆115Updated 8 years ago
- A streaming / online query processing / analytics engine based on Apache Storm☆273Updated 8 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆57Updated 8 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Updated 9 years ago
- JVM integration for Weld☆16Updated 7 years ago
- The main Project☆20Updated 9 years ago
- A library for financial and time series calculations on Apache Spark☆28Updated 9 years ago
- A library to support distributed matrix computation for machine learning and data analysis☆52Updated 7 years ago
- Fast JVM collection☆60Updated 10 years ago
- Persistent Adaptive Radix Trees in Java☆82Updated 5 years ago
- Graph algorithms implemented in GraphX and Spark styles☆15Updated 10 years ago
- High performance HBase / Spark SQL engine☆28Updated 3 years ago
- Llama - Low Latency Application MAster☆34Updated 3 years ago
- Large scale query engine benchmark☆99Updated 9 years ago
- Joins for skewed datasets in Spark☆57Updated 8 years ago
- Flowmix is a flexible event processing engine for Apache Storm. It supports complex correlations of events via sliding/tumbling windows. …☆59Updated 9 years ago
- DWRF file format for Hive☆77Updated 6 years ago
- My blogs☆47Updated 9 years ago
- Fast and efficient batch computation engine for complex analysis and reporting of massive datasets on Hadoop☆244Updated 10 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.☆427Updated 9 years ago
- Reads an HBase table and writes the output as Text, Seq, Avro, or Parquet☆28Updated 11 years ago