fishjoy / spark-alluxio-blockstore
Use AlluxioBlockManager instead of TachyonBlockManager for Spark's off-heap storage.
☆14 Updated 8 years ago
Alternatives and similar repositories for spark-alluxio-blockstore
Users that are interested in spark-alluxio-blockstore are comparing it to the libraries listed below
- A Distributed Matrix Operations Library Built on Top of Spark ☆106 Updated 8 years ago
- Time series and energy data analysis API for Spark. ☆19 Updated 13 years ago
- DWRF file format for Hive ☆77 Updated 6 years ago
- Spark MLlib code optimized to efficiently support sparse data ☆51 Updated 8 years ago
- Fast JVM collection ☆60 Updated 10 years ago
- A fork of cascading patterns, but implemented for trident ☆71 Updated last year
- A streaming / online query processing / analytics engine based on Apache Storm ☆271 Updated 8 years ago
- JVM integration for Weld ☆16 Updated 6 years ago
- Drizzle integration with Apache Spark ☆120 Updated 6 years ago
- C++ APIs for Alluxio (formerly Tachyon) ☆18 Updated 8 years ago
- Joins for skewed datasets in Spark ☆57 Updated 7 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark ☆51 Updated 11 years ago
- Fast I/O plugins for Spark ☆41 Updated 4 years ago
- High performance HBase / Spark SQL engine ☆28 Updated 3 years ago
- My blogs ☆47 Updated 9 years ago
- Spark GPU and SIMD Support ☆61 Updated 5 years ago
- Enabling queries on compressed data. ☆280 Updated last year
- Persistent Adaptive Radix Trees in Java ☆82 Updated 4 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files ☆50 Updated 11 years ago
- Spark Structured Streaming Kafka 0.8 Source Implementation ☆35 Updated 8 years ago
- Running MPICH2 on Yarn ☆114 Updated 7 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing ☆56 Updated 8 years ago
- Spark Terasort ☆121 Updated 2 years ago
- Source code for Flink in Action ☆30 Updated 8 years ago
- A library for financial and time series calculations on Apache Spark ☆28 Updated 9 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams. ☆425 Updated 9 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments. ☆24 Updated 9 years ago
- Trident-ML: A realtime online machine learning library ☆382 Updated last year
- An extension of Yahoo's Benchmarks ☆107 Updated last year
- ScalaIO 2014 Workshop ☆25 Updated 10 years ago