ryanphuang / liballuxio
C++ APIs for Alluxio (formerly Tachyon)
☆18Updated 8 years ago
Alternatives and similar repositories for liballuxio:
Users who are interested in liballuxio are comparing it to the libraries listed below.
- libhdfs++ is a modern implementation of an HDFS client in C++11, designed for Massively Parallel Processing (MPP) applications.☆27Updated 9 years ago
- Spark Terasort☆123Updated last year
- Llama - Low Latency Application MAster☆34Updated 2 years ago
- Testbench for experimenting with Apache Hive at any data scale.☆64Updated 7 years ago
- Fast I/O plugins for Spark☆41Updated 4 years ago
- Code samples for the book☆40Updated 11 years ago
- Mirror of Apache crail (Incubating)☆149Updated 2 years ago
- Use AlluxioBlockManager instead of TachyonBlockManager as Spark's off_heap storage.☆14Updated 8 years ago
- Mirror of Apache Slider☆78Updated 6 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆56Updated 7 years ago
- [Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O☆71Updated 6 years ago
- Cascading on Apache Flink®☆54Updated 11 months ago
- Moved to Apache Mnemonic (Incubator)☆20Updated 8 years ago
- Flink performance tests☆28Updated 5 years ago
- Hannibal is a tool to help monitor and maintain HBase clusters that are configured for manual splitting.☆172Updated 7 years ago
- Drizzle integration with Apache Spark☆120Updated 6 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 8 years ago
- cephfs-hadoop☆57Updated 4 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 10 years ago
- The main Project☆20Updated 8 years ago
- Bitmap compression using the CONCISE algorithm☆43Updated 7 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 8 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- Fast JVM collection☆59Updated 9 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆127Updated last month
- A simple storm performance/stress test☆74Updated last year
- Druid indexing plugin for using Spark in batch jobs☆101Updated 3 years ago
- Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an…☆56Updated 7 years ago
- DWRF file format for Hive☆77Updated 6 years ago
- Apache Tephra: Transactions for HBase.☆157Updated 4 months ago