haohui / libhdfspp
libhdfs++ is a modern implementation of an HDFS client in C++11, designed for Massively Parallel Processing (MPP) applications.
☆27 · Updated 9 years ago
Alternatives and similar repositories for libhdfspp:
Users who are interested in libhdfspp are comparing it to the libraries listed below:
- Mirror of Apache Omid Incubator ☆88 · Updated last week
- A flexible database focused on performance and scalability ☆115 · Updated 15 years ago
- Fast I/O plugins for Spark ☆41 · Updated 4 years ago
- A native C/C++ HDFS client (downstream fork from apache-hawq) ☆40 · Updated 6 months ago
- cephfs-hadoop ☆57 · Updated 4 years ago
- Apache Tephra: Transactions for HBase. ☆157 · Updated 5 months ago
- Hannibal is a tool to help monitor and maintain HBase clusters that are configured for manual splitting. ☆172 · Updated 7 years ago
- Mirror of Apache Slider ☆78 · Updated 6 years ago
- Apache Quickstep Incubator - This project is retired ☆94 · Updated 6 years ago
- Low latency, strong consistency, fault tolerant distributed key value store. Colocate data and compute to achieve best performance cloud … ☆114 · Updated 9 years ago
- Quark is a data virtualization engine over analytic databases. ☆98 · Updated 7 years ago
- Mirror of Apache crail (Incubating) ☆149 · Updated 2 years ago
- The main Project ☆20 · Updated 8 years ago
- Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an… ☆56 · Updated 7 years ago
- Spark Terasort ☆123 · Updated last year
- [Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O ☆72 · Updated 6 years ago
- Bitmap compression using the CONCISE algorithm ☆43 · Updated 8 years ago
- RocksDB made replicated using Robust Distributed System Nucleus (rDSN) (Delta Learning) ☆16 · Updated 9 years ago
- Mirror of Apache Hama ☆131 · Updated 5 years ago
- Spark SQL index for Parquet tables ☆134 · Updated 3 years ago
- Cache File System optimized for columnar formats and object stores ☆183 · Updated 2 years ago
- Spark Shuffle Optimization with RDMA+AEP ☆30 · Updated last year
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode. ☆131 · Updated last year
- Robust Distributed System Nucleus (rDSN) is an open framework for quickly building and managing high performance and robust distributed s… ☆33 · Updated 6 years ago
- Code samples for the book ☆40 · Updated 11 years ago
- Albis: High-Performance File Format for Big Data Systems ☆21 · Updated 6 years ago
- Mirror of Apache HCatalog ☆60 · Updated last year
- A streaming / online query processing / analytics engine based on Apache Storm ☆271 · Updated 7 years ago
- Large scale query engine benchmark ☆99 · Updated 8 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics. ☆110 · Updated 2 years ago