NetApp / NetApp-Hadoop-NFS-Connector
This project provides an NFSv3 connector for Hadoop. Using the connector, Apache Hadoop and Apache Spark can use an NFSv3 server as their storage backend.
☆33 Updated 8 years ago
Alternatives and similar repositories for NetApp-Hadoop-NFS-Connector:
Users that are interested in NetApp-Hadoop-NFS-Connector are comparing it to the libraries listed below
- cephfs-hadoop ☆57 Updated 4 years ago
- [Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O ☆72 Updated 7 years ago
- GlusterFS plugin for Hadoop HCFS ☆69 Updated 3 years ago
- Fast I/O plugins for Spark ☆41 Updated 4 years ago
- Stocator is a high-performing connector to object storage for Apache Spark, achieving performance by leveraging object storage semantics. ☆114 Updated 11 months ago
- Spark Terasort ☆122 Updated 2 years ago
- Performance Analysis Tool ☆76 Updated 2 years ago
- Testbench for experimenting with Apache Hive at any data scale. ☆64 Updated 7 years ago
- Spark Shuffle Optimization with RDMA+AEP ☆30 Updated last year
- Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an… ☆56 Updated 7 years ago
- Mirror of Apache crail (Incubating) ☆150 Updated 2 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments. ☆24 Updated 8 years ago
- Code samples for the book ☆40 Updated 11 years ago
- Large scale query engine benchmark ☆99 Updated 9 years ago
- Storm on Mesos! ☆138 Updated 3 years ago
- Mirror of Apache Hama ☆131 Updated 5 years ago
- Cascading on Apache Flink® ☆54 Updated last year
- Quark is a data virtualization engine over analytic databases. ☆98 Updated 7 years ago
- RDMA for HDFS ☆27 Updated 6 years ago
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode. ☆131 Updated last year
- Hadoop on Mesos ☆175 Updated 2 years ago
- CDAP Applications ☆43 Updated 7 years ago
- Mirror of Apache Apex malhar ☆132 Updated 5 years ago
- Apache Tephra: Transactions for HBase. ☆157 Updated 7 months ago
- libhdfs++ is a modern implementation of the HDFS client in C++11, designed for Massively Parallel Processing (MPP) applications. ☆27 Updated 9 years ago
- Fast and efficient batch computation engine for complex analysis and reporting of massive datasets on Hadoop ☆243 Updated 9 years ago
- DiSNI: Direct Storage and Networking Interface ☆191 Updated 2 years ago
- hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format ☆126 Updated 3 years ago
- Mirror of Apache Slider ☆77 Updated 6 years ago
- A Tez dev-setup for HDP2 sandbox ☆21 Updated 2 years ago