NetApp / NetApp-Hadoop-NFS-Connector
This projects provides a NFSv3 connector for Hadoop. Using the connector, Apache Hadoop and Apache Spark can use NFSv3 server as their storage backend.
☆33Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for NetApp-Hadoop-NFS-Connector
- [Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O☆71Updated 6 years ago
- cephfs-hadoop☆57Updated 3 years ago
- GlusterFS plugin for Hadoop HCFS☆69Updated 2 years ago
- Stocator is high performing connector to object storage for Apache Spark, achieving performance by leveraging object storage semantics.☆113Updated 6 months ago
- RDMA for HDFS☆26Updated 6 years ago
- Mirror of Apache crail (Incubating)☆148Updated 2 years ago
- Spark Terasort☆123Updated last year
- This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid…☆241Updated 5 years ago
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode.☆131Updated 10 months ago
- Testbench for experimenting with Apache Hive at any data scale.☆65Updated 7 years ago
- Mirror of Apache Slider☆79Updated 5 years ago
- DiSNI: Direct Storage and Networking Interface☆186Updated last year
- Large scale query engine benchmark☆99Updated 8 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Updated 8 years ago
- Mirror of Apache Apex malhar☆132Updated 5 years ago
- Hadoop on Mesos☆176Updated 2 years ago
- Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an…☆56Updated 7 years ago
- Code samples for the book☆40Updated 11 years ago
- Quark is a data virtualization engine over analytic databases.☆99Updated 7 years ago
- Running TPC-H on Apache Hive☆41Updated 5 years ago
- Fast I/O plugins for Spark☆41Updated 3 years ago
- Spark Shuffle Optimization with RDMA+AEP☆30Updated last year
- Mirror of Apache Hama☆131Updated 4 years ago
- libhdfs++ is a modern implementation of HDFS client in C++11 that is designed for the Massive Parallel Processing (MPP) applications.☆27Updated 9 years ago
- Scripts to analyze Spark's performance☆136Updated 6 years ago