apache / incubator-nemoLinks
Apache Nemo (Incubating) - Data Processing System for Flexible Employment With Different Deployment Characteristics
☆112Updated 3 months ago
Alternatives and similar repositories for incubator-nemo
Users that are interested in incubator-nemo are comparing it to the libraries listed below
Sorting:
- Mirror of Apache REEF☆96Updated 3 years ago
- Haeinsa is linearly scalable multi-row, multi-table transaction library for HBase☆158Updated 8 years ago
- Cruise: A Distributed Machine Learning Framework with Automatic System Configuration☆26Updated 6 years ago
- Collection of command-line tools for HBase☆60Updated last year
- Mirror of Apache S2Graph (Incubating)☆271Updated 5 years ago
- Mirror of Apache Tajo☆134Updated 5 years ago
- MIST: High-performance IoT Stream Processing☆17Updated 6 years ago
- Nemo: A flexible data processing system☆21Updated 7 years ago
- Mirror of Apache crail (Incubating)☆150Updated 3 years ago
- A set of commands for managing CDH clusters using Cloudera Manager REST API.☆33Updated 2 years ago
- ☆24Updated 6 years ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Updated 2 years ago
- Spark Shuffle Optimization with RDMA+AEP☆30Updated 2 years ago
- ☆18Updated 7 years ago
- Apache datasketches☆99Updated 2 years ago
- Less-Resilient MapReduce framework for Go☆36Updated last year
- A visual dashboard of HBase region statistics☆107Updated 2 years ago
- 내맘대로 alluxio 정리중☆11Updated 6 years ago
- ☆14Updated 3 years ago
- Movie review dataset Word2Vec & sentiment classification Zeppelin notebook☆26Updated 8 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆128Updated 9 months ago
- Scalable distributed log storage for strong consistency, total order, and high availability☆50Updated last week
- Repository for Practical Data Pipeline Code☆11Updated 3 years ago
- Lakehouse storage system benchmark☆76Updated 2 years ago
- Cache File System optimized for columnar formats and object stores☆184Updated 3 years ago
- ☆35Updated last year
- A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer☆51Updated last year
- All the things about TPC-DS in Apache Spark☆107Updated 2 years ago
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode.☆133Updated last year
- Spark Terasort☆121Updated 2 years ago